A primary transcript is the single-stranded ribonucleic acid (RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs are modified in preparation for translation. For example, a precursor mRNA (pre-mRNA) is a type of primary transcript that becomes a messenger RNA (mRNA) after processing.
Pre-mRNA is synthesized from a DNA template in the cell nucleus by transcription. Pre-mRNA comprises the bulk of heterogeneous nuclear RNA (hnRNA). Once pre-mRNA has been completely processed, it is termed "mature messenger RNA", or simply "messenger RNA". The term hnRNA is often used as a synonym for pre-mRNA, although, in the strict sense, hnRNA may include nuclear RNA transcripts that do not end up as cytoplasmic mRNA.
There are several steps contributing to the production of primary transcripts. All these steps involve a series of interactions to initiate and complete the transcription of DNA in the nucleus of eukaryotes. Certain factors play key roles in the activation and inhibition of transcription, where they regulate primary transcript production. Transcription produces primary transcripts that are further modified by several processes. These processes include the 5' cap, 3'-polyadenylation, and alternative splicing. In particular, alternative splicing directly contributes to the diversity of mRNA found in cells. The modifications of primary transcripts have been further studied in research seeking greater knowledge of the role and significance of these transcripts. Experimental studies based on molecular changes to primary transcripts and the processes before and after transcription have led to greater understanding of diseases involving primary transcripts.
Production
Main article: Transcription (genetics)The steps contributing to the production of primary transcripts involve a series of molecular interactions that initiate transcription of DNA within a cell's nucleus. Based on the needs of a given cell, certain DNA sequences are transcribed to produce a variety of RNA products to be translated into functional proteins for cellular use. To initiate the transcription process in a cell's nucleus, DNA double helices are unwound and hydrogen bonds connecting compatible nucleic acids of DNA are broken to produce two unconnected single DNA strands. One strand of the DNA template is used for transcription of the single-stranded primary transcript mRNA. This DNA strand is bound by an RNA polymerase at the promoter region of the DNA.
In eukaryotes, three kinds of RNA—rRNA, tRNA, and mRNA—are produced based on the activity of three distinct RNA polymerases, whereas, in prokaryotes, only one RNA polymerase exists to create all kinds of RNA molecules. RNA polymerase II of eukaryotes transcribes the primary transcript, a transcript destined to be processed into mRNA, from the antisense DNA template in the 5' to 3' direction, and this newly synthesized primary transcript is complementary to the antisense strand of DNA. RNA polymerase II constructs the primary transcript using a set of four specific ribonucleoside monophosphate residues (adenosine monophosphate (AMP), cytidine monophosphate (CMP), guanosine monophosphate (GMP), and uridine monophosphate (UMP)) that are added continuously to the 3' hydroxyl group on the 3' end of the growing mRNA.
Studies of primary transcripts produced by RNA polymerase II reveal that an average primary transcript is 7,000 nucleotides in length, with some growing as long as 20,000 nucleotides in length. The inclusion of both exon and intron sequences within primary transcripts explains the size difference between larger primary transcripts and smaller, mature mRNA ready for translation into protein.
Regulation
A number of factors contribute to the activation and inhibition of transcription and therefore regulate the production of primary transcripts from a given DNA template.
Activation of RNA polymerase activity to produce primary transcripts is often controlled by sequences of DNA called enhancers. Transcription factors, proteins that bind to DNA elements to either activate or repress transcription, bind to enhancers and recruit enzymes that alter nucleosome components, causing DNA to be either more or less accessible to RNA polymerase. The unique combinations of either activating or inhibiting transcription factors that bind to enhancer DNA regions determine whether or not the gene that enhancer interacts with is activated for transcription or not. Activation of transcription depends on whether or not the transcription elongation complex, itself consisting of a variety of transcription factors, can induce RNA polymerase to dissociate from the Mediator complex that connects an enhancer region to the promoter.
Inhibition of RNA polymerase activity can also be regulated by DNA sequences called silencers. Like enhancers, silencers may be located at locations farther up or downstream from the genes they regulate. These DNA sequences bind to factors that contribute to the destabilization of the initiation complex required to activate RNA polymerase, and therefore inhibit transcription.
Histone modification by transcription factors is another key regulatory factor for transcription by RNA polymerase. In general, factors that lead to histone acetylation activate transcription while factors that lead to histone deacetylation inhibit transcription. Acetylation of histones induces repulsion between negative components within nucleosomes, allowing for RNA polymerase access. Deacetylation of histones stabilizes tightly coiled nucleosomes, inhibiting RNA polymerase access. In addition to acetylation patterns of histones, methylation patterns at promoter regions of DNA can regulate RNA polymerase access to a given template. RNA polymerase is often incapable of synthesizing a primary transcript if the targeted gene's promoter region contains specific methylated cytosines— residues that hinder binding of transcription-activating factors and recruit other enzymes to stabilize a tightly bound nucleosome structure, excluding access to RNA polymerase and preventing the production of primary transcripts.
R-loops
R-loops are formed during transcription. An R-loop is a three-stranded nucleic acid structure containing a DNA-RNA hybrid region and an associated non-template single-stranded DNA. Actively transcribed regions of DNA often form R-loops that are vulnerable to DNA damage. Introns reduce R-loop formation and DNA damage in highly expressed yeast genes.
Transcription stress
DNA damages arise in each cell, every day, with the number of damages in each cell reaching tens to hundreds of thousands, and such DNA damages can impede primary transcription. The process of gene expression itself is a source of endogenous DNA damages resulting from the susceptibility of single-stranded DNA to damage. Other sources of DNA damage are conflicts of the primary transcription machinery with the DNA replication machinery, and the activity of certain enzymes such as topoisomerases and base excision repair enzymes. Even though these processes are tightly regulated and are usually accurate, occasionally they can make mistakes and leave behind DNA breaks that drive chromosomal rearrangements or cell death.
RNA processing
Main article: Post-transcriptional modificationTranscription, a highly regulated phase in gene expression, produces primary transcripts. However, transcription is only the first step which should be followed by many modifications that yield functional forms of RNAs. Otherwise stated, the newly synthesized primary transcripts are modified in several ways to be converted to their mature, functional forms to produce different proteins and RNAs such as mRNA, tRNA, and rRNA.
Processing
The basic primary transcript modification process is similar for tRNA and rRNA in both eukaryotic and prokaryotic cells. On the other hand, primary transcript processing varies in mRNAs of prokaryotic and eukaryotic cells. For example, some prokaryotic bacterial mRNAs serve as templates for synthesis of proteins at the same time they are being produced via transcription. Alternatively, pre-mRNA of eukaryotic cells undergo a wide range of modifications prior to their transport from the nucleus to cytoplasm where their mature forms are translated. These modifications are responsible for the different types of encoded messages that lead to translation of various types of products. Furthermore, primary transcript processing provides a control for gene expression as well as a regulatory mechanism for the degradation rates of mRNAs. The processing of pre-mRNA in eukaryotic cells includes 5' capping, 3' polyadenylation, and alternative splicing.
5' capping
Main article: Five-prime capShortly after transcription is initiated in eukaryotes, a pre-mRNA's 5' end is modified by the addition of a 7-methylguanosine cap, also known as a 5' cap. The 5' capping modification is initiated by the addition of a GTP to the 5' terminal nucleotide of the pre-mRNA in reverse orientation followed by the addition of methyl groups to the G residue. 5' capping is essential for the production of functional mRNAs since the 5' cap is responsible for aligning the mRNA with the ribosome during translation.
Polyadenylation
Main article: PolyadenylationIn eukaryotes, polyadenylation further modifies pre-mRNAs during which a structure called the poly-A tail is added. Signals for polyadenylation, which include several RNA sequence elements, are detected by a group of proteins which signal the addition of the poly-A tail (approximately 200 nucleotides in length). The polyadenylation reaction provides a signal for the end of transcription and this reaction ends approximately a few hundred nucleotides downstream from the poly-A tail location.
Alternative splicing
Main article: Alternative splicingEukaryotic pre-mRNAs have their introns spliced out by spliceosomes made up of small nuclear ribonucleoproteins.
In complex eukaryotic cells, one primary transcript is able to prepare large amounts of mature mRNAs due to alternative splicing. Alternative splicing is regulated so that each mature mRNA may encode a multiplicity of proteins.
The effect of alternative splicing in gene expression can be seen in complex eukaryotes which have a fixed number of genes in their genome yet produce much larger numbers of different gene products. Most eukaryotic pre-mRNA transcripts contain multiple introns and exons. The various possible combinations of 5' and 3' splice sites in a pre-mRNA can lead to different excision and combination of exons while the introns are eliminated from the mature mRNA. Thus, various kinds of mature mRNAs are generated. Alternative splicing takes place in a large protein complex called the spliceosome. Alternative splicing is crucial for tissue-specific and developmental regulation in gene expression. Alternative splicing can be affected by various factors, including mutations such as chromosomal translocation.
In prokaryotes, splicing is done by autocatalytic cleavage or by endolytic cleavage. Autocatalytic cleavages, in which no proteins are involved, are usually reserved for sections that code for rRNA, whereas endolytic cleavage corresponds to tRNA precursors.
Research
5-Fluorouracil (FUra) exposure in methotrexate-resistant KB cells led to a two-fold reduction in total dihydrofolate reductase (DHFR) mRNA levels, while the level of DHFR pre-mRNA with certain introns remained unaffected. The half-life of DHFR mRNA or pre-mRNA did not change significantly, but the transition rate of DHFR RNA from the nucleus to the cytoplasm decreased, suggesting that FUra may influence mRNA processing and/or nuclear DHFR mRNA stability.
In Drosophila and Aedes, hnRNA (pre-mRNA) size was larger in Aedes due to its larger genome, despite both species producing mature mRNA of similar size and sequence complexity. This indicates that hnRNA size increases with genome size.
In HeLa cells, spliceosome groups on pre-mRNA were found to form within nuclear speckles, with this formation being temperature-dependent and influenced by specific RNA sequences. Pre-mRNA targeting and splicing factor loading in speckles were critical for spliceosome group formation, resulting in a speckled pattern.
Recruiting pre-mRNA to nuclear speckles significantly increased splicing efficiency and protein levels, indicating that proximity to speckles enhances splicing efficiency.
Related diseases
Research has also led to greater knowledge about certain diseases related to changes within primary transcripts. One study involved estrogen receptors and differential splicing. The article entitled, "Alternative splicing of the human estrogen receptor alpha primary transcript: mechanisms of exon skipping" by Paola Ferro, Alessandra Forlani, Marco Muselli and Ulrich Pfeffer from the laboratory of Molecular Oncology at National Cancer Research Institute in Genoa, Italy, explains that 1785 nucleotides of the region in the DNA that codes for the estrogen receptor alpha (ER-alpha) are spread over a region that holds more than 300,000 nucleotides in the primary transcript. Splicing of this pre-mRNA frequently leads to variants or different kinds of the mRNA lacking one or more exons or regions necessary for coding proteins. These variants have been associated with breast cancer progression. In the life cycle of retroviruses, proviral DNA is incorporated in transcription of the DNA of the cell being infected. Since retroviruses need to change their pre-mRNA into DNA so that this DNA can be integrated within the DNA of the host it is affecting, the formation of that DNA template is a vital step for retrovirus replication. Cell type, the differentiation or changed state of the cell, and the physiological state of the cell, result in a significant change in the availability and activity of certain factors necessary for transcription. These variables create a wide range of viral gene expression. For example, tissue culture cells actively producing infectious virions of avian or murine leukemia viruses (ASLV or MLV) contain such high levels of viral RNA that 5–10% of the mRNA in a cell can be of viral origin. This shows that the primary transcripts produced by these retroviruses do not always follow the normal path to protein production and convert back into DNA in order to multiply and expand.
See also
References
- ^ Strachan T, Read AP (January 2004). Human Molecular Genetics 3. Garland Science. pp. 16–17. ISBN 978-0-8153-4184-0.
- ^ Alberts B (1994). "RNA Synthesis and RNA Processing". Molecular Biology of the Cell (3rd ed.). New York: Garland Science – via NCBI.
- Griffiths AJ (2000). "An Introduction to Genetic Analysis". NCBI. New York: W.H. Freeman.
- ^ Gilbert SF (15 July 2013). Developmental Biology. Sinauer Associates, Incorporated. pp. 38–39, 50. ISBN 978-1-60535-173-5.
- Brown TA (2002). "Assembly of the Transcription Initiation Complex". Genomes (2nd ed.). Oxford: Wiley-Liss.
- Lodish H (2008). Molecular Cell Biology. W. H. Freeman. pp. 303–306. ISBN 978-0-7167-7601-7.
- Bonnet A, Grosso AR, Elkaoutari A, Coleno E, Presle A, Sridhara SC, et al. (August 2017). "Introns Protect Eukaryotic Genomes from Transcription-Associated Genetic Instability". Molecular Cell. 67 (4): 608–621.e6. doi:10.1016/j.molcel.2017.07.002. PMID 28757210.
- ^ Milano L, Gautam A, Caldecott KW (January 2024). "DNA damage and transcription stress". Mol Cell. 84 (1): 70–79. doi:10.1016/j.molcel.2023.11.014. PMID 38103560. This article incorporates text available under the CC BY 4.0 license.
- ^ Cooper GM (2000). "RNA Processing and Turnover". The Cell: A Molecular Approach (2nd ed.). Sunderland (MA): Sinauer Associates; 2000.
- Weaver RF (2005). Molecular Biology. New York, NY: McGraw-Hill. pp. 432–448. ISBN 0-07-284611-9.
- Wahl MC, Will CL, Lührmann R (February 2009). "The spliceosome: design principles of a dynamic RNP machine". Cell. 136 (4): 701–18. doi:10.1016/j.cell.2009.02.009. hdl:11858/00-001M-0000-000F-9EAB-8. PMID 19239890. S2CID 21330280.
- Will CL, Dolnick BJ (December 1989). "5-Fluorouracil inhibits dihydrofolate reductase precursor mRNA processing and/or nuclear mRNA stability in methotrexate-resistant KB cells". The Journal of Biological Chemistry. 264 (35): 21413–21. doi:10.1016/S0021-9258(19)30096-1. PMID 2592384.
- Lengyel J, Penman S (July 1975). "hnRNA size and processing as related to different DNA content in two dipterans: Drosophila and Aedes". Cell. 5 (3): 281–90. doi:10.1016/0092-8674(75)90103-8. PMID 807333. S2CID 39038640.
- Melčák I, Melčáková Š, Kopsky V, Večeřová J, Raška I (February 2001). "Prespliceosomal assembly on microinjected precursor mRNA takes place in nuclear speckles". Molecular Biology of the Cell. 12 (2): 393–406. CiteSeerX 10.1.1.324.8865. doi:10.1091/mbc.12.2.393. PMC 30951. PMID 11179423.
- Bhat P, Chow A, Emert B, Ettlin O, Quinodoz SA, Strehle M, et al. (May 2024). "Genome organization around nuclear speckles drives mRNA splicing efficiency". Nature. 629 (8014): 1165–1173. Bibcode:2024Natur.629.1165B. doi:10.1038/s41586-024-07429-6. PMC 11164319. PMID 38720076.
- Ferro P, Forlani A, Muselli M, Pfeffer U (September 2003). "Alternative splicing of the human estrogen receptor alpha primary transcript: mechanisms of exon skipping". International Journal of Molecular Medicine. 12 (3): 355–63. PMID 12883652.
- Coffin JM, Hughes SH, Varmus HE, eds. (1997). "Transcription". Retroviruses. Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press.
External links
Post-transcriptional modification | |||||||||
---|---|---|---|---|---|---|---|---|---|
Nuclear |
| ||||||||
Cytosolic |
Types of nucleic acids | |||||||
---|---|---|---|---|---|---|---|
Constituents | |||||||
Ribonucleic acids (coding, non-coding) |
| ||||||
Deoxyribonucleic acids | |||||||
Analogues | |||||||
Cloning vectors | |||||||