{"title":"mRNAs of plants and green algae lack the m7G cap-1 structure","authors":"Chen Xiao, Qiongfang Li, Shangwei Wu, Feng Zhang, Hailei Zhang, Chen Zhang, Zongwei Cai, Yiji Xia","doi":"10.1111/nph.70033","DOIUrl":null,"url":null,"abstract":"<p>The 7-methylguanosine (m<sup>7</sup>G) cap is a characteristic feature found at the 5′ end of eukaryotic mRNA and certain noncoding RNAs. The cap is linked to mRNA through a 5′–5′ pyrophosphate bond (Ramanathan <i>et al</i>., <span>2016</span>). The formation of the cap involves a capping enzyme (CE) that adds the guanosine (G) cap to the 5′ end of a nascent transcript after <i>c</i>. 30 nucleotides have been synthesized. This G cap is subsequently methylated by RNA guanosine-7 methyltransferase (RNMT) to produce the m<sup>7</sup>G cap, which is also referred to as the m<sup>7</sup>G Cap-0 form (m<sup>7</sup>GpppN; Shuman, <span>2002</span>; Cowling, <span>2010</span>). The m<sup>7</sup>G cap plays a vital role in recruiting proteins to form a cap-binding complex, which is essential for transcription elongation, splicing, polyadenylation, and translation initiation, as well as providing protection from degradation by 5′–3′ exonucleases (Galloway & Cowling, <span>2019</span>). While m<sup>7</sup>G capping was once considered a constitutive housekeeping process for all eukaryotic mRNA, emerging evidence suggests that it is regulated in a gene-specific manner in response to various stimuli (Borden <i>et al</i>., <span>2021</span>). Additionally, recent findings indicate that some RNAs in both prokaryotic and eukaryotic organisms possess noncanonical caps, such as the nicotinamide adenine dinucleotide (NAD) cap (Wolfram-Schauerte & Höfer, <span>2023</span>), highlighting the complexity of gene regulation mechanisms involving RNA capping.</p>\n<p>In addition to the Cap-0 form, the first transcribed nucleotides of mRNA can be methylated at the 2′ hydroxyl position of its ribose during mRNA biogenesis, resulting in the formation of the Cap-1 mRNA (m<sup>7</sup>GpppNm) (Galloway & Cowling, <span>2019</span>). After mRNA is exported into the cytosol, Cap-1 mRNAs can undergo further methylation at the second nucleotide, also at the 2′ hydroxyl of the ribose, to produce Cap-2 mRNA (m<sup>7</sup>GpppNmNm).</p>\n<p>Mammalian mRNA typically possesses the Cap-1 structure, whereas mRNAs that possess only the Cap-0 structure are recognized as nonself RNA, triggering the innate immune response (Daffis <i>et al</i>., <span>2010</span>). Further, a large portion of Cap-1 mRNAs in human cells is further converted to the Cap-2 form, whereas an elevated level of Cap-1 mRNA can still activate the innate immune response although not as potent as Cap-0 mRNA (Despic & Jaffrey, <span>2023</span>). Besides preventing excessive activation of the immune response, Cap-1 and Cap-2 play additional roles such as mRNA stability and increased translation (Despic & Jaffrey, <span>2023</span>).</p>\n<p>Although it has been known for over 40 yr that plant mRNAs also contain the m<sup>7</sup>G cap (Nichols, <span>1979</span>; Haugland & Cline, <span>1980</span>), it remains unclear whether any plant RNAs possess the Cap-1 or Cap-2 forms. To further investigate the different forms of mRNA caps and their roles in gene regulation, we aimed to determine whether plant mRNAs also contain Cap-1 and Cap-2 structures. Here, we report our findings that mRNAs in both plants and green algae are deficient in the Cap-1 structure, and consequently, they also do not possess the Cap-2 structure. This suggests a fundamental evolutionary divergence in the canonical mRNA cap modification and function between animal and plant lineages.</p>\n<p>Recently, a method called CapTag-seq has been developed to quantify and identify Cap-0, Cap-1, and Cap-2 structures in human cells (Despic & Jaffrey, <span>2023</span>). This method utilizes RNase T2, which cleaves all phosphodiester bonds in RNA except for the bond between 2′-<i>O</i>-methylated nucleotides and one nucleotide downstream of a 2′-<i>O</i>-methylated nucleotide (Motorin & Marchand, <span>2018</span>). In CapTag-seq, mRNAs are first treated with an m<sup>7</sup>G decapping enzyme to remove 7-methylguanosine diphosphate (m<sup>7</sup>GDP) from the cap, resulting in a 5′-monophosphorylated end that is ligated to a 5′ adaptor. The adaptor consists of 2′-<i>O</i>-methylated nucleotides, making it resistant to RNase T2 cleavage. After ligation to the adaptor, the sample is digested with RNase T2. If the original mRNA contains only the Cap-0 structure, a single nucleotide (i.e. the first nucleotide) will remain attached to the adaptor, whereas mRNAs with Cap-1 or Cap-2 structures will have two and three nucleotides attached to the adaptor, respectively.</p>\n<p>To carry out the CapTag-seq analysis (Fig. 1a), we extracted total RNAs from Arabidopsis (<i>Arabidopsis thaliana</i>), maize (<i>Zea mays</i>), green algae (<i>Selenastrum</i> sp.), human HEK293T cells, baker's yeast (<i>Saccharomyces cerevisiae</i>), and <i>Escherichia coli</i> (<i>E. coli</i>). Agarose gel electrophoresis indicated that the total RNAs from different organisms were intact (Supporting Information Fig. S1). mRNAs from the eukaryotic organisms were enriched using oligo d(T) beads, while mRNAs from <i>E. coli</i> were enriched with an rRNA removal kit.</p>\n<figure><picture>\n<source media=\"(min-width: 1650px)\" srcset=\"/cms/asset/1df8a464-83fb-4001-8a2f-8fb64610910f/nph70033-fig-0001-m.jpg\"/><img alt=\"Details are in the caption following the image\" data-lg-src=\"/cms/asset/1df8a464-83fb-4001-8a2f-8fb64610910f/nph70033-fig-0001-m.jpg\" loading=\"lazy\" src=\"/cms/asset/d5271dce-7696-4995-8653-9edebd161405/nph70033-fig-0001-m.png\" title=\"Details are in the caption following the image\"/></picture><figcaption>\n<div><strong>Fig. 1<span style=\"font-weight:normal\"></span></strong><div>Open in figure viewer<i aria-hidden=\"true\"></i><span>PowerPoint</span></div>\n</div>\n<div>CapTag-seq and CapTag-polyacrylamide gel electrophoresis (PAGE) reveal that mRNAs of plants and green algae are deficient in the 7-methylguanosine Cap-1 structure. (a) Schematic diagram of the CapTag-seq and CapTag-PAGE workflow used in the experiment. The workflow of CapTag-seq was adopted from (Despic & Jaffrey, <span>2023</span>). The schematic diagrams were created in BioRender (https://Biorender.com/e74y565). (b–g) Read counts representing different cap structures in mRNAs from Arabidopsis (b), <i>Escherichia coli</i> (c), <i>Saccharomyces cerevisiae</i> (d), <i>Zea mays</i> (e), <i>Selenastrum</i> sp. (f), and a mixture of Arabidopsis and HEK293T cells (g). The 20 nt read length indicates self-ligation of the 5′ and 3′ adapters, while the 21, 22, and 23 nt read lengths correspond to the Cap-0, Cap-1, and Cap-2 structures, respectively. Data represent mean ± SD (<i>n</i> = 3), with all individual data points represented as dots. Different letters in (b–e) indicate significant differences as determined by one-way ANOVA with Tukey's test (<i>P</i> < 0.05). (h) Analysis of cap structures using CapTag-PAGE. This method enables the separation of bands with single-nucleotide variations in the polymerase chain reaction (PCR) products, ranging from 68 to 71 bp (69 bp for Cap-0, 70 bp for Cap-1, and 71 bp for Cap-2). The red-dashed line indicates the location of the 69 bp PCR product. CIP, calf intestinal alkaline phosphatase; RT, reverse transcription.</div>\n</figcaption>\n</figure>\n<p>To minimize the ligation of the 5′ adaptor with uncapped monophosphorylated RNAs that might be present due to partially degraded or sheared RNAs, we first treated the samples with calf intestinal alkaline phosphatase (CIP) to remove the 5′ phosphate. The samples were then subjected to m<sup>7</sup>G decapping, resulting in 5′-monophosphate RNAs that were ligated to a biotin-labeled, 2′-<i>O</i>-methylated 5′ RNA adaptor. Following RNase T2 cleavage, we enriched the RNA fragments containing the adaptor using streptavidin beads. The samples were then ligated to a 3′ adaptor containing a random 20 nt (N20) sequence, and the ligation products were reverse transcribed into complementary DNA (cDNA).</p>\n<p>cDNAs were amplified using index polymerase chain reaction (PCR) to create cDNA libraries. We determined that 15 cycles of PCR were optimal for our experimental conditions, yielding sufficient products for sequencing analysis without overamplification. The resulting cDNA libraries were sequenced using Illumina sequencing and analyzed with the SOAP<span>nuke</span> v2.1.9 software (Chen <i>et al</i>., <span>2018</span>). A read of 20 nt indicates a product from self-ligation of the 5′ and 3′ adaptors, which contains only the N20 sequence, while 21, 22, and 23 nt reads correspond to the Cap-0, Cap-1, and Cap-2 forms, respectively (Fig. 1a).</p>\n<p>In the Arabidopsis cDNA library, 78.0% of the total reads were 21 nt reads, indicating their origin from Cap-0 mRNAs (Fig. 1b). However, these 21 nt reads may not be exclusively derived from Cap-0 mRNAs, as partially sheared or degraded RNAs might not be completely dephosphorylated and could still ligate to the 5′ adaptor. This observation also explains why 6% of the reads from <i>E. coli</i>, which do not possess the Cap-0 structure, were 21 nt in length, indicating the presence of some 5′-monophosphate RNAs in the sample during ligation to the 5′ adaptor (Fig. 1c).</p>\n<p>In addition to the 21 nt reads in the Arabidopsis library, 20.6% of the reads were 20 nt long, likely resulting from self-ligation of the two adaptors. The remaining 1.4% of the reads comprised lengths of 19, 22, 23, and 24 nt (Fig. 1b). The minimal presence of 22 and 23 nt reads is unlikely to indicate the presence of Cap-1 and Cap-2 mRNAs; rather, it is more likely a result of noise, similar to the minimal levels of 19 and 24 nt reads. This noise could arise from incomplete digestion by RNase T2, sequencing errors, or other factors. This is further supported by the presence of comparable levels of 22 and 23 nt reads in the libraries from <i>E. coli</i> and yeast (Fig. 1c,d), which have also been reported to lack Cap-1 or Cap-2 structures (Sripati <i>et al</i><i>.,</i> <span>1976</span>). These results suggest that Arabidopsis mRNA does not contain the Cap-1 or Cap-2 forms. Similarly, we did not detect Cap-1 or Cap-2 mRNAs in maize or green algae (Fig. 1e,f).</p>\n<p>The failure to detect Cap-1 or Cap-2 structures in the plant and green algae samples is unlikely due to a technical issue, as we have repeated similar experiments multiple times and consistently obtained comparable results. Furthermore, in our experiments, we included an RNA sample as a positive control, in which Arabidopsis mRNAs were mixed with mRNAs from human HEK293T cells at a 60% : 40% ratio. It has been reported that HEK293T cells lack Cap-0-only mRNAs (Despic & Jaffrey, <span>2023</span>); however, their mRNAs generally contain Cap-1, with a significant portion further methylated to form Cap-2 (Furuichi <i>et al</i>., <span>1975</span>; Despic & Jaffrey, <span>2023</span>). In the mixed Arabidopsis–human samples, 54.4% of the reads were 21 nt in length, likely attributed solely to Arabidopsis Cap-0 mRNAs, while the 22 and 23 nt reads accounted for a total of 20.0% of the total reads (Fig. 1g). These results further support the conclusion that plants and green algae lack the Cap-1 and Cap-2 forms.</p>\n<p>In addition to the CapTag-seq method, we employed polyacrylamide gel electrophoresis (PAGE) to analyze the different forms of cap structures. This modified method, which we have named CapTag-PAGE (Fig. 1a), is more cost-effective as it does not require sequencing of the cDNA libraries, although it is not as quantitative as CapTag-seq. In CapTag-PAGE, RNAs were processed similarly to CapTag-seq until the synthesis of the first-strand cDNA via reverse transcription. Subsequently, short primers were added to the cDNA fragments, which were then amplified using standard PCR. The resulting PCR products were separated using 15% PAGE on a long gel (<i>c</i>. 130 mm) for <i>c</i>. 5 h, allowing for the distinction of products ranging from 68 to 71 bp, representing the origins of adaptor self-ligation, Cap-0, Cap-1, and Cap-2 mRNAs, with single-nucleotide resolution. CapTag-PAGE revealed that mRNAs from plants, green algae, and yeast exclusively contain the m<sup>7</sup>G Cap-0 structure, as indicated by a clear 69 bp band, with no detectable 70 bp (Cap-1) or 71 bp (Cap-2) bands (Fig. 1h). By contrast, only the Cap-1 and Cap-2 forms were detectable in human HEK293T cells, as demonstrated by two distinct bands at 70 and 71 bp (Fig. 1h), but no detectable 69 bp (Cap-0) product. As a negative control, <i>E. coli</i> displayed a single clear band at 68 bp, representing the ligation products of the adaptors (Fig. 1h).</p>\n<p>From the CapTag-seq data, we also analyzed the compositions of the first nucleotides in the mRNA samples, which represent the transcription start sites (TSS). All of the organisms exhibited a similar TSS preference, with <i>c</i>. 60% being adenine (A) and <i>c</i>. 20% being guanine (G), followed by the pyrimidine nucleotide cytosine (C), while thymine (T) was present in the lowest abundance (Fig. S2).</p>\n<p>In animals, cap methyltransferase 1 (CMTR1) and cap methyltransferase 2 (CMTR2) catalyze the successive 2′-<i>O</i>-methylation of the first and second nucleotides, leading to the formation of Cap-1 and Cap-2 structures, respectively (Smietanski <i>et al</i>., <span>2014</span>). A homolog search of plant protein sequences in the National Center for Biotechnology Information databases revealed no plant sequences resembling CMTR1 (Fig. S3a). Similarly, a BLAST analysis comparing human CMTR1 with the Arabidopsis TAIR database (Araport11 protein sequences database) also showed no significant matches (Fig. S3b). We cannot entirely dismiss the possibility that a distinct methyltransferase exists in plants, capable of catalyzing the 2′-<i>O</i>-methylation of the first transcribed nucleotide in mRNAs from certain genes; however, if such a modification is present in plants, it falls below the detection limits of the methods employed in this study.</p>\n<p>Although it has been known for over 40 yr that plant mRNAs possess the m<sup>7</sup>G cap, research on the mechanisms of m<sup>7</sup>G capping and its role in gene regulation in plants has been very limited. This may be partly due to the assumption that plants utilize a capping mechanism similar to that of animals. This study reveals that mRNAs in plants and green algae, similar to yeast mRNAs, primarily carry the Cap-0 structure, with no presence of Cap-1 or Cap-2 forms. By contrast, human Cap-0 mRNAs are generally methylated at the first transcribed nucleotide to form Cap-1 mRNA, with a significant portion further modified to produce the Cap-2 structure. Our findings regarding the different forms of mRNA cap structures suggest a fundamental evolutionary divergence in cap modification and function between animals and other eukaryotic organisms, although m<sup>7</sup>G Cap-0 capping is a common feature that likely evolved in a shared ancestor of these eukaryotic groups. Recently, the CE and RNMT responsible for the formation of the m<sup>7</sup>G cap in Arabidopsis have been identified and characterized, showing significant sequence similarity to these enzymes in animals (Kerk <i>et al</i>., <span>2008</span>; Xiao <i>et al</i>., <span>2023</span>; Ning <i>et al</i>., <span>2024</span>). In addition to these two core enzymes for m<sup>7</sup>G capping, mammalian RNMT is activated by a miniprotein known as RNMT-activating miniprotein (RAM) (Gonatopoulos-Pournatzis <i>et al</i>., <span>2011</span>). However, we have recently discovered that Arabidopsis RNMT1 is activated by Decapping and exoribonuclease 1 (DXO1) (Xiao <i>et al</i>., <span>2023</span>), which is also known as a decapping enzyme for NAD-capped RNAs (Kwasnik <i>et al</i>., <span>2019</span>; Pan <i>et al</i>., <span>2020</span>; Yu <i>et al</i>., <span>2021</span>). By contrast, RAM does not appear to have any other known functions. These findings suggest a unique mechanism in plants for mediating m<sup>7</sup>G capping, potentially linked to NAD capping/decapping. The absence of Cap-1 and Cap-2 forms in plants further indicates a distinct mechanism and function of mRNA capping in gene regulation.</p>","PeriodicalId":214,"journal":{"name":"New Phytologist","volume":"39 1","pages":""},"PeriodicalIF":8.3000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Phytologist","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/nph.70033","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The 7-methylguanosine (m7G) cap is a characteristic feature found at the 5′ end of eukaryotic mRNA and certain noncoding RNAs. The cap is linked to mRNA through a 5′–5′ pyrophosphate bond (Ramanathan et al., 2016). The formation of the cap involves a capping enzyme (CE) that adds the guanosine (G) cap to the 5′ end of a nascent transcript after c. 30 nucleotides have been synthesized. This G cap is subsequently methylated by RNA guanosine-7 methyltransferase (RNMT) to produce the m7G cap, which is also referred to as the m7G Cap-0 form (m7GpppN; Shuman, 2002; Cowling, 2010). The m7G cap plays a vital role in recruiting proteins to form a cap-binding complex, which is essential for transcription elongation, splicing, polyadenylation, and translation initiation, as well as providing protection from degradation by 5′–3′ exonucleases (Galloway & Cowling, 2019). While m7G capping was once considered a constitutive housekeeping process for all eukaryotic mRNA, emerging evidence suggests that it is regulated in a gene-specific manner in response to various stimuli (Borden et al., 2021). Additionally, recent findings indicate that some RNAs in both prokaryotic and eukaryotic organisms possess noncanonical caps, such as the nicotinamide adenine dinucleotide (NAD) cap (Wolfram-Schauerte & Höfer, 2023), highlighting the complexity of gene regulation mechanisms involving RNA capping.
In addition to the Cap-0 form, the first transcribed nucleotides of mRNA can be methylated at the 2′ hydroxyl position of its ribose during mRNA biogenesis, resulting in the formation of the Cap-1 mRNA (m7GpppNm) (Galloway & Cowling, 2019). After mRNA is exported into the cytosol, Cap-1 mRNAs can undergo further methylation at the second nucleotide, also at the 2′ hydroxyl of the ribose, to produce Cap-2 mRNA (m7GpppNmNm).
Mammalian mRNA typically possesses the Cap-1 structure, whereas mRNAs that possess only the Cap-0 structure are recognized as nonself RNA, triggering the innate immune response (Daffis et al., 2010). Further, a large portion of Cap-1 mRNAs in human cells is further converted to the Cap-2 form, whereas an elevated level of Cap-1 mRNA can still activate the innate immune response although not as potent as Cap-0 mRNA (Despic & Jaffrey, 2023). Besides preventing excessive activation of the immune response, Cap-1 and Cap-2 play additional roles such as mRNA stability and increased translation (Despic & Jaffrey, 2023).
Although it has been known for over 40 yr that plant mRNAs also contain the m7G cap (Nichols, 1979; Haugland & Cline, 1980), it remains unclear whether any plant RNAs possess the Cap-1 or Cap-2 forms. To further investigate the different forms of mRNA caps and their roles in gene regulation, we aimed to determine whether plant mRNAs also contain Cap-1 and Cap-2 structures. Here, we report our findings that mRNAs in both plants and green algae are deficient in the Cap-1 structure, and consequently, they also do not possess the Cap-2 structure. This suggests a fundamental evolutionary divergence in the canonical mRNA cap modification and function between animal and plant lineages.
Recently, a method called CapTag-seq has been developed to quantify and identify Cap-0, Cap-1, and Cap-2 structures in human cells (Despic & Jaffrey, 2023). This method utilizes RNase T2, which cleaves all phosphodiester bonds in RNA except for the bond between 2′-O-methylated nucleotides and one nucleotide downstream of a 2′-O-methylated nucleotide (Motorin & Marchand, 2018). In CapTag-seq, mRNAs are first treated with an m7G decapping enzyme to remove 7-methylguanosine diphosphate (m7GDP) from the cap, resulting in a 5′-monophosphorylated end that is ligated to a 5′ adaptor. The adaptor consists of 2′-O-methylated nucleotides, making it resistant to RNase T2 cleavage. After ligation to the adaptor, the sample is digested with RNase T2. If the original mRNA contains only the Cap-0 structure, a single nucleotide (i.e. the first nucleotide) will remain attached to the adaptor, whereas mRNAs with Cap-1 or Cap-2 structures will have two and three nucleotides attached to the adaptor, respectively.
To carry out the CapTag-seq analysis (Fig. 1a), we extracted total RNAs from Arabidopsis (Arabidopsis thaliana), maize (Zea mays), green algae (Selenastrum sp.), human HEK293T cells, baker's yeast (Saccharomyces cerevisiae), and Escherichia coli (E. coli). Agarose gel electrophoresis indicated that the total RNAs from different organisms were intact (Supporting Information Fig. S1). mRNAs from the eukaryotic organisms were enriched using oligo d(T) beads, while mRNAs from E. coli were enriched with an rRNA removal kit.
Fig. 1
Open in figure viewerPowerPoint
CapTag-seq and CapTag-polyacrylamide gel electrophoresis (PAGE) reveal that mRNAs of plants and green algae are deficient in the 7-methylguanosine Cap-1 structure. (a) Schematic diagram of the CapTag-seq and CapTag-PAGE workflow used in the experiment. The workflow of CapTag-seq was adopted from (Despic & Jaffrey, 2023). The schematic diagrams were created in BioRender (https://Biorender.com/e74y565). (b–g) Read counts representing different cap structures in mRNAs from Arabidopsis (b), Escherichia coli (c), Saccharomyces cerevisiae (d), Zea mays (e), Selenastrum sp. (f), and a mixture of Arabidopsis and HEK293T cells (g). The 20 nt read length indicates self-ligation of the 5′ and 3′ adapters, while the 21, 22, and 23 nt read lengths correspond to the Cap-0, Cap-1, and Cap-2 structures, respectively. Data represent mean ± SD (n = 3), with all individual data points represented as dots. Different letters in (b–e) indicate significant differences as determined by one-way ANOVA with Tukey's test (P < 0.05). (h) Analysis of cap structures using CapTag-PAGE. This method enables the separation of bands with single-nucleotide variations in the polymerase chain reaction (PCR) products, ranging from 68 to 71 bp (69 bp for Cap-0, 70 bp for Cap-1, and 71 bp for Cap-2). The red-dashed line indicates the location of the 69 bp PCR product. CIP, calf intestinal alkaline phosphatase; RT, reverse transcription.
To minimize the ligation of the 5′ adaptor with uncapped monophosphorylated RNAs that might be present due to partially degraded or sheared RNAs, we first treated the samples with calf intestinal alkaline phosphatase (CIP) to remove the 5′ phosphate. The samples were then subjected to m7G decapping, resulting in 5′-monophosphate RNAs that were ligated to a biotin-labeled, 2′-O-methylated 5′ RNA adaptor. Following RNase T2 cleavage, we enriched the RNA fragments containing the adaptor using streptavidin beads. The samples were then ligated to a 3′ adaptor containing a random 20 nt (N20) sequence, and the ligation products were reverse transcribed into complementary DNA (cDNA).
cDNAs were amplified using index polymerase chain reaction (PCR) to create cDNA libraries. We determined that 15 cycles of PCR were optimal for our experimental conditions, yielding sufficient products for sequencing analysis without overamplification. The resulting cDNA libraries were sequenced using Illumina sequencing and analyzed with the SOAPnuke v2.1.9 software (Chen et al., 2018). A read of 20 nt indicates a product from self-ligation of the 5′ and 3′ adaptors, which contains only the N20 sequence, while 21, 22, and 23 nt reads correspond to the Cap-0, Cap-1, and Cap-2 forms, respectively (Fig. 1a).
In the Arabidopsis cDNA library, 78.0% of the total reads were 21 nt reads, indicating their origin from Cap-0 mRNAs (Fig. 1b). However, these 21 nt reads may not be exclusively derived from Cap-0 mRNAs, as partially sheared or degraded RNAs might not be completely dephosphorylated and could still ligate to the 5′ adaptor. This observation also explains why 6% of the reads from E. coli, which do not possess the Cap-0 structure, were 21 nt in length, indicating the presence of some 5′-monophosphate RNAs in the sample during ligation to the 5′ adaptor (Fig. 1c).
In addition to the 21 nt reads in the Arabidopsis library, 20.6% of the reads were 20 nt long, likely resulting from self-ligation of the two adaptors. The remaining 1.4% of the reads comprised lengths of 19, 22, 23, and 24 nt (Fig. 1b). The minimal presence of 22 and 23 nt reads is unlikely to indicate the presence of Cap-1 and Cap-2 mRNAs; rather, it is more likely a result of noise, similar to the minimal levels of 19 and 24 nt reads. This noise could arise from incomplete digestion by RNase T2, sequencing errors, or other factors. This is further supported by the presence of comparable levels of 22 and 23 nt reads in the libraries from E. coli and yeast (Fig. 1c,d), which have also been reported to lack Cap-1 or Cap-2 structures (Sripati et al.,1976). These results suggest that Arabidopsis mRNA does not contain the Cap-1 or Cap-2 forms. Similarly, we did not detect Cap-1 or Cap-2 mRNAs in maize or green algae (Fig. 1e,f).
The failure to detect Cap-1 or Cap-2 structures in the plant and green algae samples is unlikely due to a technical issue, as we have repeated similar experiments multiple times and consistently obtained comparable results. Furthermore, in our experiments, we included an RNA sample as a positive control, in which Arabidopsis mRNAs were mixed with mRNAs from human HEK293T cells at a 60% : 40% ratio. It has been reported that HEK293T cells lack Cap-0-only mRNAs (Despic & Jaffrey, 2023); however, their mRNAs generally contain Cap-1, with a significant portion further methylated to form Cap-2 (Furuichi et al., 1975; Despic & Jaffrey, 2023). In the mixed Arabidopsis–human samples, 54.4% of the reads were 21 nt in length, likely attributed solely to Arabidopsis Cap-0 mRNAs, while the 22 and 23 nt reads accounted for a total of 20.0% of the total reads (Fig. 1g). These results further support the conclusion that plants and green algae lack the Cap-1 and Cap-2 forms.
In addition to the CapTag-seq method, we employed polyacrylamide gel electrophoresis (PAGE) to analyze the different forms of cap structures. This modified method, which we have named CapTag-PAGE (Fig. 1a), is more cost-effective as it does not require sequencing of the cDNA libraries, although it is not as quantitative as CapTag-seq. In CapTag-PAGE, RNAs were processed similarly to CapTag-seq until the synthesis of the first-strand cDNA via reverse transcription. Subsequently, short primers were added to the cDNA fragments, which were then amplified using standard PCR. The resulting PCR products were separated using 15% PAGE on a long gel (c. 130 mm) for c. 5 h, allowing for the distinction of products ranging from 68 to 71 bp, representing the origins of adaptor self-ligation, Cap-0, Cap-1, and Cap-2 mRNAs, with single-nucleotide resolution. CapTag-PAGE revealed that mRNAs from plants, green algae, and yeast exclusively contain the m7G Cap-0 structure, as indicated by a clear 69 bp band, with no detectable 70 bp (Cap-1) or 71 bp (Cap-2) bands (Fig. 1h). By contrast, only the Cap-1 and Cap-2 forms were detectable in human HEK293T cells, as demonstrated by two distinct bands at 70 and 71 bp (Fig. 1h), but no detectable 69 bp (Cap-0) product. As a negative control, E. coli displayed a single clear band at 68 bp, representing the ligation products of the adaptors (Fig. 1h).
From the CapTag-seq data, we also analyzed the compositions of the first nucleotides in the mRNA samples, which represent the transcription start sites (TSS). All of the organisms exhibited a similar TSS preference, with c. 60% being adenine (A) and c. 20% being guanine (G), followed by the pyrimidine nucleotide cytosine (C), while thymine (T) was present in the lowest abundance (Fig. S2).
In animals, cap methyltransferase 1 (CMTR1) and cap methyltransferase 2 (CMTR2) catalyze the successive 2′-O-methylation of the first and second nucleotides, leading to the formation of Cap-1 and Cap-2 structures, respectively (Smietanski et al., 2014). A homolog search of plant protein sequences in the National Center for Biotechnology Information databases revealed no plant sequences resembling CMTR1 (Fig. S3a). Similarly, a BLAST analysis comparing human CMTR1 with the Arabidopsis TAIR database (Araport11 protein sequences database) also showed no significant matches (Fig. S3b). We cannot entirely dismiss the possibility that a distinct methyltransferase exists in plants, capable of catalyzing the 2′-O-methylation of the first transcribed nucleotide in mRNAs from certain genes; however, if such a modification is present in plants, it falls below the detection limits of the methods employed in this study.
Although it has been known for over 40 yr that plant mRNAs possess the m7G cap, research on the mechanisms of m7G capping and its role in gene regulation in plants has been very limited. This may be partly due to the assumption that plants utilize a capping mechanism similar to that of animals. This study reveals that mRNAs in plants and green algae, similar to yeast mRNAs, primarily carry the Cap-0 structure, with no presence of Cap-1 or Cap-2 forms. By contrast, human Cap-0 mRNAs are generally methylated at the first transcribed nucleotide to form Cap-1 mRNA, with a significant portion further modified to produce the Cap-2 structure. Our findings regarding the different forms of mRNA cap structures suggest a fundamental evolutionary divergence in cap modification and function between animals and other eukaryotic organisms, although m7G Cap-0 capping is a common feature that likely evolved in a shared ancestor of these eukaryotic groups. Recently, the CE and RNMT responsible for the formation of the m7G cap in Arabidopsis have been identified and characterized, showing significant sequence similarity to these enzymes in animals (Kerk et al., 2008; Xiao et al., 2023; Ning et al., 2024). In addition to these two core enzymes for m7G capping, mammalian RNMT is activated by a miniprotein known as RNMT-activating miniprotein (RAM) (Gonatopoulos-Pournatzis et al., 2011). However, we have recently discovered that Arabidopsis RNMT1 is activated by Decapping and exoribonuclease 1 (DXO1) (Xiao et al., 2023), which is also known as a decapping enzyme for NAD-capped RNAs (Kwasnik et al., 2019; Pan et al., 2020; Yu et al., 2021). By contrast, RAM does not appear to have any other known functions. These findings suggest a unique mechanism in plants for mediating m7G capping, potentially linked to NAD capping/decapping. The absence of Cap-1 and Cap-2 forms in plants further indicates a distinct mechanism and function of mRNA capping in gene regulation.
期刊介绍:
New Phytologist is an international electronic journal published 24 times a year. It is owned by the New Phytologist Foundation, a non-profit-making charitable organization dedicated to promoting plant science. The journal publishes excellent, novel, rigorous, and timely research and scholarship in plant science and its applications. The articles cover topics in five sections: Physiology & Development, Environment, Interaction, Evolution, and Transformative Plant Biotechnology. These sections encompass intracellular processes, global environmental change, and encourage cross-disciplinary approaches. The journal recognizes the use of techniques from molecular and cell biology, functional genomics, modeling, and system-based approaches in plant science. Abstracting and Indexing Information for New Phytologist includes Academic Search, AgBiotech News & Information, Agroforestry Abstracts, Biochemistry & Biophysics Citation Index, Botanical Pesticides, CAB Abstracts®, Environment Index, Global Health, and Plant Breeding Abstracts, and others.