Identification of long noncoding RNAs (lncRNAs) and co-transcriptional analysis of mRNAs and lncRNAs in transcriptomes of Anopheles gambiae.

Frontiers in RNA research Pub Date : 2025-01-01 Epub Date: 2025-04-15 DOI:10.3389/frnar.2025.1555885
Jiannong Xu, Kai Hu, Michelle M Riehle, Vedbar S Khadka
{"title":"Identification of long noncoding RNAs (lncRNAs) and co-transcriptional analysis of mRNAs and lncRNAs in transcriptomes of <i>Anopheles gambiae</i>.","authors":"Jiannong Xu, Kai Hu, Michelle M Riehle, Vedbar S Khadka","doi":"10.3389/frnar.2025.1555885","DOIUrl":null,"url":null,"abstract":"<p><p><i>Anopheles gambiae</i> is a primary malaria vector mosquito in Africa. RNA-seq based transcriptome analysis has been widely used to study gene expressions underlying mosquito life traits such as development, reproduction, immunity, metabolism, and behavior. While it is widely appreciated that long non-coding RNAs (lncRNAs) are expressed ubiquitously in transcriptomes across metazoans, lncRNAs remain relatively underexplored in <i>An. gambiae</i>, including their identity, expression profiles, and biological functions. The lncRNA genes were poorly annotated in the current reference of the PEST genome of <i>An. gambiae</i>. In this study, a set of publicly available RNA-seq datasets was leveraged to identify lncRNAs across diverse contexts, including whole mosquitoes, mosquito cells or tissues including hemocytes, midguts, and salivary glands, as well as under different physiological conditions including sugar-feeding, blood-feeding, bacterial challenges, and <i>Plasmodium</i> infections. A Transcript Discovery module implemented in CLC genomics workbench was used to identify lncRNAs from selected published RNA-seq datasets. Across this pool of transcriptomes, 2684 unique lncRNA genes, comprising 4082 transcripts, were identified. Following their identification, these lncRNA genes were integrated into the mosquito transcriptome annotation, which was then used as a reference to analyze both mRNAs and lncRNAs for transcriptional dynamics in different conditions. Unsurprisingly and similar to what has been reported for mRNAs, lncRNAs exhibited context-dependent expression patterns. Co-expression networks constructed using weighted gene co-expression network analysis (WGCNA) highlighted the interconnections among lncRNAs and mRNAs, which provides potential functional networks in which these lncRNAs are involved. Furthermore, we identified polysome-associated lncRNAs within polysome-captured transcripts, suggesting the involvement of lncRNAs in translation regulation and coding capacity for micropeptides. The analysis of a ChIP-seq dataset unveiled a correlation of transcriptional activities between lncRNAs and observed epigenetic signatures. Overall, our study demonstrated that lncRNAs are transcribed alongside mRNAs in various biological contexts. The genome-wide annotation of lncRNA genes and integration into the PEST reference genome enables the co-analysis of mRNA and lncRNA simultaneously, which will enhance our understanding of their functions, shedding light on their regulatory roles in <i>An. gambiae</i> biology.</p>","PeriodicalId":73105,"journal":{"name":"Frontiers in RNA research","volume":"3 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12376855/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in RNA research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frnar.2025.1555885","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/15 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Anopheles gambiae is a primary malaria vector mosquito in Africa. RNA-seq based transcriptome analysis has been widely used to study gene expressions underlying mosquito life traits such as development, reproduction, immunity, metabolism, and behavior. While it is widely appreciated that long non-coding RNAs (lncRNAs) are expressed ubiquitously in transcriptomes across metazoans, lncRNAs remain relatively underexplored in An. gambiae, including their identity, expression profiles, and biological functions. The lncRNA genes were poorly annotated in the current reference of the PEST genome of An. gambiae. In this study, a set of publicly available RNA-seq datasets was leveraged to identify lncRNAs across diverse contexts, including whole mosquitoes, mosquito cells or tissues including hemocytes, midguts, and salivary glands, as well as under different physiological conditions including sugar-feeding, blood-feeding, bacterial challenges, and Plasmodium infections. A Transcript Discovery module implemented in CLC genomics workbench was used to identify lncRNAs from selected published RNA-seq datasets. Across this pool of transcriptomes, 2684 unique lncRNA genes, comprising 4082 transcripts, were identified. Following their identification, these lncRNA genes were integrated into the mosquito transcriptome annotation, which was then used as a reference to analyze both mRNAs and lncRNAs for transcriptional dynamics in different conditions. Unsurprisingly and similar to what has been reported for mRNAs, lncRNAs exhibited context-dependent expression patterns. Co-expression networks constructed using weighted gene co-expression network analysis (WGCNA) highlighted the interconnections among lncRNAs and mRNAs, which provides potential functional networks in which these lncRNAs are involved. Furthermore, we identified polysome-associated lncRNAs within polysome-captured transcripts, suggesting the involvement of lncRNAs in translation regulation and coding capacity for micropeptides. The analysis of a ChIP-seq dataset unveiled a correlation of transcriptional activities between lncRNAs and observed epigenetic signatures. Overall, our study demonstrated that lncRNAs are transcribed alongside mRNAs in various biological contexts. The genome-wide annotation of lncRNA genes and integration into the PEST reference genome enables the co-analysis of mRNA and lncRNA simultaneously, which will enhance our understanding of their functions, shedding light on their regulatory roles in An. gambiae biology.

冈比亚按蚊转录组长链非编码rna (lncRNAs)的鉴定及mrna和lncRNAs的共转录分析。
冈比亚按蚊是非洲主要的疟疾病媒蚊子。基于RNA-seq的转录组分析已被广泛用于研究蚊子发育、繁殖、免疫、代谢和行为等生命特征的基因表达。虽然人们普遍认为长链非编码rna (lncRNAs)在后生动物的转录组中普遍表达,但lncRNAs在An中的研究仍然相对不足。冈比亚,包括他们的身份,表达谱和生物学功能。在目前的参考资料中,lncRNA基因注释较差。冈比亚按蚊。在这项研究中,利用一组公开可用的RNA-seq数据集来鉴定不同背景下的lncrna,包括整只蚊子、蚊子细胞或组织(包括血细胞、肠道和唾液腺),以及不同生理条件(包括糖摄食、血液摄食、细菌挑战和疟原虫感染)。在CLC基因组学工作台中实现的转录发现模块用于从选定的已发表的RNA-seq数据集中鉴定lncrna。在这个转录组库中,鉴定出2684个独特的lncRNA基因,包括4082个转录本。鉴定后,将这些lncRNA基因整合到蚊子转录组注释中,然后作为参考分析mrna和lncRNA在不同条件下的转录动态。不出所料,与已报道的mrna相似,lncRNAs表现出上下文依赖的表达模式。利用加权基因共表达网络分析(WGCNA)构建的共表达网络突出了lncrna和mrna之间的相互联系,提供了这些lncrna参与的潜在功能网络。此外,我们在多聚体捕获的转录本中发现了多聚体相关的lncRNAs,这表明lncRNAs参与了微肽的翻译调控和编码能力。对ChIP-seq数据集的分析揭示了lncrna之间的转录活性与观察到的表观遗传特征之间的相关性。总的来说,我们的研究表明lncrna在各种生物学背景下与mrna一起转录。lncRNA基因的全基因组注释和整合到PEST参考基因组中,可以同时对mRNA和lncRNA进行联合分析,这将增强我们对它们功能的理解,揭示它们在An中的调控作用。冈比亚按蚊的生物学。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信