CAP-m7G: A capsule network-based framework for specific RNA N7-methylguanosine site identification using image encoding and reconstruction layers
Peilin Xie, Jiahui Guan, Xuxin He, Zhihao Zhao, Yilin Guo, Zhenglong Sun, Lantian Yao, Tzong-Yi Lee, Ying-Chih Chiang
Computational and Structural Biotechnology Journal, volume 27, pages 804-812 (2025). DOI: 10.1016/j.csbj.2025.02.029
https://www.sciencedirect.com/science/article/pii/S2001037025000595
Abstract
N7-methylguanosine (m7G) modifications play a pivotal role in RNA stability, mRNA export, and protein translation. They are closely associated with ribosome function and the regulation of gene expression. Dysregulation of m7G has been implicated in various diseases, including cancers and neurodegenerative disorders, where the loss of m7G can lead to genomic instability and uncontrolled cell proliferation. Accurate identification of m7G sites is thus essential for elucidating these mechanisms. Due to the high cost of experimentally validating m7G sites, several artificial intelligence models have been developed to predict these sites. However, the performance of these models is not yet optimal, and a user-friendly web server is still needed. To address these issues, we developed CAP-m7G, an innovative model that integrates Chaos Game Representation, Capsule Networks, and reconstruction layers. CAP-m7G achieved an accuracy of 96.63%, a specificity of 95.07%, and a Matthews correlation coefficient (MCC) of 0.933 on independent test data. Our results demonstrate that the integration of Chaos Game Representation with Capsule Network can effectively capture the crucial sequence information associated with m7G sites. The web server can be accessed at https://awi.cuhk.edu.cn/~biosequence/CAP-m7G/index.php.
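To make the image-encoding idea concrete, the following is a minimal sketch of how a Chaos Game Representation can turn an RNA sequence into a small count grid. It is not the authors' implementation: the corner assignment in `CORNERS`, the function names `cgr_points` and `cgr_image`, and the default grid `size` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
# Hypothetical corner layout for the unit square; the abstract does not
# state which nucleotide the authors place at which corner.
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "U": (1.0, 0.0)}


def cgr_points(seq):
    """Return the Chaos Game Representation trajectory of an RNA sequence."""
    x, y = 0.5, 0.5  # start at the centre of the unit square
    points = []
    for base in seq.upper().replace("T", "U"):
        if base not in CORNERS:
            continue  # skip ambiguous symbols such as N
        cx, cy = CORNERS[base]
        # Move halfway from the current point toward the base's corner.
        x, y = (x + cx) / 2.0, (y + cy) / 2.0
        points.append((x, y))
    return points


def cgr_image(seq, size=8):
    """Bin the CGR trajectory into a size x size grid of visit counts."""
    grid = [[0] * size for _ in range(size)]
    for x, y in cgr_points(seq):
        col = min(int(x * size), size - 1)
        row = min(int(y * size), size - 1)
        grid[row][col] += 1
    return grid


if __name__ == "__main__":
    # Print the count grid for a short, made-up example sequence.
    for row in cgr_image("GGACUGGACU", size=4):
        print(row)
```

Each nucleotide moves the current point halfway toward its corner, so the final position depends on the recent sequence context. A grid of this kind could then be passed to an image model such as a capsule network, although the resolution and preprocessing used by CAP-m7G may differ.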
About the journal:
Computational and Structural Biotechnology Journal (CSBJ) is an online gold open access journal publishing research articles and reviews after full peer review. All articles are published, without barriers to access, immediately upon acceptance. The journal places a strong emphasis on functional and mechanistic understanding of how molecular components in a biological process work together through the application of computational methods. Structural data may provide such insights, but they are not a pre-requisite for publication in the journal. Specific areas of interest include, but are not limited to:
Structure and function of proteins, nucleic acids and other macromolecules
Structure and function of multi-component complexes
Protein folding, processing and degradation
Enzymology
Computational and structural studies of plant systems
Microbial Informatics
Genomics
Proteomics
Metabolomics
Algorithms and Hypothesis in Bioinformatics
Mathematical and Theoretical Biology
Computational Chemistry and Drug Discovery
Microscopy and Molecular Imaging
Nanotechnology
Systems and Synthetic Biology