{"title":"im7G-DCT: A two-branch strategy model based on improved DenseNet and transformer for m7G site prediction","authors":"Rufeng Lei , Jian Jia , Lulu Qin","doi":"10.1016/j.compbiolchem.2025.108473","DOIUrl":null,"url":null,"abstract":"<div><div>N-7 methylguanosine (m7G) is an important RNA modification that plays a key role in regulating gene expression and cellular physiological functions. Medical research has shown that m7G is closely associated with the development of a variety of diseases, including cancer, neurodegenerative diseases and viral infections. Therefore, accurate identification of m7G sites in mRNA is important for clinical applications and development of therapeutic strategies. With the rapid development of computational methods, deep learning prediction models are widely used in the field of m7G site. We developed a model called im7G-DCT based on the improved Densely Connected Convolutional Network and Transformer, which employs a two-branching strategy to extract local and global features from the original feature code in parallel. We found that this innovative strategy can mine the potential feature information of m7G locus sequences in a deeper way, which enhances the richness of features and improves the accuracy of prediction. As an innovative deep learning method, the im7G-DCT model shows important research value in the field of bioinformatics and has broad application prospects. In the results of independent test experiments, the im7G-DCT model achieved 81.68 %, 90.52 %, 86.10 %, and 72.48 % in sensitivity, specificity, accuracy, and Matthews' correlation coefficient, respectively. The im7G-DCT model performs well in all evaluation metrics and it significantly outperforms other existing prediction models.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"118 ","pages":"Article 108473"},"PeriodicalIF":2.6000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927125001331","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
N-7 methylguanosine (m7G) is an important RNA modification that plays a key role in regulating gene expression and cellular physiological functions. Medical research has shown that m7G is closely associated with the development of a variety of diseases, including cancer, neurodegenerative diseases and viral infections. Therefore, accurate identification of m7G sites in mRNA is important for clinical applications and development of therapeutic strategies. With the rapid development of computational methods, deep learning prediction models are widely used in the field of m7G site. We developed a model called im7G-DCT based on the improved Densely Connected Convolutional Network and Transformer, which employs a two-branching strategy to extract local and global features from the original feature code in parallel. We found that this innovative strategy can mine the potential feature information of m7G locus sequences in a deeper way, which enhances the richness of features and improves the accuracy of prediction. As an innovative deep learning method, the im7G-DCT model shows important research value in the field of bioinformatics and has broad application prospects. In the results of independent test experiments, the im7G-DCT model achieved 81.68 %, 90.52 %, 86.10 %, and 72.48 % in sensitivity, specificity, accuracy, and Matthews' correlation coefficient, respectively. The im7G-DCT model performs well in all evaluation metrics and it significantly outperforms other existing prediction models.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.