{"title":"Feadm5C:利用理化分子图特征增强对RNA 5-甲基胞嘧啶修饰位点的预测。","authors":"Dongdong Jiang , Chunyan Ao , Yan Li , Liang Yu","doi":"10.1016/j.ygeno.2025.111037","DOIUrl":null,"url":null,"abstract":"<div><div>One common post-transcriptional modification that is essential to biological activities is RNA 5-methylcytosine (m5C). A large amount of RNA data containing m5C modification sites has been gathered as a result of the rapid development of high-throughput sequencing technology. While there are a lot of machine learning based techniques available for identifying m5C alteration sites, these models' accuracy still has to be raised. This study proposed a novel method, Feadm5C, which predicts m5C based on fusing molecular graph features and sequencing information together. 10-fold cross-validation was used to assess the model's predictive performance. In addition, we used t-SNE visualization to assess the model's stability and effectiveness. While keeping feature encoding and model structure straightforward, the approach suggested in this work outperforms the most recent approaches in use. The dataset and code of the model can be downloaded from GitHub (<span><span>https://github.com/LiangYu-Xidian/Feadm5C</span><svg><path></path></svg></span>).</div></div>","PeriodicalId":12521,"journal":{"name":"Genomics","volume":"117 3","pages":"Article 111037"},"PeriodicalIF":3.0000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feadm5C: Enhancing prediction of RNA 5-Methylcytosine modification sites with physicochemical molecular graph features\",\"authors\":\"Dongdong Jiang , Chunyan Ao , Yan Li , Liang Yu\",\"doi\":\"10.1016/j.ygeno.2025.111037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>One common post-transcriptional modification that is essential to biological activities is RNA 5-methylcytosine (m5C). A large amount of RNA data containing m5C modification sites has been gathered as a result of the rapid development of high-throughput sequencing technology. While there are a lot of machine learning based techniques available for identifying m5C alteration sites, these models' accuracy still has to be raised. This study proposed a novel method, Feadm5C, which predicts m5C based on fusing molecular graph features and sequencing information together. 10-fold cross-validation was used to assess the model's predictive performance. In addition, we used t-SNE visualization to assess the model's stability and effectiveness. While keeping feature encoding and model structure straightforward, the approach suggested in this work outperforms the most recent approaches in use. The dataset and code of the model can be downloaded from GitHub (<span><span>https://github.com/LiangYu-Xidian/Feadm5C</span><svg><path></path></svg></span>).</div></div>\",\"PeriodicalId\":12521,\"journal\":{\"name\":\"Genomics\",\"volume\":\"117 3\",\"pages\":\"Article 111037\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-03-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0888754325000539\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888754325000539","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
Feadm5C: Enhancing prediction of RNA 5-Methylcytosine modification sites with physicochemical molecular graph features
One common post-transcriptional modification that is essential to biological activities is RNA 5-methylcytosine (m5C). A large amount of RNA data containing m5C modification sites has been gathered as a result of the rapid development of high-throughput sequencing technology. While there are a lot of machine learning based techniques available for identifying m5C alteration sites, these models' accuracy still has to be raised. This study proposed a novel method, Feadm5C, which predicts m5C based on fusing molecular graph features and sequencing information together. 10-fold cross-validation was used to assess the model's predictive performance. In addition, we used t-SNE visualization to assess the model's stability and effectiveness. While keeping feature encoding and model structure straightforward, the approach suggested in this work outperforms the most recent approaches in use. The dataset and code of the model can be downloaded from GitHub (https://github.com/LiangYu-Xidian/Feadm5C).
期刊介绍:
Genomics is a forum for describing the development of genome-scale technologies and their application to all areas of biological investigation.
As a journal that has evolved with the field that carries its name, Genomics focuses on the development and application of cutting-edge methods, addressing fundamental questions with potential interest to a wide audience. Our aim is to publish the highest quality research and to provide authors with rapid, fair and accurate review and publication of manuscripts falling within our scope.