Na Li , Xiao Wang , Ming Zeng , Feng Cao , Ke Qiu , Jianbo Qiao
{"title":"高效的RNA核苷酸编码增强了对ac4C修饰的准确预测。","authors":"Na Li , Xiao Wang , Ming Zeng , Feng Cao , Ke Qiu , Jianbo Qiao","doi":"10.1016/j.ymeth.2025.07.012","DOIUrl":null,"url":null,"abstract":"<div><div>RNA N4-acetylcytidine (ac4C) modification plays a vital role in gene regulation and cellular function. Accurate identification of ac4C sites is essential for elucidating their biological significance. However, existing prediction methods struggle to capture complex sequence patterns, limiting their accuracy. To address this, we propose GO-ac4C, an efficient prediction framework that integrates byte-pair encoding with nucleotide compositional features. GO-ac4C employs dynamic byte-pair encoding to learn optimal subsequence representations and enhances them with compositional features to effectively capture key motifs in RNA sequences. Experimental results demonstrate that GO-ac4C significantly outperforms state-of-the-art methods across multiple evaluation metrics and offers new insights into the mechanisms of RNA modification.</div></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"244 ","pages":"Pages 1-6"},"PeriodicalIF":4.3000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient RNA nucleotide encoding enhances the accurate prediction of ac4C modifications\",\"authors\":\"Na Li , Xiao Wang , Ming Zeng , Feng Cao , Ke Qiu , Jianbo Qiao\",\"doi\":\"10.1016/j.ymeth.2025.07.012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>RNA N4-acetylcytidine (ac4C) modification plays a vital role in gene regulation and cellular function. Accurate identification of ac4C sites is essential for elucidating their biological significance. However, existing prediction methods struggle to capture complex sequence patterns, limiting their accuracy. To address this, we propose GO-ac4C, an efficient prediction framework that integrates byte-pair encoding with nucleotide compositional features. GO-ac4C employs dynamic byte-pair encoding to learn optimal subsequence representations and enhances them with compositional features to effectively capture key motifs in RNA sequences. Experimental results demonstrate that GO-ac4C significantly outperforms state-of-the-art methods across multiple evaluation metrics and offers new insights into the mechanisms of RNA modification.</div></div>\",\"PeriodicalId\":390,\"journal\":{\"name\":\"Methods\",\"volume\":\"244 \",\"pages\":\"Pages 1-6\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1046202325001860\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202325001860","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
Efficient RNA nucleotide encoding enhances the accurate prediction of ac4C modifications
RNA N4-acetylcytidine (ac4C) modification plays a vital role in gene regulation and cellular function. Accurate identification of ac4C sites is essential for elucidating their biological significance. However, existing prediction methods struggle to capture complex sequence patterns, limiting their accuracy. To address this, we propose GO-ac4C, an efficient prediction framework that integrates byte-pair encoding with nucleotide compositional features. GO-ac4C employs dynamic byte-pair encoding to learn optimal subsequence representations and enhances them with compositional features to effectively capture key motifs in RNA sequences. Experimental results demonstrate that GO-ac4C significantly outperforms state-of-the-art methods across multiple evaluation metrics and offers new insights into the mechanisms of RNA modification.
期刊介绍:
Methods focuses on rapidly developing techniques in the experimental biological and medical sciences.
Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.