Guodong Li , Yue Yang , Dongxu Li , Xiaorui Su , Zhi Zeng , Pengwei Hu , Lun Hu
{"title":"A bijective inference network for interpretable identification of RNA N6-methyladenosine modification sites","authors":"Guodong Li , Yue Yang , Dongxu Li , Xiaorui Su , Zhi Zeng , Pengwei Hu , Lun Hu","doi":"10.1016/j.patcog.2025.111541","DOIUrl":null,"url":null,"abstract":"<div><div>The accurate identification of N<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>-methyladenosine (m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A) modification sites is crucial for unraveling various functional mechanisms. While existing methods primarily focus on learning high-quality embeddings of RNA sequences for this task, few of them consider incorporating specific RNA secondary structures, limiting their interpretability for in-depth post-transcriptional analysis. In this work, we introduce a novel bijective inference network, named m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A-BIN, which integrates RNA sequences and secondary structures within a unified parameter-shared framework, enhancing the accuracy of m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A modification site identification through the auxiliary supervision of RNA secondary structures. To begin with, m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A-BIN constructs sequential and structural graphs from RNA sequences and secondary structures, respectively. Bijective mapping functions are then specifically designed to couple the procedures of graph representation learning and interpretable dependency inference, providing informative supervision for learning sequential and structural embeddings of RNA. By fusing these two types of RNA embeddings, m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A-BIN efficiently performs the identification task. The attribution phase of m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A-BIN further ascribes the prediction results to nucleotide dependencies acquired during the interpretable dependency inference, including RNA sequence and structural patterns, thereby enhancing its interpretability. Extensive experimental results demonstrate the promising performance of m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A-BIN, showcasing its efficacy in terms of both accuracy and interpretability for the identification of novel m<span><math><msup><mrow></mrow><mrow><mn>6</mn></mrow></msup></math></span>A modification sites.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"164 ","pages":"Article 111541"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325002018","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The accurate identification of N-methyladenosine (mA) modification sites is crucial for unraveling various functional mechanisms. While existing methods primarily focus on learning high-quality embeddings of RNA sequences for this task, few of them consider incorporating specific RNA secondary structures, limiting their interpretability for in-depth post-transcriptional analysis. In this work, we introduce a novel bijective inference network, named mA-BIN, which integrates RNA sequences and secondary structures within a unified parameter-shared framework, enhancing the accuracy of mA modification site identification through the auxiliary supervision of RNA secondary structures. To begin with, mA-BIN constructs sequential and structural graphs from RNA sequences and secondary structures, respectively. Bijective mapping functions are then specifically designed to couple the procedures of graph representation learning and interpretable dependency inference, providing informative supervision for learning sequential and structural embeddings of RNA. By fusing these two types of RNA embeddings, mA-BIN efficiently performs the identification task. The attribution phase of mA-BIN further ascribes the prediction results to nucleotide dependencies acquired during the interpretable dependency inference, including RNA sequence and structural patterns, thereby enhancing its interpretability. Extensive experimental results demonstrate the promising performance of mA-BIN, showcasing its efficacy in terms of both accuracy and interpretability for the identification of novel mA modification sites.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.