{"title":"PaRPI从跨协议和跨批rna结合蛋白数据集预测rna -蛋白相互作用。","authors":"Liangchen Peng, Lijun Quan, Lingkun Meng, Zhihong Zhang, Shengju Zhang, Zhijun Zhang, Yi Zhang, Qiufeng Chen, Bei Zhang, Lexin Cao, Tingfang Wu, Qiang Lyu","doi":"10.1038/s42003-025-08807-0","DOIUrl":null,"url":null,"abstract":"<p><p>RNA-binding proteins (RBPs) play a pivotal role in the regulation of gene expression, with their interactions with RNA reflecting the biological functions and regulatory mechanisms. However, current computational methods are typically tailored to specific RBPs and depend on specific protocols and batches of biological experiments. To overcome these challenges, we propose a method called PaRPI, which aims to predict RNA-protein binding sites in a bidirectional RBP-RNA selection manner. PaRPI groups all RBP datasets based on cell lines, integrating experimental data from different protocols and batches, thereby enabling the development of a unified computational model that effectively captures both shared and distinct interaction patterns among different proteins. Our results demonstrate that PaRPI achieves exceptional performance in accurately identifying binding sites, surpassing state-of-the-art models on 261 RBP datasets from eCLIP and CLIP-seq experiments. Furthermore, PaRPI stands out for its robust generalization capabilities, uniquely able to predict interactions with previously unseen RNA and protein receptors. We also investigate the impact of disease-associated variants on RBP binding and evaluate PaRPI's components and semantic embeddings, demonstrating its capability to dissect complex interaction networks. PaRPI enables large-scale exploration of RNA-protein interactions, facilitating future studies on gene regulation and disease mechanisms.</p>","PeriodicalId":10552,"journal":{"name":"Communications Biology","volume":"8 1","pages":"1396"},"PeriodicalIF":5.1000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12485180/pdf/","citationCount":"0","resultStr":"{\"title\":\"PaRPI predicts RNA-Protein interactions from cross-protocol and cross-batch RNA-binding protein datasets.\",\"authors\":\"Liangchen Peng, Lijun Quan, Lingkun Meng, Zhihong Zhang, Shengju Zhang, Zhijun Zhang, Yi Zhang, Qiufeng Chen, Bei Zhang, Lexin Cao, Tingfang Wu, Qiang Lyu\",\"doi\":\"10.1038/s42003-025-08807-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>RNA-binding proteins (RBPs) play a pivotal role in the regulation of gene expression, with their interactions with RNA reflecting the biological functions and regulatory mechanisms. However, current computational methods are typically tailored to specific RBPs and depend on specific protocols and batches of biological experiments. To overcome these challenges, we propose a method called PaRPI, which aims to predict RNA-protein binding sites in a bidirectional RBP-RNA selection manner. PaRPI groups all RBP datasets based on cell lines, integrating experimental data from different protocols and batches, thereby enabling the development of a unified computational model that effectively captures both shared and distinct interaction patterns among different proteins. Our results demonstrate that PaRPI achieves exceptional performance in accurately identifying binding sites, surpassing state-of-the-art models on 261 RBP datasets from eCLIP and CLIP-seq experiments. Furthermore, PaRPI stands out for its robust generalization capabilities, uniquely able to predict interactions with previously unseen RNA and protein receptors. We also investigate the impact of disease-associated variants on RBP binding and evaluate PaRPI's components and semantic embeddings, demonstrating its capability to dissect complex interaction networks. PaRPI enables large-scale exploration of RNA-protein interactions, facilitating future studies on gene regulation and disease mechanisms.</p>\",\"PeriodicalId\":10552,\"journal\":{\"name\":\"Communications Biology\",\"volume\":\"8 1\",\"pages\":\"1396\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12485180/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1038/s42003-025-08807-0\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s42003-025-08807-0","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
PaRPI predicts RNA-Protein interactions from cross-protocol and cross-batch RNA-binding protein datasets.
RNA-binding proteins (RBPs) play a pivotal role in the regulation of gene expression, with their interactions with RNA reflecting the biological functions and regulatory mechanisms. However, current computational methods are typically tailored to specific RBPs and depend on specific protocols and batches of biological experiments. To overcome these challenges, we propose a method called PaRPI, which aims to predict RNA-protein binding sites in a bidirectional RBP-RNA selection manner. PaRPI groups all RBP datasets based on cell lines, integrating experimental data from different protocols and batches, thereby enabling the development of a unified computational model that effectively captures both shared and distinct interaction patterns among different proteins. Our results demonstrate that PaRPI achieves exceptional performance in accurately identifying binding sites, surpassing state-of-the-art models on 261 RBP datasets from eCLIP and CLIP-seq experiments. Furthermore, PaRPI stands out for its robust generalization capabilities, uniquely able to predict interactions with previously unseen RNA and protein receptors. We also investigate the impact of disease-associated variants on RBP binding and evaluate PaRPI's components and semantic embeddings, demonstrating its capability to dissect complex interaction networks. PaRPI enables large-scale exploration of RNA-protein interactions, facilitating future studies on gene regulation and disease mechanisms.
期刊介绍:
Communications Biology is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the biological sciences. Research papers published by the journal represent significant advances bringing new biological insight to a specialized area of research.