Teng Ma,Mingjian Jiang,Shunpeng Pang,Zhi Zhang,Huaibin Hang,Wei Zhou,Yuanyuan Zhang
{"title":"SeqMG-RPI:一个基于序列的框架,集成了多尺度RNA特征和蛋白质图,用于RNA-蛋白质相互作用预测。","authors":"Teng Ma,Mingjian Jiang,Shunpeng Pang,Zhi Zhang,Huaibin Hang,Wei Zhou,Yuanyuan Zhang","doi":"10.1021/acs.jcim.5c00176","DOIUrl":null,"url":null,"abstract":"RNA-protein interaction (RPI) plays a crucial role in cell biology, and accurate prediction of RPI is essential to understand molecular mechanisms and advance disease research. Some existing RPI prediction methods typically rely on a single feature and there is significant room for improvement. In this paper, we propose a novel sequence-based RPI prediction method, called SeqMG-RPI. For RNA, SeqMG-RPI introduces an innovative multi-scale RNA feature that integrates three sequence-based representations: a multi-channel RNA feature, a k-mer frequency feature, and a k-mer sparse matrix feature. For protein, SeqMG-RPI utilizes a graph-based protein feature to capture protein information. Moreover, a novel neural network architecture is constructed for feature extraction and RPI prediction. Through experiments from multiple perspectives across various datasets, it is demonstrated that the proposed method outperforms existing methods, which has better performance and generalization.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"13 1","pages":""},"PeriodicalIF":5.6000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SeqMG-RPI: A Sequence-Based Framework Integrating Multi-Scale RNA Features and Protein Graphs for RNA-Protein Interaction Prediction.\",\"authors\":\"Teng Ma,Mingjian Jiang,Shunpeng Pang,Zhi Zhang,Huaibin Hang,Wei Zhou,Yuanyuan Zhang\",\"doi\":\"10.1021/acs.jcim.5c00176\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"RNA-protein interaction (RPI) plays a crucial role in cell biology, and accurate prediction of RPI is essential to understand molecular mechanisms and advance disease research. Some existing RPI prediction methods typically rely on a single feature and there is significant room for improvement. In this paper, we propose a novel sequence-based RPI prediction method, called SeqMG-RPI. For RNA, SeqMG-RPI introduces an innovative multi-scale RNA feature that integrates three sequence-based representations: a multi-channel RNA feature, a k-mer frequency feature, and a k-mer sparse matrix feature. For protein, SeqMG-RPI utilizes a graph-based protein feature to capture protein information. Moreover, a novel neural network architecture is constructed for feature extraction and RPI prediction. Through experiments from multiple perspectives across various datasets, it is demonstrated that the proposed method outperforms existing methods, which has better performance and generalization.\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.5c00176\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.5c00176","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
SeqMG-RPI: A Sequence-Based Framework Integrating Multi-Scale RNA Features and Protein Graphs for RNA-Protein Interaction Prediction.
RNA-protein interaction (RPI) plays a crucial role in cell biology, and accurate prediction of RPI is essential to understand molecular mechanisms and advance disease research. Some existing RPI prediction methods typically rely on a single feature and there is significant room for improvement. In this paper, we propose a novel sequence-based RPI prediction method, called SeqMG-RPI. For RNA, SeqMG-RPI introduces an innovative multi-scale RNA feature that integrates three sequence-based representations: a multi-channel RNA feature, a k-mer frequency feature, and a k-mer sparse matrix feature. For protein, SeqMG-RPI utilizes a graph-based protein feature to capture protein information. Moreover, a novel neural network architecture is constructed for feature extraction and RPI prediction. Through experiments from multiple perspectives across various datasets, it is demonstrated that the proposed method outperforms existing methods, which has better performance and generalization.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.