MESM：整合多源数据，通过多模态语言模型进行高精度蛋白质-蛋白质相互作用预测。

IF 4.5 1区生物学 Q1 BIOLOGY

BMC Biology Pub Date : 2025-08-11 DOI:10.1186/s12915-025-02356-y

Feng Wang, Jinming Chu, Liyan Shen, Shan Chang

{"title":"MESM：整合多源数据，通过多模态语言模型进行高精度蛋白质-蛋白质相互作用预测。","authors":"Feng Wang, Jinming Chu, Liyan Shen, Shan Chang","doi":"10.1186/s12915-025-02356-y","DOIUrl":null,"url":null,"abstract":"Background: Protein-protein interactions (PPIs) play a critical role in essential biological processes such as signal transduction, enzyme activity regulation, cytoskeletal structure, immune responses, and gene regulation. However, current methods mainly focus on extracting features from protein sequences and using graph neural network (GNN) to acquire interaction information from the PPI network graph. This limits the model's ability to learn richer and more effective interaction information, thereby affecting prediction performance.Results: In this study, we propose a novel deep learning method, MESM, for effectively predicting PPI. The datasets used for the PPI prediction task were primarily constructed from the STRING database, including two Homo sapiens PPI datasets, SHS27k and SHS148k, and two Saccharomyces cerevisiae PPI datasets, SYS30k and SYS60k. MESM consists of three key modules, as follows: First, MESM extracts multimodal representations from protein sequence information, protein structure information, and point cloud features through Sequence Variational Autoencoder (SVAE), Variational Graph Autoencoder (VGAE), and PointNet Autoencoder (PAE). Then, Fusion Autoencoder (FAE) is used to integrate these multimodal features, generating rich and balanced protein representations. Next, MESM leverages GraphGPS to learn structural information from the PPI network graph structure and combines Graph Attention Network (GAT) to further capture protein interaction information. Finally, MESM uses Graph Convolutional Network (GCN) and SubgraphGCN to extract global and local features from the perspective of the overall graph and subgraphs. Moreover, we build seven independent graphs from the overall PPI network graph to specifically learn the features of each PPI type, thereby enhancing the model's learning ability for different types of interactions.Conclusions: Compared to the state-of-the-art methods, MESM achieved improvements of 8.77%, 4.98%, 7.48%, and 6.08% on SHS27k, SHS148k, SYS30k, and SYS60k, respectively. The experimental results demonstrate that MESM exhibits significant improvements in PPI prediction performance.","PeriodicalId":9339,"journal":{"name":"BMC Biology","volume":"23 1","pages":"253"},"PeriodicalIF":4.5000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337455/pdf/","citationCount":"0","resultStr":"{\"title\":\"MESM: integrating multi-source data for high-accuracy protein-protein interactions prediction through multimodal language models.\",\"authors\":\"Feng Wang, Jinming Chu, Liyan Shen, Shan Chang\",\"doi\":\"10.1186/s12915-025-02356-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Protein-protein interactions (PPIs) play a critical role in essential biological processes such as signal transduction, enzyme activity regulation, cytoskeletal structure, immune responses, and gene regulation. However, current methods mainly focus on extracting features from protein sequences and using graph neural network (GNN) to acquire interaction information from the PPI network graph. This limits the model's ability to learn richer and more effective interaction information, thereby affecting prediction performance.Results: In this study, we propose a novel deep learning method, MESM, for effectively predicting PPI. The datasets used for the PPI prediction task were primarily constructed from the STRING database, including two Homo sapiens PPI datasets, SHS27k and SHS148k, and two Saccharomyces cerevisiae PPI datasets, SYS30k and SYS60k. MESM consists of three key modules, as follows: First, MESM extracts multimodal representations from protein sequence information, protein structure information, and point cloud features through Sequence Variational Autoencoder (SVAE), Variational Graph Autoencoder (VGAE), and PointNet Autoencoder (PAE). Then, Fusion Autoencoder (FAE) is used to integrate these multimodal features, generating rich and balanced protein representations. Next, MESM leverages GraphGPS to learn structural information from the PPI network graph structure and combines Graph Attention Network (GAT) to further capture protein interaction information. Finally, MESM uses Graph Convolutional Network (GCN) and SubgraphGCN to extract global and local features from the perspective of the overall graph and subgraphs. Moreover, we build seven independent graphs from the overall PPI network graph to specifically learn the features of each PPI type, thereby enhancing the model's learning ability for different types of interactions.Conclusions: Compared to the state-of-the-art methods, MESM achieved improvements of 8.77%, 4.98%, 7.48%, and 6.08% on SHS27k, SHS148k, SYS30k, and SYS60k, respectively. The experimental results demonstrate that MESM exhibits significant improvements in PPI prediction performance.\",\"PeriodicalId\":9339,\"journal\":{\"name\":\"BMC Biology\",\"volume\":\"23 1\",\"pages\":\"253\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337455/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12915-025-02356-y\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12915-025-02356-y","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

背景：蛋白-蛋白相互作用（PPIs）在信号转导、酶活性调节、细胞骨架结构、免疫反应和基因调控等基本生物过程中发挥着关键作用。然而，目前的方法主要集中在从蛋白质序列中提取特征，并利用图神经网络（GNN）从PPI网络图中获取相互作用信息。这限制了模型学习更丰富、更有效的交互信息的能力，从而影响了预测性能。结果：在本研究中，我们提出了一种新的深度学习方法MESM，用于有效预测PPI。用于PPI预测任务的数据集主要来自STRING数据库，包括两个智人PPI数据集SHS27k和SHS148k，以及两个酿酒酵母PPI数据集SYS30k和SYS60k。MESM包括三个关键模块：首先，MESM通过序列变分自编码器（sequence Variational Autoencoder， SVAE）、变分图自编码器（Variational Graph Autoencoder， VGAE）和点网自编码器（PointNet Autoencoder， PAE）从蛋白质序列信息、蛋白质结构信息和点云特征中提取多模态表示。然后，使用融合自编码器（FAE）对这些多模态特征进行整合，生成丰富而平衡的蛋白质表示。接下来，MESM利用GraphGPS从PPI网络的图结构中学习结构信息，并结合图注意网络（graph Attention network， GAT）进一步捕获蛋白质相互作用信息。最后，利用图卷积网络（GCN）和子图卷积网络（SubgraphGCN）从整体图和子图的角度提取全局和局部特征。此外，我们从整体PPI网络图中构建了7个独立的图，专门学习每种PPI类型的特征，从而增强了模型对不同类型交互的学习能力。结论：与现有方法相比，MESM对SHS27k、SHS148k、SYS30k和SYS60k分别提高了8.77%、4.98%、7.48%和6.08%。实验结果表明，MESM在PPI预测性能上有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

MESM: integrating multi-source data for high-accuracy protein-protein interactions prediction through multimodal language models.

Background: Protein-protein interactions (PPIs) play a critical role in essential biological processes such as signal transduction, enzyme activity regulation, cytoskeletal structure, immune responses, and gene regulation. However, current methods mainly focus on extracting features from protein sequences and using graph neural network (GNN) to acquire interaction information from the PPI network graph. This limits the model's ability to learn richer and more effective interaction information, thereby affecting prediction performance.

Results: In this study, we propose a novel deep learning method, MESM, for effectively predicting PPI. The datasets used for the PPI prediction task were primarily constructed from the STRING database, including two Homo sapiens PPI datasets, SHS27k and SHS148k, and two Saccharomyces cerevisiae PPI datasets, SYS30k and SYS60k. MESM consists of three key modules, as follows: First, MESM extracts multimodal representations from protein sequence information, protein structure information, and point cloud features through Sequence Variational Autoencoder (SVAE), Variational Graph Autoencoder (VGAE), and PointNet Autoencoder (PAE). Then, Fusion Autoencoder (FAE) is used to integrate these multimodal features, generating rich and balanced protein representations. Next, MESM leverages GraphGPS to learn structural information from the PPI network graph structure and combines Graph Attention Network (GAT) to further capture protein interaction information. Finally, MESM uses Graph Convolutional Network (GCN) and SubgraphGCN to extract global and local features from the perspective of the overall graph and subgraphs. Moreover, we build seven independent graphs from the overall PPI network graph to specifically learn the features of each PPI type, thereby enhancing the model's learning ability for different types of interactions.

Conclusions: Compared to the state-of-the-art methods, MESM achieved improvements of 8.77%, 4.98%, 7.48%, and 6.08% on SHS27k, SHS148k, SYS30k, and SYS60k, respectively. The experimental results demonstrate that MESM exhibits significant improvements in PPI prediction performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

BMC Biology 生物-生物学

CiteScore

7.80

自引率

1.90%

发文量

260

审稿时长

3 months

期刊介绍： BMC Biology is a broad scope journal covering all areas of biology. Our content includes research articles, new methods and tools. BMC Biology also publishes reviews, Q&A, and commentaries.