Mingqing Wang , Zhiwei Nie , Yonghong He , Athanasios V. Vasilakos , Qiang (Shawn) Cheng , Zhixiang Ren
{"title":"蛋白质表示和功能预测的深度学习方法:全面概述","authors":"Mingqing Wang , Zhiwei Nie , Yonghong He , Athanasios V. Vasilakos , Qiang (Shawn) Cheng , Zhixiang Ren","doi":"10.1016/j.engappai.2025.110977","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning has revolutionized protein function prediction by capturing intricate protein relationships, yet a comprehensive survey of its methodologies remains elusive. In this review, we systematically dissect recent advances by addressing three pivotal questions: (a) which modalities are most critical for various function prediction tasks, (b) which deep learning strategies optimally model these modalities, (c) what common and task-specific challenges persist. We categorize protein data into eight distinct types – from fundamental representations to specialized expert knowledge – and provide an exhaustive analysis of state-of-the-art deep learning models alongside emerging self-supervised learning strategies. Moreover, we compare the evolution of architectures across different modeling paradigms, highlighting their respective strengths and limitations. Our investigation spans over fifteen downstream tasks across five key research areas, including protein function annotation, protein–protein interactions, protein–ligand interactions, mutation effect prediction, and remote homology detection. Finally, we discuss current challenges and propose potential solutions, offering strategic guidance for data selection, methodological innovation, and future research directions in the application of deep learning to protein function prediction.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"155 ","pages":"Article 110977"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep learning methods for protein representation and function prediction: A comprehensive overview\",\"authors\":\"Mingqing Wang , Zhiwei Nie , Yonghong He , Athanasios V. Vasilakos , Qiang (Shawn) Cheng , Zhixiang Ren\",\"doi\":\"10.1016/j.engappai.2025.110977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep learning has revolutionized protein function prediction by capturing intricate protein relationships, yet a comprehensive survey of its methodologies remains elusive. In this review, we systematically dissect recent advances by addressing three pivotal questions: (a) which modalities are most critical for various function prediction tasks, (b) which deep learning strategies optimally model these modalities, (c) what common and task-specific challenges persist. We categorize protein data into eight distinct types – from fundamental representations to specialized expert knowledge – and provide an exhaustive analysis of state-of-the-art deep learning models alongside emerging self-supervised learning strategies. Moreover, we compare the evolution of architectures across different modeling paradigms, highlighting their respective strengths and limitations. Our investigation spans over fifteen downstream tasks across five key research areas, including protein function annotation, protein–protein interactions, protein–ligand interactions, mutation effect prediction, and remote homology detection. Finally, we discuss current challenges and propose potential solutions, offering strategic guidance for data selection, methodological innovation, and future research directions in the application of deep learning to protein function prediction.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"155 \",\"pages\":\"Article 110977\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625009777\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625009777","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Deep learning methods for protein representation and function prediction: A comprehensive overview
Deep learning has revolutionized protein function prediction by capturing intricate protein relationships, yet a comprehensive survey of its methodologies remains elusive. In this review, we systematically dissect recent advances by addressing three pivotal questions: (a) which modalities are most critical for various function prediction tasks, (b) which deep learning strategies optimally model these modalities, (c) what common and task-specific challenges persist. We categorize protein data into eight distinct types – from fundamental representations to specialized expert knowledge – and provide an exhaustive analysis of state-of-the-art deep learning models alongside emerging self-supervised learning strategies. Moreover, we compare the evolution of architectures across different modeling paradigms, highlighting their respective strengths and limitations. Our investigation spans over fifteen downstream tasks across five key research areas, including protein function annotation, protein–protein interactions, protein–ligand interactions, mutation effect prediction, and remote homology detection. Finally, we discuss current challenges and propose potential solutions, offering strategic guidance for data selection, methodological innovation, and future research directions in the application of deep learning to protein function prediction.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.