{"title":"[Research progress in mutation effect prediction based on protein language models].","authors":"Liang Zhang, Pan Tan, Liang Hong","doi":"10.13345/j.cjb.240683","DOIUrl":null,"url":null,"abstract":"<p><p>Predicting protein mutation effects is a key challenge in bioinformatics and protein engineering. Recent advancements in deep learning, particularly the development of protein language models (PLMs), have brought new opportunities to this field. This review summarizes the application of PLMs in predicting protein mutation effects, focusing on three main types of models: sequence-based models, structure-based models, and models that combine sequence and structural information. We analyze in detail the principles, advantages, and limitations of these models and discuss the application of unsupervised and supervised learning in model training. Furthermore, this paper discusses the main challenges currently faced, including the acquisition of high-quality datasets and the handling of data noise. Finally, we look ahead to future research directions, including the application prospects of emerging technologies such as multimodal fusion and few-shot learning. This review aims to provide researchers with a comprehensive perspective to further advance the prediction of protein mutation effects.</p>","PeriodicalId":21778,"journal":{"name":"Sheng wu gong cheng xue bao = Chinese journal of biotechnology","volume":"41 3","pages":"934-948"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sheng wu gong cheng xue bao = Chinese journal of biotechnology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.13345/j.cjb.240683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0
Abstract
Predicting protein mutation effects is a key challenge in bioinformatics and protein engineering. Recent advancements in deep learning, particularly the development of protein language models (PLMs), have brought new opportunities to this field. This review summarizes the application of PLMs in predicting protein mutation effects, focusing on three main types of models: sequence-based models, structure-based models, and models that combine sequence and structural information. We analyze in detail the principles, advantages, and limitations of these models and discuss the application of unsupervised and supervised learning in model training. Furthermore, this paper discusses the main challenges currently faced, including the acquisition of high-quality datasets and the handling of data noise. Finally, we look ahead to future research directions, including the application prospects of emerging technologies such as multimodal fusion and few-shot learning. This review aims to provide researchers with a comprehensive perspective to further advance the prediction of protein mutation effects.
期刊介绍:
Chinese Journal of Biotechnology (Chinese edition) , sponsored by the Institute of Microbiology, Chinese Academy of Sciences and the Chinese Society for Microbiology, is a peer-reviewed international journal. The journal is cited by many scientific databases , such as Chemical Abstract (CA), Biology Abstract (BA), MEDLINE, Russian Digest , Chinese Scientific Citation Index (CSCI), Chinese Journal Citation Report (CJCR), and Chinese Academic Journal (CD version). The Journal publishes new discoveries, techniques and developments in genetic engineering, cell engineering, enzyme engineering, biochemical engineering, tissue engineering, bioinformatics, biochips and other fields of biotechnology.