Yuxian Luo, Haiyan Wu, Hong Wei, Zhenling Peng, Jianyi Yang
{"title":"Predicting the Oligomeric State of Proteins Using Multiple Templates Detected by Complementary Alignment Methods.","authors":"Yuxian Luo, Haiyan Wu, Hong Wei, Zhenling Peng, Jianyi Yang","doi":"10.1002/prot.70017","DOIUrl":null,"url":null,"abstract":"<p><p>Recognizing the oligomeric state of proteins is crucial for understanding the structure and function of proteins. In the CASP16 experiment, a two-stage prediction is proposed to challenge structure predictors, in which the oligomeric state is unknown at the first stage. The correct prediction of the oligomeric state plays a vital role in the subsequent step of structure prediction. To this end, we introduce POST, a new approach to the prediction of oligomeric state for homo-oligomers using multiple templates, specifically focusing on four states: monomer, dimer, trimer, and tetramer. POST employs three different algorithms, including dynamic programming, protein language model, and hidden Markov model, to detect homologous templates from an in-house template library (i.e., Q-BioLiP). These algorithms lead to three individual methods for oligomeric state prediction. Assessment on two independent datasets and 107 targets from CASP14 and CASP15 suggests that the templates detected by these methods are largely complementary. A combination of the templates from all individual methods results in the most accurate prediction. POST outperforms other sequence-based methods in predicting specific oligomeric states of proteins and distinguishing multimers from monomers, although it is inferior to other structure-based methods. Overall, POST is anticipated to be helpful in protein structure prediction and protein design.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.70017","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Recognizing the oligomeric state of proteins is crucial for understanding the structure and function of proteins. In the CASP16 experiment, a two-stage prediction is proposed to challenge structure predictors, in which the oligomeric state is unknown at the first stage. The correct prediction of the oligomeric state plays a vital role in the subsequent step of structure prediction. To this end, we introduce POST, a new approach to the prediction of oligomeric state for homo-oligomers using multiple templates, specifically focusing on four states: monomer, dimer, trimer, and tetramer. POST employs three different algorithms, including dynamic programming, protein language model, and hidden Markov model, to detect homologous templates from an in-house template library (i.e., Q-BioLiP). These algorithms lead to three individual methods for oligomeric state prediction. Assessment on two independent datasets and 107 targets from CASP14 and CASP15 suggests that the templates detected by these methods are largely complementary. A combination of the templates from all individual methods results in the most accurate prediction. POST outperforms other sequence-based methods in predicting specific oligomeric states of proteins and distinguishing multimers from monomers, although it is inferior to other structure-based methods. Overall, POST is anticipated to be helpful in protein structure prediction and protein design.
期刊介绍:
PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.