Predicting the Oligomeric State of Proteins Using Multiple Templates Detected by Complementary Alignment Methods.

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics Pub Date : 2025-07-10 DOI:10.1002/prot.70017

Yuxian Luo, Haiyan Wu, Hong Wei, Zhenling Peng, Jianyi Yang

{"title":"Predicting the Oligomeric State of Proteins Using Multiple Templates Detected by Complementary Alignment Methods.","authors":"Yuxian Luo, Haiyan Wu, Hong Wei, Zhenling Peng, Jianyi Yang","doi":"10.1002/prot.70017","DOIUrl":null,"url":null,"abstract":"<p><p>Recognizing the oligomeric state of proteins is crucial for understanding the structure and function of proteins. In the CASP16 experiment, a two-stage prediction is proposed to challenge structure predictors, in which the oligomeric state is unknown at the first stage. The correct prediction of the oligomeric state plays a vital role in the subsequent step of structure prediction. To this end, we introduce POST, a new approach to the prediction of oligomeric state for homo-oligomers using multiple templates, specifically focusing on four states: monomer, dimer, trimer, and tetramer. POST employs three different algorithms, including dynamic programming, protein language model, and hidden Markov model, to detect homologous templates from an in-house template library (i.e., Q-BioLiP). These algorithms lead to three individual methods for oligomeric state prediction. Assessment on two independent datasets and 107 targets from CASP14 and CASP15 suggests that the templates detected by these methods are largely complementary. A combination of the templates from all individual methods results in the most accurate prediction. POST outperforms other sequence-based methods in predicting specific oligomeric states of proteins and distinguishing multimers from monomers, although it is inferior to other structure-based methods. Overall, POST is anticipated to be helpful in protein structure prediction and protein design.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.70017","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Recognizing the oligomeric state of proteins is crucial for understanding the structure and function of proteins. In the CASP16 experiment, a two-stage prediction is proposed to challenge structure predictors, in which the oligomeric state is unknown at the first stage. The correct prediction of the oligomeric state plays a vital role in the subsequent step of structure prediction. To this end, we introduce POST, a new approach to the prediction of oligomeric state for homo-oligomers using multiple templates, specifically focusing on four states: monomer, dimer, trimer, and tetramer. POST employs three different algorithms, including dynamic programming, protein language model, and hidden Markov model, to detect homologous templates from an in-house template library (i.e., Q-BioLiP). These algorithms lead to three individual methods for oligomeric state prediction. Assessment on two independent datasets and 107 targets from CASP14 and CASP15 suggests that the templates detected by these methods are largely complementary. A combination of the templates from all individual methods results in the most accurate prediction. POST outperforms other sequence-based methods in predicting specific oligomeric states of proteins and distinguishing multimers from monomers, although it is inferior to other structure-based methods. Overall, POST is anticipated to be helpful in protein structure prediction and protein design.

查看原文本刊更多论文

利用互补比对方法检测多个模板预测蛋白质的寡聚状态。

认识蛋白质的寡聚态对于理解蛋白质的结构和功能至关重要。在CASP16实验中，我们提出了一种两阶段预测方法来挑战第一阶段寡聚状态未知的结构预测方法。寡聚态的正确预测对后续的结构预测起着至关重要的作用。为此，我们介绍了POST，一种利用多模板预测同源低聚物状态的新方法，特别关注四种状态：单体、二聚体、三聚体和四聚体。POST采用动态规划、蛋白质语言模型和隐马尔可夫模型三种不同的算法，从内部模板库（即Q-BioLiP）中检测同源模板。这些算法导致了三种不同的低聚物状态预测方法。对两个独立的数据集和来自CASP14和CASP15的107个靶点的评估表明，这些方法检测到的模板在很大程度上是互补的。所有单独方法的模板组合将产生最准确的预测。POST在预测蛋白质的特定寡聚状态和区分多聚体和单体方面优于其他基于序列的方法，尽管它不如其他基于结构的方法。总的来说，POST有望在蛋白质结构预测和蛋白质设计方面有所帮助。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proteins-Structure Function and Bioinformatics 生物-生化与分子生物学

CiteScore

5.90

自引率

3.40%

发文量

172

审稿时长

3 months

期刊介绍： PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.