CASP16蛋白多聚体和RNA的结构建模协议与增强的msa，模型排序和深度学习。

IF 2.8 4区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY

Proteins-Structure Function and Bioinformatics Pub Date : 2025-08-01 DOI:10.1002/prot.70033

Yuki Kagaya, Tsukasa Nakamura, Jacob Verburgt, Anika Jain, Genki Terashi, Pranav Punuru, Emilia Tugolukova, Joon Hong Park, Anouka Saha, David Huang, Daisuke Kihara

{"title":"CASP16蛋白多聚体和RNA的结构建模协议与增强的msa，模型排序和深度学习。","authors":"Yuki Kagaya, Tsukasa Nakamura, Jacob Verburgt, Anika Jain, Genki Terashi, Pranav Punuru, Emilia Tugolukova, Joon Hong Park, Anouka Saha, David Huang, Daisuke Kihara","doi":"10.1002/prot.70033","DOIUrl":null,"url":null,"abstract":"We present the methods and results of our protein complex and RNA structure predictions at CASP16. Our approach integrated multiple state-of-the-art deep learning models with a consensus-based scoring method. To enhance the depth of multiple sequence alignments (MSAs), we employed a large metagenomic sequence database. Model ranking was performed with a state-of-the-art consensus ranking method, to which we added more scoring terms. These predictions were further refined manually based on literature evidence. For RNA, we adopted an ensemble approach that incorporated multiple state-of-the-art methods, centered around our NuFold framework. As a result, our KiharaLab group ranked first in protein complex prediction and third in RNA structure prediction. A detailed analysis of targets that significantly differed from those of other groups highlighted both the strengths of our MSA and scoring strategies, as well as areas requiring further improvement.","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12321240/pdf/","citationCount":"0","resultStr":"{\"title\":\"Structure Modeling Protocols for Protein Multimer and RNA in CASP16 With Enhanced MSAs, Model Ranking, and Deep Learning.\",\"authors\":\"Yuki Kagaya, Tsukasa Nakamura, Jacob Verburgt, Anika Jain, Genki Terashi, Pranav Punuru, Emilia Tugolukova, Joon Hong Park, Anouka Saha, David Huang, Daisuke Kihara\",\"doi\":\"10.1002/prot.70033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present the methods and results of our protein complex and RNA structure predictions at CASP16. Our approach integrated multiple state-of-the-art deep learning models with a consensus-based scoring method. To enhance the depth of multiple sequence alignments (MSAs), we employed a large metagenomic sequence database. Model ranking was performed with a state-of-the-art consensus ranking method, to which we added more scoring terms. These predictions were further refined manually based on literature evidence. For RNA, we adopted an ensemble approach that incorporated multiple state-of-the-art methods, centered around our NuFold framework. As a result, our KiharaLab group ranked first in protein complex prediction and third in RNA structure prediction. A detailed analysis of targets that significantly differed from those of other groups highlighted both the strengths of our MSA and scoring strategies, as well as areas requiring further improvement.\",\"PeriodicalId\":56271,\"journal\":{\"name\":\"Proteins-Structure Function and Bioinformatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12321240/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proteins-Structure Function and Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/prot.70033\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.70033","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

我们介绍了CASP16蛋白复合物和RNA结构预测的方法和结果。我们的方法将多个最先进的深度学习模型与基于共识的评分方法集成在一起。为了提高多序列比对（msa）的深度，我们使用了一个大型宏基因组序列数据库。模型排名是用最先进的共识排名方法进行的，我们增加了更多的评分项。这些预测是在文献证据的基础上进一步人工完善的。对于RNA，我们采用了一种集成方法，结合了多种最先进的方法，以NuFold框架为中心。因此，我们的KiharaLab小组在蛋白质复合物预测方面排名第一，在RNA结构预测方面排名第三。对与其他组显著不同的目标进行了详细分析，突出了我们的MSA和评分策略的优势，以及需要进一步改进的领域。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Structure Modeling Protocols for Protein Multimer and RNA in CASP16 With Enhanced MSAs, Model Ranking, and Deep Learning.

We present the methods and results of our protein complex and RNA structure predictions at CASP16. Our approach integrated multiple state-of-the-art deep learning models with a consensus-based scoring method. To enhance the depth of multiple sequence alignments (MSAs), we employed a large metagenomic sequence database. Model ranking was performed with a state-of-the-art consensus ranking method, to which we added more scoring terms. These predictions were further refined manually based on literature evidence. For RNA, we adopted an ensemble approach that incorporated multiple state-of-the-art methods, centered around our NuFold framework. As a result, our KiharaLab group ranked first in protein complex prediction and third in RNA structure prediction. A detailed analysis of targets that significantly differed from those of other groups highlighted both the strengths of our MSA and scoring strategies, as well as areas requiring further improvement.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proteins-Structure Function and Bioinformatics 生物-生化与分子生物学

CiteScore

5.90

自引率

3.40%

发文量

172

审稿时长

3 months

期刊介绍： PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.