Improved prediction of antibody and their complexes with clustered generative modelling ensembles.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Bioinformatics advances Pub Date : 2025-07-03 eCollection Date: 2025-01-01 DOI:10.1093/bioadv/vbaf161
Xiaotong Xu, Marco Giulini, Alexandre M J J Bonvin
{"title":"Improved prediction of antibody and their complexes with clustered generative modelling ensembles.","authors":"Xiaotong Xu, Marco Giulini, Alexandre M J J Bonvin","doi":"10.1093/bioadv/vbaf161","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Gaining structural insights into antibody-antigen complexes is crucial for understanding antigen recognition mechanisms and advancing therapeutic antibody design. However, accurate prediction of the structure of highly variable complementarity-determining region 3 on the antibody heavy chain (CDR-H3 loop) remains a significant challenge due to its increased length and conformational variability. While AlphaFold2-multimer (AF2) has made substantial progress in protein structure prediction, its application on antibodies and antibody-antigen complexes is limited by the weak evolutionary signals in the CDR region and the lack of structural diversity in its output.</p><p><strong>Results: </strong>To address these limitations, we propose a workflow that combines AlphaFlow to generate ensembles of potential loop conformations with integrative modelling of antibody-antigen complexes with HADDOCK. Improving the structural diversity of the H3 loop increases the success rate of subsequent docking tasks. Our analysis shows that while AF2 generally predicts accurate antibody structures, it struggles with the H3 loop. In cases where AF2 mispredicts the loop, we leverage AlphaFlow to generate ensembles of loop conformations via score-based flow matching, followed by clustering to produce a structurally diverse set of models. We demonstrate that these ensembles significantly improve antibody-antigen docking performance compared to the standard AF2 ensembles.</p><p><strong>Availability and implementation: </strong>The input datasets and codes involved in this research are available at https://github.com/haddocking/alphaflow-antibodies. All the resulting modelling data are available from Zenodo (https://zenodo.org/records/14906314).</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf161"},"PeriodicalIF":2.8000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12279294/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Gaining structural insights into antibody-antigen complexes is crucial for understanding antigen recognition mechanisms and advancing therapeutic antibody design. However, accurate prediction of the structure of highly variable complementarity-determining region 3 on the antibody heavy chain (CDR-H3 loop) remains a significant challenge due to its increased length and conformational variability. While AlphaFold2-multimer (AF2) has made substantial progress in protein structure prediction, its application on antibodies and antibody-antigen complexes is limited by the weak evolutionary signals in the CDR region and the lack of structural diversity in its output.

Results: To address these limitations, we propose a workflow that combines AlphaFlow to generate ensembles of potential loop conformations with integrative modelling of antibody-antigen complexes with HADDOCK. Improving the structural diversity of the H3 loop increases the success rate of subsequent docking tasks. Our analysis shows that while AF2 generally predicts accurate antibody structures, it struggles with the H3 loop. In cases where AF2 mispredicts the loop, we leverage AlphaFlow to generate ensembles of loop conformations via score-based flow matching, followed by clustering to produce a structurally diverse set of models. We demonstrate that these ensembles significantly improve antibody-antigen docking performance compared to the standard AF2 ensembles.

Availability and implementation: The input datasets and codes involved in this research are available at https://github.com/haddocking/alphaflow-antibodies. All the resulting modelling data are available from Zenodo (https://zenodo.org/records/14906314).

Abstract Image

Abstract Image

Abstract Image

利用聚类生成模型集成改进抗体及其复合物的预测。
动机:了解抗体-抗原复合物的结构对理解抗原识别机制和推进治疗性抗体设计至关重要。然而,由于抗体重链上高度可变的互补决定区3 (CDR-H3环)的长度和构象变异性增加,准确预测其结构仍然是一个重大挑战。虽然alphafold2 - multitimer (AF2)在蛋白质结构预测方面取得了实质性进展,但其在抗体和抗体-抗原复合物上的应用受到CDR区进化信号较弱和其输出结构缺乏多样性的限制。结果:为了解决这些限制,我们提出了一个结合AlphaFlow的工作流程,以生成潜在环路构象的集合,并结合抗体-抗原复合物与HADDOCK的综合建模。提高H3环的结构多样性,提高后续对接任务的成功率。我们的分析表明,虽然AF2通常预测准确的抗体结构,但它与H3环斗争。在AF2错误预测环路的情况下,我们利用AlphaFlow通过基于分数的流匹配来生成环路构象的集合,然后通过聚类来产生结构多样化的模型集。我们证明,与标准AF2集成相比,这些集成显着提高了抗体-抗原对接性能。可用性和实现:本研究中涉及的输入数据集和代码可在https://github.com/haddocking/alphaflow-antibodies上获得。所有生成的建模数据都可以从Zenodo (https://zenodo.org/records/14906314)获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信