Accelerating antibody development: sequence and structure-based models for predicting developability properties via size exclusion chromatography.

IF 7.3 2区 医学 Q1 MEDICINE, RESEARCH & EXPERIMENTAL
mAbs Pub Date : 2025-12-01 Epub Date: 2025-09-26 DOI:10.1080/19420862.2025.2562997
A N M Nafiz Abeer, Mehdi Boroumand, Isabelle Sermadiras, Jenna G Caldwell, Valentin Stanev, Neil Mody, Gilad Kaplan, James Savery, Rebecca Croasdale-Wood, Maryam Pouryahya
{"title":"Accelerating antibody development: sequence and structure-based models for predicting developability properties via size exclusion chromatography.","authors":"A N M Nafiz Abeer, Mehdi Boroumand, Isabelle Sermadiras, Jenna G Caldwell, Valentin Stanev, Neil Mody, Gilad Kaplan, James Savery, Rebecca Croasdale-Wood, Maryam Pouryahya","doi":"10.1080/19420862.2025.2562997","DOIUrl":null,"url":null,"abstract":"<p><p>Experimental screening for biopharmaceutical developability properties typically relies on resource-intensive, and time-consuming assays such as size exclusion chromatography (SEC). This study highlights the potential of in silico models to accelerate the screening process by exploring sequence and structure-based machine learning techniques. Specifically, we compared surrogate models based on pre-computed features extracted from sequence and predicted structure with sequence-based approaches using protein language models (PLMs) like ESM-2. In addition to different end-to-end fine-tuning strategies for PLM, we have also investigated the integration of the structural information of the antibodies into the prediction pipeline through graph neural networks (GNN). We applied these different methods for predicting protein aggregation propensity using a dataset of approximately 1200 Immunoglobulin G (IgG1) molecules. Through this empirical evaluation, our study identifies the most effective in silico approach for predicting developability properties for SEC assays, thereby adding insights to existing screening efforts for accelerating the antibody development process.</p>","PeriodicalId":18206,"journal":{"name":"mAbs","volume":"17 1","pages":"2562997"},"PeriodicalIF":7.3000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12477876/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mAbs","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/19420862.2025.2562997","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/26 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Experimental screening for biopharmaceutical developability properties typically relies on resource-intensive, and time-consuming assays such as size exclusion chromatography (SEC). This study highlights the potential of in silico models to accelerate the screening process by exploring sequence and structure-based machine learning techniques. Specifically, we compared surrogate models based on pre-computed features extracted from sequence and predicted structure with sequence-based approaches using protein language models (PLMs) like ESM-2. In addition to different end-to-end fine-tuning strategies for PLM, we have also investigated the integration of the structural information of the antibodies into the prediction pipeline through graph neural networks (GNN). We applied these different methods for predicting protein aggregation propensity using a dataset of approximately 1200 Immunoglobulin G (IgG1) molecules. Through this empirical evaluation, our study identifies the most effective in silico approach for predicting developability properties for SEC assays, thereby adding insights to existing screening efforts for accelerating the antibody development process.

加速抗体开发:基于序列和结构的模型,通过尺寸排斥色谱法预测可显影性。
生物药物显影性的实验筛选通常依赖于资源密集型和耗时的分析,如尺寸排除色谱(SEC)。这项研究强调了通过探索基于序列和结构的机器学习技术来加速筛选过程的硅模型的潜力。具体来说,我们比较了基于预先计算的从序列中提取的特征和预测结构的替代模型与使用蛋白质语言模型(PLMs)(如ESM-2)的基于序列的方法。除了不同的PLM端到端微调策略外,我们还研究了通过图神经网络(GNN)将抗体的结构信息整合到预测管道中的方法。我们使用大约1200个免疫球蛋白G (IgG1)分子的数据集,应用这些不同的方法来预测蛋白质聚集倾向。通过这一实证评估,我们的研究确定了最有效的预测SEC分析可开发性的计算机方法,从而为现有的加速抗体开发过程的筛选工作增加了见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
mAbs
mAbs 工程技术-仪器仪表
CiteScore
10.70
自引率
11.30%
发文量
77
审稿时长
6-12 weeks
期刊介绍: mAbs is a multi-disciplinary journal dedicated to the art and science of antibody research and development. The journal has a strong scientific and medical focus, but also strives to serve a broader readership. The articles are thus of interest to scientists, clinical researchers, and physicians, as well as the wider mAb community, including our readers involved in technology transfer, legal issues, investment, strategic planning and the regulation of therapeutics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信