Accelerating antibody development: sequence and structure-based models for predicting developability properties via size exclusion chromatography.

IF 7.3 2区医学 Q1 MEDICINE, RESEARCH & EXPERIMENTAL

mAbs Pub Date : 2025-12-01 Epub Date: 2025-09-26 DOI:10.1080/19420862.2025.2562997

A N M Nafiz Abeer, Mehdi Boroumand, Isabelle Sermadiras, Jenna G Caldwell, Valentin Stanev, Neil Mody, Gilad Kaplan, James Savery, Rebecca Croasdale-Wood, Maryam Pouryahya

{"title":"Accelerating antibody development: sequence and structure-based models for predicting developability properties via size exclusion chromatography.","authors":"A N M Nafiz Abeer, Mehdi Boroumand, Isabelle Sermadiras, Jenna G Caldwell, Valentin Stanev, Neil Mody, Gilad Kaplan, James Savery, Rebecca Croasdale-Wood, Maryam Pouryahya","doi":"10.1080/19420862.2025.2562997","DOIUrl":null,"url":null,"abstract":"<p><p>Experimental screening for biopharmaceutical developability properties typically relies on resource-intensive, and time-consuming assays such as size exclusion chromatography (SEC). This study highlights the potential of in silico models to accelerate the screening process by exploring sequence and structure-based machine learning techniques. Specifically, we compared surrogate models based on pre-computed features extracted from sequence and predicted structure with sequence-based approaches using protein language models (PLMs) like ESM-2. In addition to different end-to-end fine-tuning strategies for PLM, we have also investigated the integration of the structural information of the antibodies into the prediction pipeline through graph neural networks (GNN). We applied these different methods for predicting protein aggregation propensity using a dataset of approximately 1200 Immunoglobulin G (IgG1) molecules. Through this empirical evaluation, our study identifies the most effective in silico approach for predicting developability properties for SEC assays, thereby adding insights to existing screening efforts for accelerating the antibody development process.</p>","PeriodicalId":18206,"journal":{"name":"mAbs","volume":"17 1","pages":"2562997"},"PeriodicalIF":7.3000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12477876/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mAbs","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/19420862.2025.2562997","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/26 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Experimental screening for biopharmaceutical developability properties typically relies on resource-intensive, and time-consuming assays such as size exclusion chromatography (SEC). This study highlights the potential of in silico models to accelerate the screening process by exploring sequence and structure-based machine learning techniques. Specifically, we compared surrogate models based on pre-computed features extracted from sequence and predicted structure with sequence-based approaches using protein language models (PLMs) like ESM-2. In addition to different end-to-end fine-tuning strategies for PLM, we have also investigated the integration of the structural information of the antibodies into the prediction pipeline through graph neural networks (GNN). We applied these different methods for predicting protein aggregation propensity using a dataset of approximately 1200 Immunoglobulin G (IgG1) molecules. Through this empirical evaluation, our study identifies the most effective in silico approach for predicting developability properties for SEC assays, thereby adding insights to existing screening efforts for accelerating the antibody development process.

查看原文本刊更多论文

加速抗体开发：基于序列和结构的模型，通过尺寸排斥色谱法预测可显影性。

生物药物显影性的实验筛选通常依赖于资源密集型和耗时的分析，如尺寸排除色谱（SEC）。这项研究强调了通过探索基于序列和结构的机器学习技术来加速筛选过程的硅模型的潜力。具体来说，我们比较了基于预先计算的从序列中提取的特征和预测结构的替代模型与使用蛋白质语言模型（PLMs）（如ESM-2）的基于序列的方法。除了不同的PLM端到端微调策略外，我们还研究了通过图神经网络（GNN）将抗体的结构信息整合到预测管道中的方法。我们使用大约1200个免疫球蛋白G （IgG1）分子的数据集，应用这些不同的方法来预测蛋白质聚集倾向。通过这一实证评估，我们的研究确定了最有效的预测SEC分析可开发性的计算机方法，从而为现有的加速抗体开发过程的筛选工作增加了见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

mAbs 工程技术-仪器仪表

CiteScore

10.70

自引率

11.30%

发文量

审稿时长

6-12 weeks

期刊介绍： mAbs is a multi-disciplinary journal dedicated to the art and science of antibody research and development. The journal has a strong scientific and medical focus, but also strives to serve a broader readership. The articles are thus of interest to scientists, clinical researchers, and physicians, as well as the wider mAb community, including our readers involved in technology transfer, legal issues, investment, strategic planning and the regulation of therapeutics.