REVOLUTIONIZING ANTIBODY DISCOVERY INDUSTRY WITH HIGHLY EFFICIENT AND ACCURATE AI-BASED EPITOPE-SPECIFIC ANTIBODY DE NOVO DESIGN WORKFLOW

Q2 Medicine
Tianyuan Wang, Xiangrui Gao, Zhe Huai, Zhaohui Gong, Ting Mao, Xuezhe Fan, Xingxing Wu, Zhiyuan Duan, Xiaodong Wang, Jiewen Du, Mengcheng Yao, Xin Li, Min Wu, Zonghu Wang, Lin Zhang, Junjie Zhang, Wenbo Cao, Kai Yan, Yujie Fang, Shixiang Ma, Kun Yang, Lili Wu, F. An, Yezhou Yang, L. Lai, Xiaolu Huang
{"title":"REVOLUTIONIZING ANTIBODY DISCOVERY INDUSTRY WITH HIGHLY EFFICIENT AND ACCURATE AI-BASED EPITOPE-SPECIFIC ANTIBODY DE NOVO DESIGN WORKFLOW","authors":"Tianyuan Wang, Xiangrui Gao, Zhe Huai, Zhaohui Gong, Ting Mao, Xuezhe Fan, Xingxing Wu, Zhiyuan Duan, Xiaodong Wang, Jiewen Du, Mengcheng Yao, Xin Li, Min Wu, Zonghu Wang, Lin Zhang, Junjie Zhang, Wenbo Cao, Kai Yan, Yujie Fang, Shixiang Ma, Kun Yang, Lili Wu, F. An, Yezhou Yang, L. Lai, Xiaolu Huang","doi":"10.1093/abt/tbad014.024","DOIUrl":null,"url":null,"abstract":"Abstract Background and significance The global antibody drug market is worth over $200 billion in 2021 and is expected to reach $380 billion by 2030. Antibody discovery is one of the most critical steps that determine the crucial properties of antibody drugs, such as efficacy, safety, and developability. Traditional methods based on mouse immunization have many drawbacks limiting drug discovery, which include long time periods, high costs, inability to target function-specific epitopes, unsuitable for low immunogenic and difficult-to-prepare antigens, the need to sacrifice mice, the need for further humanization to reduce immunogenicity, and so on. Here we report an antibody de novo design computational workflow that utilizes high-quality internally produced antibody data and advanced AI models. Using this workflow, we can de novo design antibodies that bind to user-specified functional epitopes with high affinity and specificity. Compared with classical wet-lab methods, the entire process is shortened from several months to several days and suitable for low immunogenicity and difficult-to-prepare antigens. It is particularly noteworthy that due to the use of humanized mouse-generated antibodies (Renlite bearing common light chain from Biocytogen) as training data for AI models, the designed antibodies have a high degree of humanization and good developability, effectively avoiding issues such as ADA and aggregation in subsequent processes. Methods First, with the help of Renlite, we comprehensively combined mouse immunization, B cell sorting with FACS, NGS single-cell sequencing, and bioinformatics analysis to internally generate a large amount of high-quality antibody sequence data. Second, we developed AI models for antigen-specific antibody selection and epitope prediction (bioRxiv, 2022: 2022.12. 22.521634.) to mine antigen-specific antibodies and corresponding antigen epitopes in the data. Based on the processed high-quality data, we trained an affinity prediction model that can accurately predict whether an antigen epitope and antibody sequence pair can bind to each other. Besides, using the sequence data, we trained an antibody sequence pre-training language model (bioRxiv, 2023: 2023.01. 19.524683.), which can generate high-quality antibody sequences to simulate the antibodies produced by mouse immunization. Finally, integrating the above AI models, we established an antibody de novo design computational workflow to simulate the biological process of antibody generation and affinity maturation in the mouse immune system, which can be seen as a “DigitalMouse”. Results In a test case, 1 million antibodies were designed aiming at binding to specific epitope of an antigen. 10 antibodies were selected and expressed. Binding affinity was determined using BLI. Two antibodies out of 10 had KD of 194 nM and 336 nM, respectively, with a concentration dependent signal increase on BLI. These antibodies have great potential as the starting point of candidate molecules for further in vitro, in vivo experimental validation and clinical trials. Conclusions The AI-based antibody de novo design workflow will revolutionize the antibody discovery industry paradigm, greatly shorten the antibody discovery phase, reduce R&D costs, and expand antibody discovery to more antigen targets that are difficult with animal immunization. The computational workflow will have a profound impact on the entire biopharmaceutical industry.","PeriodicalId":36655,"journal":{"name":"Antibody Therapeutics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Antibody Therapeutics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/abt/tbad014.024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Background and significance The global antibody drug market is worth over $200 billion in 2021 and is expected to reach $380 billion by 2030. Antibody discovery is one of the most critical steps that determine the crucial properties of antibody drugs, such as efficacy, safety, and developability. Traditional methods based on mouse immunization have many drawbacks limiting drug discovery, which include long time periods, high costs, inability to target function-specific epitopes, unsuitable for low immunogenic and difficult-to-prepare antigens, the need to sacrifice mice, the need for further humanization to reduce immunogenicity, and so on. Here we report an antibody de novo design computational workflow that utilizes high-quality internally produced antibody data and advanced AI models. Using this workflow, we can de novo design antibodies that bind to user-specified functional epitopes with high affinity and specificity. Compared with classical wet-lab methods, the entire process is shortened from several months to several days and suitable for low immunogenicity and difficult-to-prepare antigens. It is particularly noteworthy that due to the use of humanized mouse-generated antibodies (Renlite bearing common light chain from Biocytogen) as training data for AI models, the designed antibodies have a high degree of humanization and good developability, effectively avoiding issues such as ADA and aggregation in subsequent processes. Methods First, with the help of Renlite, we comprehensively combined mouse immunization, B cell sorting with FACS, NGS single-cell sequencing, and bioinformatics analysis to internally generate a large amount of high-quality antibody sequence data. Second, we developed AI models for antigen-specific antibody selection and epitope prediction (bioRxiv, 2022: 2022.12. 22.521634.) to mine antigen-specific antibodies and corresponding antigen epitopes in the data. Based on the processed high-quality data, we trained an affinity prediction model that can accurately predict whether an antigen epitope and antibody sequence pair can bind to each other. Besides, using the sequence data, we trained an antibody sequence pre-training language model (bioRxiv, 2023: 2023.01. 19.524683.), which can generate high-quality antibody sequences to simulate the antibodies produced by mouse immunization. Finally, integrating the above AI models, we established an antibody de novo design computational workflow to simulate the biological process of antibody generation and affinity maturation in the mouse immune system, which can be seen as a “DigitalMouse”. Results In a test case, 1 million antibodies were designed aiming at binding to specific epitope of an antigen. 10 antibodies were selected and expressed. Binding affinity was determined using BLI. Two antibodies out of 10 had KD of 194 nM and 336 nM, respectively, with a concentration dependent signal increase on BLI. These antibodies have great potential as the starting point of candidate molecules for further in vitro, in vivo experimental validation and clinical trials. Conclusions The AI-based antibody de novo design workflow will revolutionize the antibody discovery industry paradigm, greatly shorten the antibody discovery phase, reduce R&D costs, and expand antibody discovery to more antigen targets that are difficult with animal immunization. The computational workflow will have a profound impact on the entire biopharmaceutical industry.
通过高效准确的基于AI的表位特异性抗体从头设计工作流程,彻底改变抗体发现行业
背景与意义2021年全球抗体药物市场价值超过2000亿美元,预计到2030年将达到3800亿美元。抗体发现是决定抗体药物关键特性(如有效性、安全性和可开发性)的最关键步骤之一。传统的基于小鼠免疫的方法存在许多限制药物发现的缺点,包括时间长、成本高、无法靶向功能特异性表位、不适合低免疫原性和难以制备的抗原、需要牺牲小鼠、需要进一步人源化以降低免疫原性等。在这里,我们报告了一个抗体从头设计计算工作流,利用高质量的内部产生的抗体数据和先进的人工智能模型。利用这一工作流程,我们可以重新设计与用户指定的功能表位结合的抗体,具有高亲和力和特异性。与传统的湿法相比,整个过程从几个月缩短到几天,适用于低免疫原性和难以制备的抗原。特别值得注意的是,由于采用人源化小鼠产生的抗体(Renlite带有Biocytogen的共轻链)作为AI模型的训练数据,设计的抗体人源化程度高,可开发性好,有效避免了后续流程中的ADA和聚集等问题。方法首先,借助Renlite,综合结合小鼠免疫、B细胞分选与FACS、NGS单细胞测序、生物信息学分析,内部生成大量高质量的抗体序列数据。其次,我们开发了抗原特异性抗体选择和表位预测的AI模型(bioRxiv, 2022: 2022.12)。22.521634.)从数据中挖掘抗原特异性抗体及相应抗原表位。基于处理后的高质量数据,我们训练了一个亲和力预测模型,可以准确预测抗原表位和抗体序列对是否可以相互结合。此外,利用序列数据,我们训练了一个抗体序列预训练语言模型(bioRxiv, 2023: 2023.01)。19.524683.),可以生成高质量的抗体序列,模拟小鼠免疫产生的抗体。最后,结合上述AI模型,我们建立了抗体从头设计计算工作流,模拟小鼠免疫系统中抗体产生和亲和成熟的生物学过程,可以看作是一个“DigitalMouse”。结果在一个试验案例中,设计了100万种针对抗原特异性表位结合的抗体。选择10个抗体进行表达。用BLI测定结合亲和力。10个抗体中有2个抗体的KD分别为194 nM和336 nM, BLI的浓度依赖性信号增加。这些抗体有很大的潜力作为候选分子的起点进行进一步的体外、体内实验验证和临床试验。结论基于人工智能的抗体从头设计工作流程将彻底改变抗体发现行业的模式,大大缩短抗体发现阶段,降低研发成本,并将抗体发现扩展到更多难以通过动物免疫的抗原靶点。计算工作流将对整个生物制药行业产生深远的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Antibody Therapeutics
Antibody Therapeutics Medicine-Immunology and Allergy
CiteScore
8.70
自引率
0.00%
发文量
30
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信