{"title":"通过机器学习预测适应性免疫受体特异性是一个数据生成问题。","authors":"Derek M Mason, Sai T Reddy","doi":"10.1016/j.cels.2024.11.008","DOIUrl":null,"url":null,"abstract":"<p><p>Determining the specificity of adaptive immune receptors-B cell receptors (BCRs), their secreted form antibodies, and T cell receptors (TCRs)-is critical for understanding immune responses and advancing immunotherapy and drug discovery. Immune receptors exhibit extensive diversity in their variable domains, enabling them to interact with a plethora of antigens. Despite the significant progress made by AI tools such as AlphaFold in predicting protein structures, challenges remain in accurately modeling the structure and specificity of immune receptors, primarily due to the limited availability of high-quality crystal structures and the complexity of immune receptor-antigen interactions. In this perspective, we highlight recent advancements in sequence-based and structure-based data generation for immune receptors, which are crucial for training machine learning models that predict receptor specificity. We discuss the current bottlenecks and potential future directions in generating and utilizing high-dimensional datasets for predicting and designing the specificity of antibodies and TCRs.</p>","PeriodicalId":93929,"journal":{"name":"Cell systems","volume":"15 12","pages":"1190-1197"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting adaptive immune receptor specificities by machine learning is a data generation problem.\",\"authors\":\"Derek M Mason, Sai T Reddy\",\"doi\":\"10.1016/j.cels.2024.11.008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Determining the specificity of adaptive immune receptors-B cell receptors (BCRs), their secreted form antibodies, and T cell receptors (TCRs)-is critical for understanding immune responses and advancing immunotherapy and drug discovery. Immune receptors exhibit extensive diversity in their variable domains, enabling them to interact with a plethora of antigens. Despite the significant progress made by AI tools such as AlphaFold in predicting protein structures, challenges remain in accurately modeling the structure and specificity of immune receptors, primarily due to the limited availability of high-quality crystal structures and the complexity of immune receptor-antigen interactions. In this perspective, we highlight recent advancements in sequence-based and structure-based data generation for immune receptors, which are crucial for training machine learning models that predict receptor specificity. We discuss the current bottlenecks and potential future directions in generating and utilizing high-dimensional datasets for predicting and designing the specificity of antibodies and TCRs.</p>\",\"PeriodicalId\":93929,\"journal\":{\"name\":\"Cell systems\",\"volume\":\"15 12\",\"pages\":\"1190-1197\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cell systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.cels.2024.11.008\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.cels.2024.11.008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
确定适应性免疫受体--B 细胞受体(BCR)、其分泌形式抗体和 T 细胞受体(TCR)的特异性,对于了解免疫反应、促进免疫疗法和药物发现至关重要。免疫受体的可变结构域呈现出广泛的多样性,使它们能够与大量抗原相互作用。尽管 AlphaFold 等人工智能工具在预测蛋白质结构方面取得了重大进展,但在准确模拟免疫受体的结构和特异性方面仍然存在挑战,这主要是由于高质量晶体结构的可用性有限以及免疫受体与抗原相互作用的复杂性。在本视角中,我们将重点介绍基于序列和结构的免疫受体数据生成方面的最新进展,这些进展对于训练预测受体特异性的机器学习模型至关重要。我们讨论了生成和利用高维数据集预测和设计抗体和 TCR 特异性的当前瓶颈和潜在的未来方向。
Predicting adaptive immune receptor specificities by machine learning is a data generation problem.
Determining the specificity of adaptive immune receptors-B cell receptors (BCRs), their secreted form antibodies, and T cell receptors (TCRs)-is critical for understanding immune responses and advancing immunotherapy and drug discovery. Immune receptors exhibit extensive diversity in their variable domains, enabling them to interact with a plethora of antigens. Despite the significant progress made by AI tools such as AlphaFold in predicting protein structures, challenges remain in accurately modeling the structure and specificity of immune receptors, primarily due to the limited availability of high-quality crystal structures and the complexity of immune receptor-antigen interactions. In this perspective, we highlight recent advancements in sequence-based and structure-based data generation for immune receptors, which are crucial for training machine learning models that predict receptor specificity. We discuss the current bottlenecks and potential future directions in generating and utilizing high-dimensional datasets for predicting and designing the specificity of antibodies and TCRs.