{"title":"Toward generalizable structure-based deep learning models for protein–ligand interaction prediction: Challenges and strategies","authors":"Seokhyun Moon, Wonho Zhung, Woo Youn Kim","doi":"10.1002/wcms.1705","DOIUrl":null,"url":null,"abstract":"<p>Accurate and rapid prediction of protein–ligand interactions (PLIs) is the fundamental challenge of drug discovery. Deep learning methods have been harnessed for this purpose, yet the insufficient generalizability of PLI prediction prevents their broader impact on practical applications. Here, we highlight the significance of PLI model generalizability by conceiving PLIs as a function defined on infinitely diverse protein–ligand pairs and binding poses. To delve into the generalization challenges within PLI predictions, we comprehensively explore the evaluation strategies to assess the generalizability fairly. Moreover, we categorize structure-based PLI models with leveraged strategies for learning generalizable features from structure-based PLI data. Finally, we conclude the review by emphasizing the need for accurate pose-predicting methods, which is a prerequisite for more accurate PLI predictions.</p><p>This article is categorized under:\n </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 1","pages":""},"PeriodicalIF":16.8000,"publicationDate":"2024-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wiley Interdisciplinary Reviews: Computational Molecular Science","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/wcms.1705","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate and rapid prediction of protein–ligand interactions (PLIs) is the fundamental challenge of drug discovery. Deep learning methods have been harnessed for this purpose, yet the insufficient generalizability of PLI prediction prevents their broader impact on practical applications. Here, we highlight the significance of PLI model generalizability by conceiving PLIs as a function defined on infinitely diverse protein–ligand pairs and binding poses. To delve into the generalization challenges within PLI predictions, we comprehensively explore the evaluation strategies to assess the generalizability fairly. Moreover, we categorize structure-based PLI models with leveraged strategies for learning generalizable features from structure-based PLI data. Finally, we conclude the review by emphasizing the need for accurate pose-predicting methods, which is a prerequisite for more accurate PLI predictions.
准确而快速地预测蛋白质配体相互作用(PLIs)是药物发现的基本挑战。深度学习方法已被用于这一目的,但由于 PLI 预测的普适性不足,它们无法在实际应用中产生更广泛的影响。在这里,我们通过将 PLIs 视为定义在无限多样的蛋白质配体对和结合位置上的函数,强调了 PLI 模型泛化的重要性。为了深入探讨 PLI 预测中的泛化难题,我们全面探讨了公平评估泛化能力的评价策略。此外,我们还对基于结构的 PLI 模型进行了分类,并介绍了从基于结构的 PLI 数据中学习可泛化特征的杠杆策略。最后,我们强调了精确姿势预测方法的必要性,这是更精确的 PLI 预测的先决条件,从而结束了本综述。
期刊介绍:
Computational molecular sciences harness the power of rigorous chemical and physical theories, employing computer-based modeling, specialized hardware, software development, algorithm design, and database management to explore and illuminate every facet of molecular sciences. These interdisciplinary approaches form a bridge between chemistry, biology, and materials sciences, establishing connections with adjacent application-driven fields in both chemistry and biology. WIREs Computational Molecular Science stands as a platform to comprehensively review and spotlight research from these dynamic and interconnected fields.