通过深度学习进行蛋白质配体结合亲和力预测的进展：数据集、数据预处理技术和模型架构的综合研究》。

IF 2.5 4区医学 Q2 PHARMACOLOGY & PHARMACY

Current drug targets Pub Date : 2024-01-01 DOI:10.2174/0113894501330963240905083020

Gelany Aly Abdelkader, Jeong-Dong Kim

{"title":"通过深度学习进行蛋白质配体结合亲和力预测的进展：数据集、数据预处理技术和模型架构的综合研究》。","authors":"Gelany Aly Abdelkader, Jeong-Dong Kim","doi":"10.2174/0113894501330963240905083020","DOIUrl":null,"url":null,"abstract":"Background: Drug discovery is a complex and expensive procedure involving several timely and costly phases through which new potential pharmaceutical compounds must pass to get approved. One of these critical steps is the identification and optimization of lead compounds, which has been made more accessible by the introduction of computational methods, including deep learning (DL) techniques. Diverse DL model architectures have been put forward to learn the vast landscape of interaction between proteins and ligands and predict their affinity, helping in the identification of lead compounds.Objective: This survey fills a gap in previous research by comprehensively analyzing the most commonly used datasets and discussing their quality and limitations. It also offers a comprehensive classification of the most recent DL methods in the context of protein-ligand binding affinity prediction (BAP), providing a fresh perspective on this evolving field.Methods: We thoroughly examine commonly used datasets for BAP and their inherent characteristics. Our exploration extends to various preprocessing steps and DL techniques, including graph neural networks, convolutional neural networks, and transformers, which are found in the literature. We conducted extensive literature research to ensure that the most recent deep learning approaches for BAP were included by the time of writing this manuscript.Results: The systematic approach used for the present study highlighted inherent challenges to BAP via DL, such as data quality, model interpretability, and explainability, and proposed considerations for future research directions. We present valuable insights to accelerate the development of more effective and reliable DL models for BAP within the research community.Conclusion: The present study can considerably enhance future research on predicting affinity between protein and ligand molecules, hence further improving the overall drug development process.","PeriodicalId":10805,"journal":{"name":"Current drug targets","volume":" ","pages":"1041-1065"},"PeriodicalIF":2.5000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11774311/pdf/","citationCount":"0","resultStr":"{\"title\":\"Advances in Protein-Ligand Binding Affinity Prediction via Deep Learning: A Comprehensive Study of Datasets, Data Preprocessing Techniques, and Model Architectures.\",\"authors\":\"Gelany Aly Abdelkader, Jeong-Dong Kim\",\"doi\":\"10.2174/0113894501330963240905083020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Drug discovery is a complex and expensive procedure involving several timely and costly phases through which new potential pharmaceutical compounds must pass to get approved. One of these critical steps is the identification and optimization of lead compounds, which has been made more accessible by the introduction of computational methods, including deep learning (DL) techniques. Diverse DL model architectures have been put forward to learn the vast landscape of interaction between proteins and ligands and predict their affinity, helping in the identification of lead compounds.Objective: This survey fills a gap in previous research by comprehensively analyzing the most commonly used datasets and discussing their quality and limitations. It also offers a comprehensive classification of the most recent DL methods in the context of protein-ligand binding affinity prediction (BAP), providing a fresh perspective on this evolving field.Methods: We thoroughly examine commonly used datasets for BAP and their inherent characteristics. Our exploration extends to various preprocessing steps and DL techniques, including graph neural networks, convolutional neural networks, and transformers, which are found in the literature. We conducted extensive literature research to ensure that the most recent deep learning approaches for BAP were included by the time of writing this manuscript.Results: The systematic approach used for the present study highlighted inherent challenges to BAP via DL, such as data quality, model interpretability, and explainability, and proposed considerations for future research directions. We present valuable insights to accelerate the development of more effective and reliable DL models for BAP within the research community.Conclusion: The present study can considerably enhance future research on predicting affinity between protein and ligand molecules, hence further improving the overall drug development process.\",\"PeriodicalId\":10805,\"journal\":{\"name\":\"Current drug targets\",\"volume\":\" \",\"pages\":\"1041-1065\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11774311/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current drug targets\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2174/0113894501330963240905083020\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current drug targets","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2174/0113894501330963240905083020","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

摘要

背景：药物发现是一个复杂而昂贵的过程，新的潜在药物化合物要想获得批准，必须经过几个及时而昂贵的阶段。其中一个关键步骤是先导化合物的鉴定和优化，而包括深度学习（DL）技术在内的计算方法的引入使这一过程变得更加容易。人们提出了多种深度学习模型架构，以学习蛋白质与配体之间的大量相互作用并预测其亲和力，从而帮助鉴定先导化合物：本调查通过全面分析最常用的数据集并讨论其质量和局限性，填补了以往研究的空白。它还对蛋白质配体结合亲和力预测中最新的 DL 方法进行了全面分类，为这一不断发展的领域提供了一个全新的视角：我们深入研究了常用的 BAP 数据集及其固有特征。我们的探索延伸到各种预处理步骤和 DL 技术，包括图神经网络、卷积神经网络和变压器等文献中出现的技术。我们进行了广泛的文献研究，以确保在撰写本手稿时，BAP 的最新深度学习方法已经包含在内：本研究采用的系统方法强调了通过 DL 进行 BAP 所面临的固有挑战，如数据质量、模型可解释性和可解释性，并提出了未来研究方向的考虑因素。我们提出了宝贵的见解，以加快研究界开发更有效、更可靠的 DL 模型，用于 BAP：本研究可极大地促进未来预测蛋白质与配体分子亲和力的研究，从而进一步改善整个药物开发过程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Advances in Protein-Ligand Binding Affinity Prediction via Deep Learning: A Comprehensive Study of Datasets, Data Preprocessing Techniques, and Model Architectures.

Background: Drug discovery is a complex and expensive procedure involving several timely and costly phases through which new potential pharmaceutical compounds must pass to get approved. One of these critical steps is the identification and optimization of lead compounds, which has been made more accessible by the introduction of computational methods, including deep learning (DL) techniques. Diverse DL model architectures have been put forward to learn the vast landscape of interaction between proteins and ligands and predict their affinity, helping in the identification of lead compounds.

Objective: This survey fills a gap in previous research by comprehensively analyzing the most commonly used datasets and discussing their quality and limitations. It also offers a comprehensive classification of the most recent DL methods in the context of protein-ligand binding affinity prediction (BAP), providing a fresh perspective on this evolving field.

Methods: We thoroughly examine commonly used datasets for BAP and their inherent characteristics. Our exploration extends to various preprocessing steps and DL techniques, including graph neural networks, convolutional neural networks, and transformers, which are found in the literature. We conducted extensive literature research to ensure that the most recent deep learning approaches for BAP were included by the time of writing this manuscript.

Results: The systematic approach used for the present study highlighted inherent challenges to BAP via DL, such as data quality, model interpretability, and explainability, and proposed considerations for future research directions. We present valuable insights to accelerate the development of more effective and reliable DL models for BAP within the research community.

Conclusion: The present study can considerably enhance future research on predicting affinity between protein and ligand molecules, hence further improving the overall drug development process.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Current drug targets 医学-药学

CiteScore

6.20

自引率

0.00%

发文量

127

审稿时长

3-8 weeks

期刊介绍： Current Drug Targets aims to cover the latest and most outstanding developments on the medicinal chemistry and pharmacology of molecular drug targets e.g. disease specific proteins, receptors, enzymes, genes. Current Drug Targets publishes guest edited thematic issues written by leaders in the field covering a range of current topics of drug targets. The journal also accepts for publication mini- & full-length review articles and drug clinical trial studies. As the discovery, identification, characterization and validation of novel human drug targets for drug discovery continues to grow; this journal is essential reading for all pharmaceutical scientists involved in drug discovery and development.