Convolutional graph neural network training scalability for molecular docking

2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) Pub Date : 2023-03-01 DOI:10.1109/PDP59025.2023.00042

Kevin Crampon, Alexis Giorkallos, S. Baud, L. Steffenel

{"title":"Convolutional graph neural network training scalability for molecular docking","authors":"Kevin Crampon, Alexis Giorkallos, S. Baud, L. Steffenel","doi":"10.1109/PDP59025.2023.00042","DOIUrl":null,"url":null,"abstract":"Deep learning use is growing in many numerical simulation fields, and drug discovery does not escape this trend. Indeed, before proceeding with in vitro and then in vivo experiments, drug discovery now relies on in silico techniques such as molecular docking to narrow the number of experiments and identify the best candidates. This method explores the receptor surface and the ligand's conformational space, providing numerous ligand-receptor poses. All these poses are then scored and ranked by a scoring function allowing to predict the best poses among all, then compare different ligands regarding a given receptor or different targets regarding a given ligand. Since the 2010s, numerous deep learning methods have been used to tackle this problem. Nowadays, there are two significant trends in deep learning for molecular docking: (i) the augmentation of available structural data and (ii) the use of a new kind of neural network: the graph convolutional neural networks (GCNs). In this paper, we propose the study of training scalability of a GCN-a molecular complex scoring function-on an increasing number of GPUs and with a variety of batch sizes. After a hyperparameter analysis, we achieve an 80% reduction in the training time, but this improvement sometimes involves a performance metrics degradation that the final users must ponder.","PeriodicalId":153500,"journal":{"name":"2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDP59025.2023.00042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning use is growing in many numerical simulation fields, and drug discovery does not escape this trend. Indeed, before proceeding with in vitro and then in vivo experiments, drug discovery now relies on in silico techniques such as molecular docking to narrow the number of experiments and identify the best candidates. This method explores the receptor surface and the ligand's conformational space, providing numerous ligand-receptor poses. All these poses are then scored and ranked by a scoring function allowing to predict the best poses among all, then compare different ligands regarding a given receptor or different targets regarding a given ligand. Since the 2010s, numerous deep learning methods have been used to tackle this problem. Nowadays, there are two significant trends in deep learning for molecular docking: (i) the augmentation of available structural data and (ii) the use of a new kind of neural network: the graph convolutional neural networks (GCNs). In this paper, we propose the study of training scalability of a GCN-a molecular complex scoring function-on an increasing number of GPUs and with a variety of batch sizes. After a hyperparameter analysis, we achieve an 80% reduction in the training time, but this improvement sometimes involves a performance metrics degradation that the final users must ponder.

查看原文本刊更多论文

卷积图神经网络训练分子对接的可扩展性

深度学习在许多数值模拟领域的应用正在增长，药物发现也没有逃脱这一趋势。事实上，在进行体外和体内实验之前，药物发现现在依赖于分子对接等计算机技术来缩小实验数量并确定最佳候选药物。这种方法探索受体表面和配体的构象空间，提供了许多配体-受体姿势。然后通过评分函数对所有这些姿势进行评分和排名，从而预测所有姿势中的最佳姿势，然后比较给定受体的不同配体或给定配体的不同靶标。自2010年以来，许多深度学习方法被用来解决这个问题。目前，分子对接的深度学习有两个重要的趋势:(i)可用结构数据的增加和(ii)使用一种新的神经网络:图卷积神经网络(GCNs)。在本文中，我们提出了在越来越多的gpu和各种批大小的情况下研究gcn -分子复杂评分函数的训练可扩展性。经过超参数分析后，我们将训练时间减少了80%，但是这种改进有时会导致性能指标的下降，最终用户必须考虑到这一点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)

自引率

0.00%

发文量