Exploring Local Chemical Space in De Novo Molecular Generation Using Multi-Agent Deep Reinforcement Learning

Natural Science Pub Date : 2021-09-02 DOI:10.4236/ns.2021.139034

Wei Hu


Citations: 4

Abstract

Single-agent reinforcement learning (RL) is commonly used to learn how to play computer games, in which the agent makes one move before making the next in a sequential decision process. Recently, single-agent RL has also been applied to the design of molecules and drugs. While a single agent is a good fit for computer games, it has limitations when used in molecule design: its sequential learning makes it impossible to modify or improve previous steps while working on the current step. In this paper, we propose applying the multi-agent RL approach to molecular research, since it can optimize all sites of a molecule simultaneously. To demonstrate the validity of our approach, we chose one chemical compound, Favipiravir, and explored its local chemical space. Favipiravir is a broad-spectrum inhibitor of viral RNA polymerase and is one of the compounds currently being used in SARS-CoV-2 (COVID-19) clinical trials. Our experiments revealed the collaborative learning of a team of deep RL agents, as well as the learning of each individual agent, in the exploration of Favipiravir. In particular, our multi-agent system discovered not only the molecules near Favipiravir in chemical space, but also the learnability of each site in the string representation of Favipiravir, which is critical information for understanding the underlying mechanism that supports machine learning of molecules.
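The contrast between sequential single-agent generation and simultaneous per-site optimization can be illustrated with a toy sketch. This is not the paper's implementation: the abstract does not specify the agents' architecture, the reward, or the string vocabulary. The sketch assumes one simple epsilon-greedy bandit per string position (rather than deep RL agents), a hypothetical four-token alphabet, a made-up target string, and a shared team reward equal to the fraction of matching sites. All names (`SiteAgent`, `team_reward`, `ALPHABET`, `TARGET`) are illustrative.

```python
import random

# Hypothetical toy setup; none of these names or values come from the paper.
ALPHABET = "CNOF"   # toy token set, not a real SMILES vocabulary
TARGET = "CNOFCN"   # toy stand-in for a reference string


def team_reward(candidate: str) -> float:
    """Shared reward: fraction of sites that match the toy target."""
    matches = sum(a == b for a, b in zip(candidate, TARGET))
    return matches / len(TARGET)


class SiteAgent:
    """Epsilon-greedy bandit that owns exactly one site of the string."""

    def __init__(self, rng, epsilon=0.1, lr=0.1):
        self.rng = rng
        self.epsilon = epsilon
        self.lr = lr
        self.q = {tok: 0.0 for tok in ALPHABET}  # value estimate per token

    def greedy(self):
        """Token with the highest current value estimate."""
        return max(self.q, key=self.q.get)

    def act(self):
        """Explore with probability epsilon, otherwise act greedily."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ALPHABET)
        return self.greedy()

    def update(self, token, reward):
        """Move the chosen token's estimate toward the observed team reward."""
        self.q[token] += self.lr * (reward - self.q[token])


def train(episodes=5000, seed=0):
    rng = random.Random(seed)
    agents = [SiteAgent(rng) for _ in TARGET]
    for _ in range(episodes):
        tokens = [agent.act() for agent in agents]  # all sites chosen at once
        reward = team_reward("".join(tokens))       # one shared team signal
        for agent, token in zip(agents, tokens):
            agent.update(token, reward)
    return agents


agents = train()
best = "".join(agent.greedy() for agent in agents)
print(best, team_reward(best))
```

Because every agent revises its own site in every episode, no position is frozen by an earlier decision, which is the property the abstract attributes to the multi-agent approach. A real system would replace the toy reward with a chemistry-aware score and validate that each candidate string is a well-formed molecule.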
