Reinforcement Learning of Molecule Optimization with Bayesian Neural Networks

计算分子生物学(英文) Pub Date : 2021-01-01 DOI:10.4236/cmb.2021.114005

Wei Hu

{"title":"Reinforcement Learning of Molecule Optimization with Bayesian Neural Networks","authors":"Wei Hu","doi":"10.4236/cmb.2021.114005","DOIUrl":null,"url":null,"abstract":"Creating new molecules with desired properties is a fundamental and chal-lenging problem in chemistry. Reinforcement learning (RL) has shown its utility in this area where the target chemical property values can serve as a reward signal. At each step of making a new molecule, the RL agent learns se-lecting an action from a list of many chemically valid actions for a given molecule, implying a great uncertainty associated with its learning. In a traditional implementation of deep RL algorithms, deterministic neural networks are typically employed, thus allowing the agent to choose one action from one sampled action at each step. In this paper, we proposed a new strategy of applying Bayesian neural networks to RL to reduce uncertainty so that the agent can choose one action from a pool of sampled actions at each step, and inves-tigated its benefits in molecule design. Our experiments suggested the Bayesian approach could create molecules of desirable chemical quality while maintained their diversity, a very difficult goal to achieve in machine learning of molecules. We further exploited their diversity by using them to train a generative model to yield more novel drug-like molecules, which were absent in the training molecules as we know novelty is essential for drug candidate molecules. In conclusion, Bayesian approach could offer a balance between exploitation and exploration in RL, and a balance between optimization and diversity in molecule design.","PeriodicalId":70839,"journal":{"name":"计算分子生物学(英文)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算分子生物学(英文)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4236/cmb.2021.114005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Creating new molecules with desired properties is a fundamental and chal-lenging problem in chemistry. Reinforcement learning (RL) has shown its utility in this area where the target chemical property values can serve as a reward signal. At each step of making a new molecule, the RL agent learns se-lecting an action from a list of many chemically valid actions for a given molecule, implying a great uncertainty associated with its learning. In a traditional implementation of deep RL algorithms, deterministic neural networks are typically employed, thus allowing the agent to choose one action from one sampled action at each step. In this paper, we proposed a new strategy of applying Bayesian neural networks to RL to reduce uncertainty so that the agent can choose one action from a pool of sampled actions at each step, and inves-tigated its benefits in molecule design. Our experiments suggested the Bayesian approach could create molecules of desirable chemical quality while maintained their diversity, a very difficult goal to achieve in machine learning of molecules. We further exploited their diversity by using them to train a generative model to yield more novel drug-like molecules, which were absent in the training molecules as we know novelty is essential for drug candidate molecules. In conclusion, Bayesian approach could offer a balance between exploitation and exploration in RL, and a balance between optimization and diversity in molecule design.

查看原文本刊更多论文

基于贝叶斯神经网络的分子优化强化学习

创造具有理想性质的新分子是化学中的一个基本和具有挑战性的问题。强化学习(RL)在目标化学性质值可以作为奖励信号的领域已经显示出它的实用性。在制造新分子的每一步中，RL代理都会从给定分子的许多化学有效动作列表中学习选择一个动作，这意味着它的学习存在很大的不确定性。在深度强化学习算法的传统实现中，通常采用确定性神经网络，从而允许代理在每一步从一个采样动作中选择一个动作。本文提出了一种将贝叶斯神经网络应用于强化学习的新策略，以减少不确定性，使智能体在每一步从采样动作池中选择一个动作，并研究了其在分子设计中的益处。我们的实验表明，贝叶斯方法可以在保持分子多样性的同时创造出理想的化学质量的分子，这在分子的机器学习中是很难实现的目标。我们进一步利用它们的多样性，利用它们来训练一个生成模型，以产生更多新的类药物分子，这些分子在训练分子中是不存在的，因为我们知道新颖性对候选药物分子至关重要。综上所述，贝叶斯方法可以在分子设计中实现开发与探索的平衡，在分子设计中实现优化与多样性的平衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

计算分子生物学(英文)

自引率

0.00%

发文量