Improved environmental chemistry property prediction of molecules with graph machine learning†

IF 9.3 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Green Chemistry Pub Date : 2023-08-04 DOI:10.1039/D3GC01920A
Shang Zhu, Bichlien H. Nguyen, Yingce Xia, Kali Frost, Shufang Xie, Venkatasubramanian Viswanathan and Jake A. Smith
{"title":"Improved environmental chemistry property prediction of molecules with graph machine learning†","authors":"Shang Zhu, Bichlien H. Nguyen, Yingce Xia, Kali Frost, Shufang Xie, Venkatasubramanian Viswanathan and Jake A. Smith","doi":"10.1039/D3GC01920A","DOIUrl":null,"url":null,"abstract":"<p >Rapid prediction of environmental chemistry properties is critical for the green and sustainable development of the chemical industry and drug discovery. Machine learning methods can be applied to learn the relations between chemical structures and their environmental impact. Graph machine learning, by learning the representations directly from molecular graphs, may have better predictive power than conventional feature-based models. In this work, we leveraged graph neural networks to predict the environmental chemistry properties of molecules. To systematically evaluate the model performance, we selected a representative list of datasets, ranging from solubility to reactivity, and compared them directly to commonly used methods. We found that the graph model achieved near state-of-the-art accuracy for all tasks and, for several, improved the accuracy by a large margin over conventional models that rely on human-designed chemical features. This demonstrates that graph machine learning can be a powerful tool to perform representation learning for environmental chemistry. Further, we compared the data efficiency of conventional feature-based models and graph neural networks, providing guidance for model selection dependent on the size of datasets and feature requirements.</p>","PeriodicalId":78,"journal":{"name":"Green Chemistry","volume":" 17","pages":" 6612-6617"},"PeriodicalIF":9.3000,"publicationDate":"2023-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Green Chemistry","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2023/gc/d3gc01920a","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Rapid prediction of environmental chemistry properties is critical for the green and sustainable development of the chemical industry and drug discovery. Machine learning methods can be applied to learn the relations between chemical structures and their environmental impact. Graph machine learning, by learning the representations directly from molecular graphs, may have better predictive power than conventional feature-based models. In this work, we leveraged graph neural networks to predict the environmental chemistry properties of molecules. To systematically evaluate the model performance, we selected a representative list of datasets, ranging from solubility to reactivity, and compared them directly to commonly used methods. We found that the graph model achieved near state-of-the-art accuracy for all tasks and, for several, improved the accuracy by a large margin over conventional models that rely on human-designed chemical features. This demonstrates that graph machine learning can be a powerful tool to perform representation learning for environmental chemistry. Further, we compared the data efficiency of conventional feature-based models and graph neural networks, providing guidance for model selection dependent on the size of datasets and feature requirements.

Abstract Image

基于图机器学习的分子环境化学性质预测改进[j]
环境化学性质的快速预测对化学工业的绿色可持续发展和药物开发至关重要。机器学习方法可以用于学习化学结构与其环境影响之间的关系。图机器学习,通过直接从分子图中学习表征,可能比传统的基于特征的模型具有更好的预测能力。在这项工作中,我们利用图神经网络来预测分子的环境化学性质。为了系统地评估模型的性能,我们选择了一个具有代表性的数据集列表,从溶解度到反应性,并将它们直接与常用的方法进行比较。我们发现,图模型在所有任务中都达到了接近最先进的精度,并且在一些任务中,比依赖于人为设计的化学特征的传统模型大大提高了精度。这表明图机器学习可以成为环境化学表征学习的强大工具。此外,我们比较了传统的基于特征的模型和图神经网络的数据效率,为根据数据集的大小和特征需求选择模型提供指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Green Chemistry
Green Chemistry 化学-化学综合
CiteScore
16.10
自引率
7.10%
发文量
677
审稿时长
1.4 months
期刊介绍: Green Chemistry is a journal that provides a unique forum for the publication of innovative research on the development of alternative green and sustainable technologies. The scope of Green Chemistry is based on the definition proposed by Anastas and Warner (Green Chemistry: Theory and Practice, P T Anastas and J C Warner, Oxford University Press, Oxford, 1998), which defines green chemistry as the utilisation of a set of principles that reduces or eliminates the use or generation of hazardous substances in the design, manufacture and application of chemical products. Green Chemistry aims to reduce the environmental impact of the chemical enterprise by developing a technology base that is inherently non-toxic to living things and the environment. The journal welcomes submissions on all aspects of research relating to this endeavor and publishes original and significant cutting-edge research that is likely to be of wide general appeal. For a work to be published, it must present a significant advance in green chemistry, including a comparison with existing methods and a demonstration of advantages over those methods.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信