将化学知识应用于反应性的机器学习。

IF 3.5 2区 物理与天体物理 Q1 PHYSICS, MULTIDISCIPLINARY
Kjell Jorner
{"title":"将化学知识应用于反应性的机器学习。","authors":"Kjell Jorner","doi":"10.2533/chimia.2023.22","DOIUrl":null,"url":null,"abstract":"<p><p>Machine learning has been used to study chemical reactivity for a long time in fields such as physical organic chemistry, chemometrics and cheminformatics. Recent advances in computer science have resulted in deep neural networks that can learn directly from the molecular structure. Neural networks are a good choice when large amounts of data are available. However, many datasets in chemistry are small, and models utilizing chemical knowledge are required for good performance. Adding chemical knowledge can be achieved either by adding more information about the molecules or by adjusting the model architecture itself. The current method of choice for adding more information is descriptors based on computed quantum-chemical properties. Exciting new research directions show that it is possible to augment deep learning with such descriptors for better performance in the low-data regime. To modify the models, differentiable programming enables seamless merging of neural networks with mathematical models from chemistry and physics. The resulting methods are also more data-efficient and make better predictions for molecules that are different from the initial dataset on which they were trained. Application of these chemistry-informed machine learning methods promise to accelerate research in fields such as drug design, materials design, catalysis and reactivity.</p>","PeriodicalId":10344,"journal":{"name":"Chinese Physics Letters","volume":"18 1","pages":"22-30"},"PeriodicalIF":3.5000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Putting Chemical Knowledge to Work in Machine Learning for Reactivity.\",\"authors\":\"Kjell Jorner\",\"doi\":\"10.2533/chimia.2023.22\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Machine learning has been used to study chemical reactivity for a long time in fields such as physical organic chemistry, chemometrics and cheminformatics. Recent advances in computer science have resulted in deep neural networks that can learn directly from the molecular structure. Neural networks are a good choice when large amounts of data are available. However, many datasets in chemistry are small, and models utilizing chemical knowledge are required for good performance. Adding chemical knowledge can be achieved either by adding more information about the molecules or by adjusting the model architecture itself. The current method of choice for adding more information is descriptors based on computed quantum-chemical properties. Exciting new research directions show that it is possible to augment deep learning with such descriptors for better performance in the low-data regime. To modify the models, differentiable programming enables seamless merging of neural networks with mathematical models from chemistry and physics. The resulting methods are also more data-efficient and make better predictions for molecules that are different from the initial dataset on which they were trained. Application of these chemistry-informed machine learning methods promise to accelerate research in fields such as drug design, materials design, catalysis and reactivity.</p>\",\"PeriodicalId\":10344,\"journal\":{\"name\":\"Chinese Physics Letters\",\"volume\":\"18 1\",\"pages\":\"22-30\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2023-02-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chinese Physics Letters\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.2533/chimia.2023.22\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PHYSICS, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Physics Letters","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.2533/chimia.2023.22","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

在物理有机化学、化学计量学和化学信息学等领域,机器学习一直被用于研究化学反应性。计算机科学的最新进展导致了可以直接从分子结构中学习的深度神经网络。当有大量可用数据时,神经网络是一个很好的选择。然而,化学中的许多数据集都很小,并且需要利用化学知识的模型来获得良好的性能。添加化学知识可以通过添加更多关于分子的信息或调整模型结构本身来实现。目前选择的增加更多信息的方法是基于计算量子化学性质的描述符。令人兴奋的新研究方向表明,有可能用这种描述符增强深度学习,从而在低数据状态下获得更好的性能。为了修改模型,可微编程使神经网络与化学和物理的数学模型无缝融合。由此产生的方法数据效率也更高,并且可以更好地预测与训练它们的初始数据集不同的分子。这些化学知识机器学习方法的应用有望加速药物设计、材料设计、催化和反应性等领域的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Putting Chemical Knowledge to Work in Machine Learning for Reactivity.

Machine learning has been used to study chemical reactivity for a long time in fields such as physical organic chemistry, chemometrics and cheminformatics. Recent advances in computer science have resulted in deep neural networks that can learn directly from the molecular structure. Neural networks are a good choice when large amounts of data are available. However, many datasets in chemistry are small, and models utilizing chemical knowledge are required for good performance. Adding chemical knowledge can be achieved either by adding more information about the molecules or by adjusting the model architecture itself. The current method of choice for adding more information is descriptors based on computed quantum-chemical properties. Exciting new research directions show that it is possible to augment deep learning with such descriptors for better performance in the low-data regime. To modify the models, differentiable programming enables seamless merging of neural networks with mathematical models from chemistry and physics. The resulting methods are also more data-efficient and make better predictions for molecules that are different from the initial dataset on which they were trained. Application of these chemistry-informed machine learning methods promise to accelerate research in fields such as drug design, materials design, catalysis and reactivity.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Chinese Physics Letters
Chinese Physics Letters 物理-物理:综合
CiteScore
5.90
自引率
8.60%
发文量
13238
审稿时长
4 months
期刊介绍: Chinese Physics Letters provides rapid publication of short reports and important research in all fields of physics and is published by the Chinese Physical Society and hosted online by IOP Publishing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信