从感知到预测和解释:借助机器学习和量子化学照亮生命分子砖块的灰色地带

IF 16.8 2区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Vincenzo Barone
{"title":"从感知到预测和解释:借助机器学习和量子化学照亮生命分子砖块的灰色地带","authors":"Vincenzo Barone","doi":"10.1002/wcms.70000","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The latest developments of a general exploration/exploitation strategy for the computational study of molecular bricks of life in the gas-phase are presented and illustrated by means of prototypical semi-rigid and flexible systems. In the first step, generalized natural internal coordinates are employed to obtain a clear-cut separation between different degrees of freedom, and machine-learning algorithms based on chemical descriptors (synthons) drive fast quantum chemical methods in the exploration of rugged potential energy surfaces ruled by soft degrees of freedom. Then, different quantum chemical models are carefully selected for exploiting energies, geometries, and vibrational frequencies with the aim of maximizing the accuracy of the overall description while retaining a reasonable cost for all the steps. In particular, a composite wave-function method is used for energies, whereas a double-hybrid functional is employed for geometries and harmonic frequencies and a cheaper global hybrid functional for anharmonic contributions. A panel of molecular bricks of life containing up to 50 atoms is employed to show that the proposed strategy draws closer to the accuracy of state-of-the-art composite wave-function methods for small semi-rigid molecules, but is applicable to much larger systems. The implementation of the whole computational workflow in terms of preprocessing and postprocessing of data provided by standard electronic structure codes paves the way toward the accurate yet not prohibitively expensive study of medium- to large-sized molecules by a user-friendly black-box tool exploitable also by experiment-oriented researchers.</p>\n </div>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"15 1","pages":""},"PeriodicalIF":16.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From Perception to Prediction and Interpretation: Enlightening the Gray Zone of Molecular Bricks of Life With the Help of Machine Learning and Quantum Chemistry\",\"authors\":\"Vincenzo Barone\",\"doi\":\"10.1002/wcms.70000\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>The latest developments of a general exploration/exploitation strategy for the computational study of molecular bricks of life in the gas-phase are presented and illustrated by means of prototypical semi-rigid and flexible systems. In the first step, generalized natural internal coordinates are employed to obtain a clear-cut separation between different degrees of freedom, and machine-learning algorithms based on chemical descriptors (synthons) drive fast quantum chemical methods in the exploration of rugged potential energy surfaces ruled by soft degrees of freedom. Then, different quantum chemical models are carefully selected for exploiting energies, geometries, and vibrational frequencies with the aim of maximizing the accuracy of the overall description while retaining a reasonable cost for all the steps. In particular, a composite wave-function method is used for energies, whereas a double-hybrid functional is employed for geometries and harmonic frequencies and a cheaper global hybrid functional for anharmonic contributions. A panel of molecular bricks of life containing up to 50 atoms is employed to show that the proposed strategy draws closer to the accuracy of state-of-the-art composite wave-function methods for small semi-rigid molecules, but is applicable to much larger systems. The implementation of the whole computational workflow in terms of preprocessing and postprocessing of data provided by standard electronic structure codes paves the way toward the accurate yet not prohibitively expensive study of medium- to large-sized molecules by a user-friendly black-box tool exploitable also by experiment-oriented researchers.</p>\\n </div>\",\"PeriodicalId\":236,\"journal\":{\"name\":\"Wiley Interdisciplinary Reviews: Computational Molecular Science\",\"volume\":\"15 1\",\"pages\":\"\"},\"PeriodicalIF\":16.8000,\"publicationDate\":\"2025-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wiley Interdisciplinary Reviews: Computational Molecular Science\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/wcms.70000\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wiley Interdisciplinary Reviews: Computational Molecular Science","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/wcms.70000","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

本文介绍了气相生命分子砖计算研究的一般勘探/开发策略的最新进展,并通过半刚性和柔性原型系统进行了说明。第一步,采用广义自然内坐标来明确不同自由度之间的分离,基于化学描述符(synthons)的机器学习算法驱动快速量子化学方法来探索由软自由度支配的粗糙势能面。然后,仔细选择不同的量子化学模型来利用能量,几何形状和振动频率,目的是最大限度地提高整体描述的准确性,同时保持所有步骤的合理成本。特别地,对能量使用复合波函数方法,而对几何形状和谐波频率使用双杂化泛函,对非谐波贡献使用更便宜的全局杂化泛函。一个包含多达50个原子的生命分子砖的面板显示,所提出的策略更接近于最先进的复合波函数方法的精度,用于小的半刚性分子,但适用于更大的系统。在标准电子结构代码提供的数据的预处理和后处理方面,整个计算工作流程的实现为通过一个用户友好的黑盒工具对中大型分子进行准确但不过于昂贵的研究铺平了道路,该工具也可由面向实验的研究人员利用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

From Perception to Prediction and Interpretation: Enlightening the Gray Zone of Molecular Bricks of Life With the Help of Machine Learning and Quantum Chemistry

From Perception to Prediction and Interpretation: Enlightening the Gray Zone of Molecular Bricks of Life With the Help of Machine Learning and Quantum Chemistry

The latest developments of a general exploration/exploitation strategy for the computational study of molecular bricks of life in the gas-phase are presented and illustrated by means of prototypical semi-rigid and flexible systems. In the first step, generalized natural internal coordinates are employed to obtain a clear-cut separation between different degrees of freedom, and machine-learning algorithms based on chemical descriptors (synthons) drive fast quantum chemical methods in the exploration of rugged potential energy surfaces ruled by soft degrees of freedom. Then, different quantum chemical models are carefully selected for exploiting energies, geometries, and vibrational frequencies with the aim of maximizing the accuracy of the overall description while retaining a reasonable cost for all the steps. In particular, a composite wave-function method is used for energies, whereas a double-hybrid functional is employed for geometries and harmonic frequencies and a cheaper global hybrid functional for anharmonic contributions. A panel of molecular bricks of life containing up to 50 atoms is employed to show that the proposed strategy draws closer to the accuracy of state-of-the-art composite wave-function methods for small semi-rigid molecules, but is applicable to much larger systems. The implementation of the whole computational workflow in terms of preprocessing and postprocessing of data provided by standard electronic structure codes paves the way toward the accurate yet not prohibitively expensive study of medium- to large-sized molecules by a user-friendly black-box tool exploitable also by experiment-oriented researchers.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Wiley Interdisciplinary Reviews: Computational Molecular Science
Wiley Interdisciplinary Reviews: Computational Molecular Science CHEMISTRY, MULTIDISCIPLINARY-MATHEMATICAL & COMPUTATIONAL BIOLOGY
CiteScore
28.90
自引率
1.80%
发文量
52
审稿时长
6-12 weeks
期刊介绍: Computational molecular sciences harness the power of rigorous chemical and physical theories, employing computer-based modeling, specialized hardware, software development, algorithm design, and database management to explore and illuminate every facet of molecular sciences. These interdisciplinary approaches form a bridge between chemistry, biology, and materials sciences, establishing connections with adjacent application-driven fields in both chemistry and biology. WIREs Computational Molecular Science stands as a platform to comprehensively review and spotlight research from these dynamic and interconnected fields.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信