A simple approach to rotationally invariant machine learning of a vector quantity.

IF 3.1 2区 化学 Q3 CHEMISTRY, PHYSICAL
Jakub Martinka, Marek Pederzoli, Mario Barbatti, Pavlo O Dral, Jiří Pittner
{"title":"A simple approach to rotationally invariant machine learning of a vector quantity.","authors":"Jakub Martinka, Marek Pederzoli, Mario Barbatti, Pavlo O Dral, Jiří Pittner","doi":"10.1063/5.0230176","DOIUrl":null,"url":null,"abstract":"<p><p>Unlike with the energy, which is a scalar property, machine learning (ML) prediction of vector or tensor properties poses the additional challenge of achieving proper invariance (covariance) with respect to molecular rotation. For the energy gradients needed in molecular dynamics (MD), this symmetry is automatically fulfilled when taking analytic derivative of the energy, which is a scalar invariant (using properly invariant molecular descriptors). However, if the properties cannot be obtained by differentiation, other appropriate methods should be applied to retain the covariance. Several approaches have been suggested to properly treat this issue. For nonadiabatic couplings and polarizabilities, for example, it was possible to construct virtual quantities from which the above tensorial properties are obtained by differentiation and thus guarantee the covariance. Another possible solution is to build the rotational equivariance into the design of a neural network employed in the model. Here, we propose a simpler alternative technique, which does not require construction of auxiliary properties or application of special equivariant ML techniques. We suggest a three-step approach, using the molecular tensor of inertia. In the first step, the molecule is rotated using the eigenvectors of this tensor to its principal axes. In the second step, the ML procedure predicts the vector property relative to this orientation, based on a training set where all vector properties were in this same coordinate system. As the third step, it remains to transform the ML estimate of the vector property back to the original orientation. This rotate-predict-rotate (RPR) procedure should thus guarantee proper covariance of a vector property and is trivially extensible also to tensors such as polarizability. The RPR procedure has an advantage that the accurate models can be trained very fast for thousands of molecular configurations, which might be beneficial where many training sets are required (e.g., in active learning). We have implemented the RPR technique, using the MLatom and Newton-X programs for ML and MD, and performed its assessment on the dipole moment along MD trajectories of 1,2-dichloroethane.</p>","PeriodicalId":15313,"journal":{"name":"Journal of Chemical Physics","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Physics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1063/5.0230176","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Unlike with the energy, which is a scalar property, machine learning (ML) prediction of vector or tensor properties poses the additional challenge of achieving proper invariance (covariance) with respect to molecular rotation. For the energy gradients needed in molecular dynamics (MD), this symmetry is automatically fulfilled when taking analytic derivative of the energy, which is a scalar invariant (using properly invariant molecular descriptors). However, if the properties cannot be obtained by differentiation, other appropriate methods should be applied to retain the covariance. Several approaches have been suggested to properly treat this issue. For nonadiabatic couplings and polarizabilities, for example, it was possible to construct virtual quantities from which the above tensorial properties are obtained by differentiation and thus guarantee the covariance. Another possible solution is to build the rotational equivariance into the design of a neural network employed in the model. Here, we propose a simpler alternative technique, which does not require construction of auxiliary properties or application of special equivariant ML techniques. We suggest a three-step approach, using the molecular tensor of inertia. In the first step, the molecule is rotated using the eigenvectors of this tensor to its principal axes. In the second step, the ML procedure predicts the vector property relative to this orientation, based on a training set where all vector properties were in this same coordinate system. As the third step, it remains to transform the ML estimate of the vector property back to the original orientation. This rotate-predict-rotate (RPR) procedure should thus guarantee proper covariance of a vector property and is trivially extensible also to tensors such as polarizability. The RPR procedure has an advantage that the accurate models can be trained very fast for thousands of molecular configurations, which might be beneficial where many training sets are required (e.g., in active learning). We have implemented the RPR technique, using the MLatom and Newton-X programs for ML and MD, and performed its assessment on the dipole moment along MD trajectories of 1,2-dichloroethane.

矢量旋转不变机器学习的简单方法。
能量是一种标量属性,与之不同的是,机器学习(ML)对矢量或张量属性的预测还面临着一个额外的挑战,即如何实现与分子旋转相关的适当不变性(协方差)。对于分子动力学(MD)所需的能量梯度,在对能量进行解析导数时,这种对称性会自动得到满足,因为能量是一个标量不变式(使用适当不变的分子描述符)。但是,如果无法通过微分获得属性,则应采用其他适当的方法来保留协方差。已经提出了几种正确处理这一问题的方法。例如,对于非绝热耦合和极化率,可以构建虚拟量,通过微分得到上述张量性质,从而保证协方差。另一种可能的解决方案是在设计模型中使用的神经网络时建立旋转协方差。在这里,我们提出了一种更简单的替代技术,它不需要构建辅助属性,也不需要应用特殊的等差数列 ML 技术。我们建议采用分子惯性张量的三步法。第一步,利用该张量的特征向量将分子旋转到其主轴上。第二步,根据所有矢量属性都在同一坐标系中的训练集,利用 ML 程序预测相对于这一方向的矢量属性。第三步,将矢量属性的 ML 估计值转换回原始方向。因此,这种旋转-预测-旋转(RPR)程序应能保证矢量属性具有适当的协方差,并可扩展到极化性等张量。RPR 程序的一个优势是,可以针对成千上万的分子构型快速训练出精确的模型,这在需要大量训练集的情况下(如主动学习)可能是有益的。我们使用用于 ML 和 MD 的 MLatom 和 Newton-X 程序实现了 RPR 技术,并对 1,2-二氯乙烷 MD 轨迹上的偶极矩进行了评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Chemical Physics
Journal of Chemical Physics 物理-物理:原子、分子和化学物理
CiteScore
7.40
自引率
15.90%
发文量
1615
审稿时长
2 months
期刊介绍: The Journal of Chemical Physics publishes quantitative and rigorous science of long-lasting value in methods and applications of chemical physics. The Journal also publishes brief Communications of significant new findings, Perspectives on the latest advances in the field, and Special Topic issues. The Journal focuses on innovative research in experimental and theoretical areas of chemical physics, including spectroscopy, dynamics, kinetics, statistical mechanics, and quantum mechanics. In addition, topical areas such as polymers, soft matter, materials, surfaces/interfaces, and systems of biological relevance are of increasing importance. Topical coverage includes: Theoretical Methods and Algorithms Advanced Experimental Techniques Atoms, Molecules, and Clusters Liquids, Glasses, and Crystals Surfaces, Interfaces, and Materials Polymers and Soft Matter Biological Molecules and Networks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信