Hybrid Unsupervised/Supervised Machine Learning for Identifying Molecular Structural Fingerprints From Ensemble Property

IF 3.4 3区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Arpan Choudhury, Debashree Ghosh
{"title":"Hybrid Unsupervised/Supervised Machine Learning for Identifying Molecular Structural Fingerprints From Ensemble Property","authors":"Arpan Choudhury,&nbsp;Debashree Ghosh","doi":"10.1002/jcc.70038","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The ensemble properties of a system are obtained by averaging over the properties calculated for the various configurations it can have at a finite temperature and thus cannot be captured by a single molecular structure. Such ensemble properties are often important in material discovery. In designing new materials, the goal is to predict those ensemble structures that display a tailored property. However, mapping this average property to multiple structures introduces ambiguities and unreliable convergence in supervised machine learning. This presents a major obstacle in designing new materials. Here, we introduce a hybrid unsupervised/supervised learning method and demonstrate how to predict the structural parameters defining the conformers of a heterogeneous system, melanin, from its ensemble-averaged spectra. This also shows a new way to identify different structural fingerprints responsible for an ensemble-averaged superposition spectrum.</p>\n </div>","PeriodicalId":188,"journal":{"name":"Journal of Computational Chemistry","volume":"46 3","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Chemistry","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70038","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The ensemble properties of a system are obtained by averaging over the properties calculated for the various configurations it can have at a finite temperature and thus cannot be captured by a single molecular structure. Such ensemble properties are often important in material discovery. In designing new materials, the goal is to predict those ensemble structures that display a tailored property. However, mapping this average property to multiple structures introduces ambiguities and unreliable convergence in supervised machine learning. This presents a major obstacle in designing new materials. Here, we introduce a hybrid unsupervised/supervised learning method and demonstrate how to predict the structural parameters defining the conformers of a heterogeneous system, melanin, from its ensemble-averaged spectra. This also shows a new way to identify different structural fingerprints responsible for an ensemble-averaged superposition spectrum.

Abstract Image

Abstract Image

一个系统的集合特性是通过对其在有限温度下可能具有的各种构型所计算出的特性进行平均而获得的,因此无法用单一分子结构来捕捉。这种集合特性通常在材料发现中非常重要。在设计新材料时,我们的目标是预测那些能显示出定制特性的集合结构。然而,将这种平均特性映射到多个结构会带来模糊性,并且在有监督的机器学习中收敛不可靠。这成为设计新材料的一大障碍。在此,我们介绍了一种混合的无监督/监督学习方法,并演示了如何从黑色素的集合平均光谱中预测定义异质系统构象的结构参数。这也展示了一种新的方法来识别造成集合平均叠加光谱的不同结构指纹。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.60
自引率
3.30%
发文量
247
审稿时长
1.7 months
期刊介绍: This distinguished journal publishes articles concerned with all aspects of computational chemistry: analytical, biological, inorganic, organic, physical, and materials. The Journal of Computational Chemistry presents original research, contemporary developments in theory and methodology, and state-of-the-art applications. Computational areas that are featured in the journal include ab initio and semiempirical quantum mechanics, density functional theory, molecular mechanics, molecular dynamics, statistical mechanics, cheminformatics, biomolecular structure prediction, molecular design, and bioinformatics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信