Yan Yuan, Yilin Zhao, Linling Lu, Junjie Wang, Jingbo Chen, Shubin Liu, Paul W Ayers, Dongbo Zhao
{"title":"Multiproperty Deep Learning of the Correlation Energy of Electrons and the Physicochemical Properties of Molecules.","authors":"Yan Yuan, Yilin Zhao, Linling Lu, Junjie Wang, Jingbo Chen, Shubin Liu, Paul W Ayers, Dongbo Zhao","doi":"10.1021/acs.jctc.5c00414","DOIUrl":null,"url":null,"abstract":"<p><p>The density-based descriptors from the information-theoretic approach (ITA) are used as features for multiproperty deep learning (DL), predicting the correlation energy and physicochemical properties of molecules. In addition to response properties (molecular polarizability α<sub>iso</sub> and NMR shielding constant σ<sub>iso</sub>) where ITA has been shown to work well before, we consider four conceptually distinct but practically related concepts: electron correlation, redox potential, octanol-water partition coefficient (log<i>K</i><sub>ow</sub>), and the wavelength of maximum absorption (λ<sub>max</sub>). The DL-predicted results are in good agreement with either the calculated or experimental counterparts, indicative of the model's robustness. We verified the transferability of redox potentials of phenazine derivatives. Generalizability is observed for the λ<sub>max</sub> data: small chromophores are used for training/validation but the test set has sizable molecules. The trained DL model outperforms the conventional TD-DFT method in terms of accuracy and efficiency. We also showcase that the isotropic quadrupole moment (Θ<sub>iso</sub>) is a good predictor of log<i>K</i><sub>ow</sub>. This establishes that versatile density-based ITA quantities can be used to make accurate, low-cost predictions of both extensive and intensive properties, suggesting that this ITA-DL protocol has the potential for closed-loop chemistry automation. Implication of this work is straightforward, that a universal framework should be possible based on the ITA-based DL models.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":" ","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jctc.5c00414","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The density-based descriptors from the information-theoretic approach (ITA) are used as features for multiproperty deep learning (DL), predicting the correlation energy and physicochemical properties of molecules. In addition to response properties (molecular polarizability αiso and NMR shielding constant σiso) where ITA has been shown to work well before, we consider four conceptually distinct but practically related concepts: electron correlation, redox potential, octanol-water partition coefficient (logKow), and the wavelength of maximum absorption (λmax). The DL-predicted results are in good agreement with either the calculated or experimental counterparts, indicative of the model's robustness. We verified the transferability of redox potentials of phenazine derivatives. Generalizability is observed for the λmax data: small chromophores are used for training/validation but the test set has sizable molecules. The trained DL model outperforms the conventional TD-DFT method in terms of accuracy and efficiency. We also showcase that the isotropic quadrupole moment (Θiso) is a good predictor of logKow. This establishes that versatile density-based ITA quantities can be used to make accurate, low-cost predictions of both extensive and intensive properties, suggesting that this ITA-DL protocol has the potential for closed-loop chemistry automation. Implication of this work is straightforward, that a universal framework should be possible based on the ITA-based DL models.
期刊介绍:
The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.