Qian Zhang,Hengyue Zhang,Zhiyao Su,Yajing Sun,Wenping Hu
{"title":"利用知识丰富的可解释深度学习发现有机光电子学中的分子见解。","authors":"Qian Zhang,Hengyue Zhang,Zhiyao Su,Yajing Sun,Wenping Hu","doi":"10.1021/acs.jctc.5c00713","DOIUrl":null,"url":null,"abstract":"Deep learning holds significant promise for accelerating molecular screening and materials design. However, the black-box nature of current models limits their ability to generate fundamentally new chemical knowledge and insights. Here, we propose LUMIA (Learning and Understanding Molecular Insights with Artificial Intelligence), an innovative interpretable deep learning framework integrating chemistry-informed contrastive learning and Monte Carlo tree search (MCTS). LUMIA is pretrained on approximately 1.4 million organic molecules, using knowledge-informed augmentations that embed π-conjugation and substituent effects explicitly. This allows it to effectively capture hierarchical molecular representations aligned with chemical intuition. Critically, the explicit integration of chemical knowledge enables LUMIA to achieve state-of-the-art performance across multiple organic optoelectronic property prediction tasks. Leveraging its intrinsic interpretability through MCTS, LUMIA directly uncovers previously unexplored substructure patterns influencing reorganization energy, enabling rational molecular design beyond the training data set. Furthermore, LUMIA reveals novel chemical insights, including synergistic effects of substituent positions in pyrazole derivatives. This study highlights the pivotal role of knowledge embedding in interpretable deep learning, transforming molecular design, and accelerating chemical discovery.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"21 1","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discovering Molecular Insights in Organic Optoelectronics with Knowledge-Informed Interpretable Deep Learning.\",\"authors\":\"Qian Zhang,Hengyue Zhang,Zhiyao Su,Yajing Sun,Wenping Hu\",\"doi\":\"10.1021/acs.jctc.5c00713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning holds significant promise for accelerating molecular screening and materials design. However, the black-box nature of current models limits their ability to generate fundamentally new chemical knowledge and insights. Here, we propose LUMIA (Learning and Understanding Molecular Insights with Artificial Intelligence), an innovative interpretable deep learning framework integrating chemistry-informed contrastive learning and Monte Carlo tree search (MCTS). LUMIA is pretrained on approximately 1.4 million organic molecules, using knowledge-informed augmentations that embed π-conjugation and substituent effects explicitly. This allows it to effectively capture hierarchical molecular representations aligned with chemical intuition. Critically, the explicit integration of chemical knowledge enables LUMIA to achieve state-of-the-art performance across multiple organic optoelectronic property prediction tasks. Leveraging its intrinsic interpretability through MCTS, LUMIA directly uncovers previously unexplored substructure patterns influencing reorganization energy, enabling rational molecular design beyond the training data set. Furthermore, LUMIA reveals novel chemical insights, including synergistic effects of substituent positions in pyrazole derivatives. This study highlights the pivotal role of knowledge embedding in interpretable deep learning, transforming molecular design, and accelerating chemical discovery.\",\"PeriodicalId\":45,\"journal\":{\"name\":\"Journal of Chemical Theory and Computation\",\"volume\":\"21 1\",\"pages\":\"\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Theory and Computation\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jctc.5c00713\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jctc.5c00713","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
Discovering Molecular Insights in Organic Optoelectronics with Knowledge-Informed Interpretable Deep Learning.
Deep learning holds significant promise for accelerating molecular screening and materials design. However, the black-box nature of current models limits their ability to generate fundamentally new chemical knowledge and insights. Here, we propose LUMIA (Learning and Understanding Molecular Insights with Artificial Intelligence), an innovative interpretable deep learning framework integrating chemistry-informed contrastive learning and Monte Carlo tree search (MCTS). LUMIA is pretrained on approximately 1.4 million organic molecules, using knowledge-informed augmentations that embed π-conjugation and substituent effects explicitly. This allows it to effectively capture hierarchical molecular representations aligned with chemical intuition. Critically, the explicit integration of chemical knowledge enables LUMIA to achieve state-of-the-art performance across multiple organic optoelectronic property prediction tasks. Leveraging its intrinsic interpretability through MCTS, LUMIA directly uncovers previously unexplored substructure patterns influencing reorganization energy, enabling rational molecular design beyond the training data set. Furthermore, LUMIA reveals novel chemical insights, including synergistic effects of substituent positions in pyrazole derivatives. This study highlights the pivotal role of knowledge embedding in interpretable deep learning, transforming molecular design, and accelerating chemical discovery.
期刊介绍:
The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.