{"title":"Multi-modal Homogeneous Chemical Reaction Performance Prediction with Graph and Chemical Language Information","authors":"Shen Wang, Weiren Zhao, Yining Liu, Yang Li","doi":"10.1002/cjoc.202401186","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Accurate prediction for chemical reaction performance offers optimal direction for synthetic development. To this end, we present a novel multi-modal model called MMHRP-GCL to achieve the prediction of homogeneous chemical reaction yield, enantioselectivity, and activation energy by fusing the information from the text and graph modalities, requiring only 8 simple descriptors and Reaction SMILES obtained without high-cost DFT computation, and capable of managing reactions involving a fluctuating number of molecules. Experimental results on 4 datasets show that MMHRP-GCL outperforms at least 7 generalized SOTA methods. Ablation study confirms the critical roles of the complementation of graph and text modalities, as well as the significance of modality alignment and atomic features in prediction. Albeit there is still room for improvement in the interpretation of atomic relationships, the model has a remarkable ability to identify important atoms. A statistically interpretable study of the feature importance and a test on challenging dataset further demonstrates the utility and potential of the model. As a high-accuracy, low-cost, interpretable, and general multi-modal model, MMHRP-GCL provides valuable guidance on the design of forward predictors for homogeneous catalytic reactions.</p>\n <p>\n </p>\n </div>","PeriodicalId":151,"journal":{"name":"Chinese Journal of Chemistry","volume":"43 11","pages":"1230-1238"},"PeriodicalIF":5.5000,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Journal of Chemistry","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cjoc.202401186","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate prediction for chemical reaction performance offers optimal direction for synthetic development. To this end, we present a novel multi-modal model called MMHRP-GCL to achieve the prediction of homogeneous chemical reaction yield, enantioselectivity, and activation energy by fusing the information from the text and graph modalities, requiring only 8 simple descriptors and Reaction SMILES obtained without high-cost DFT computation, and capable of managing reactions involving a fluctuating number of molecules. Experimental results on 4 datasets show that MMHRP-GCL outperforms at least 7 generalized SOTA methods. Ablation study confirms the critical roles of the complementation of graph and text modalities, as well as the significance of modality alignment and atomic features in prediction. Albeit there is still room for improvement in the interpretation of atomic relationships, the model has a remarkable ability to identify important atoms. A statistically interpretable study of the feature importance and a test on challenging dataset further demonstrates the utility and potential of the model. As a high-accuracy, low-cost, interpretable, and general multi-modal model, MMHRP-GCL provides valuable guidance on the design of forward predictors for homogeneous catalytic reactions.
期刊介绍:
The Chinese Journal of Chemistry is an international forum for peer-reviewed original research results in all fields of chemistry. Founded in 1983 under the name Acta Chimica Sinica English Edition and renamed in 1990 as Chinese Journal of Chemistry, the journal publishes a stimulating mixture of Accounts, Full Papers, Notes and Communications in English.