{"title":"利用抽取规则增强哈米特常数预测的可解释性","authors":"Dr. Sadettin Y. Ugurlu","doi":"10.1002/slct.202501778","DOIUrl":null,"url":null,"abstract":"<p>The Hammett constants (<span></span><math></math>) describe the electron-withdrawing and electron-donating effects of substituents in aromatic compounds and are widely used in structure–activity relationship studies. However, their experimental determination is resource-intensive and time-consuming. Although graph neural networks (GNNs), such as GCN and Weave, have been proposed for predicting Hammett constants using graph-based features, they suffer from poor interpretability. To address limited interpretability, we introduce <b>Inter-Hammett</b>, a framework designed to enhance interpretability while maintaining high predictive performance. Inter-Hammett leverages cheminformatics-derived descriptors from RDKit, Mordred, PyBioMed, and CDK, followed by rigorous AutoGluon-based feature selection to mitigate the curse of dimensionality. The model core is trained using RuleFit on 85% of the dataset, ensuring a balance between accuracy and interpretability. On unseen data, Inter-Hammett achieved an <i>R</i><b><sup>2</sup> of 0.880</b> and an <b>RMSE of 0.128</b>, outperforming eleven models, including four recently published state-of-the-art deep learning approaches. Additionally, a comprehensive interpretability analysis using seven different methods further enhances transparency, making Inter-Hammett a robust alternative for Hammett's constant prediction.</p>","PeriodicalId":146,"journal":{"name":"ChemistrySelect","volume":"10 30","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Inter-Hammett: Enhancing Interpretability in Hammett‘s Constant Prediction via Extracting Rules\",\"authors\":\"Dr. Sadettin Y. Ugurlu\",\"doi\":\"10.1002/slct.202501778\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The Hammett constants (<span></span><math></math>) describe the electron-withdrawing and electron-donating effects of substituents in aromatic compounds and are widely used in structure–activity relationship studies. However, their experimental determination is resource-intensive and time-consuming. Although graph neural networks (GNNs), such as GCN and Weave, have been proposed for predicting Hammett constants using graph-based features, they suffer from poor interpretability. To address limited interpretability, we introduce <b>Inter-Hammett</b>, a framework designed to enhance interpretability while maintaining high predictive performance. Inter-Hammett leverages cheminformatics-derived descriptors from RDKit, Mordred, PyBioMed, and CDK, followed by rigorous AutoGluon-based feature selection to mitigate the curse of dimensionality. The model core is trained using RuleFit on 85% of the dataset, ensuring a balance between accuracy and interpretability. On unseen data, Inter-Hammett achieved an <i>R</i><b><sup>2</sup> of 0.880</b> and an <b>RMSE of 0.128</b>, outperforming eleven models, including four recently published state-of-the-art deep learning approaches. Additionally, a comprehensive interpretability analysis using seven different methods further enhances transparency, making Inter-Hammett a robust alternative for Hammett's constant prediction.</p>\",\"PeriodicalId\":146,\"journal\":{\"name\":\"ChemistrySelect\",\"volume\":\"10 30\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ChemistrySelect\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/slct.202501778\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ChemistrySelect","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/slct.202501778","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Inter-Hammett: Enhancing Interpretability in Hammett‘s Constant Prediction via Extracting Rules
The Hammett constants () describe the electron-withdrawing and electron-donating effects of substituents in aromatic compounds and are widely used in structure–activity relationship studies. However, their experimental determination is resource-intensive and time-consuming. Although graph neural networks (GNNs), such as GCN and Weave, have been proposed for predicting Hammett constants using graph-based features, they suffer from poor interpretability. To address limited interpretability, we introduce Inter-Hammett, a framework designed to enhance interpretability while maintaining high predictive performance. Inter-Hammett leverages cheminformatics-derived descriptors from RDKit, Mordred, PyBioMed, and CDK, followed by rigorous AutoGluon-based feature selection to mitigate the curse of dimensionality. The model core is trained using RuleFit on 85% of the dataset, ensuring a balance between accuracy and interpretability. On unseen data, Inter-Hammett achieved an R2 of 0.880 and an RMSE of 0.128, outperforming eleven models, including four recently published state-of-the-art deep learning approaches. Additionally, a comprehensive interpretability analysis using seven different methods further enhances transparency, making Inter-Hammett a robust alternative for Hammett's constant prediction.
期刊介绍:
ChemistrySelect is the latest journal from ChemPubSoc Europe and Wiley-VCH. It offers researchers a quality society-owned journal in which to publish their work in all areas of chemistry. Manuscripts are evaluated by active researchers to ensure they add meaningfully to the scientific literature, and those accepted are processed quickly to ensure rapid online publication.