Insight into the Mechanism of Machine Learning Models for Predicting Ionic Liquids Toxicity Based on Molecular Structure Descriptors

IF 7.3 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Runqi Zhang, Yu Wang, Wenguang Zhu, Leilei Xin, Jianguang Qi, Yinglong Wang, Zhaoyou Zhu, Peizhe Cui
{"title":"Insight into the Mechanism of Machine Learning Models for Predicting Ionic Liquids Toxicity Based on Molecular Structure Descriptors","authors":"Runqi Zhang, Yu Wang, Wenguang Zhu, Leilei Xin, Jianguang Qi, Yinglong Wang, Zhaoyou Zhu, Peizhe Cui","doi":"10.1021/acssuschemeng.4c06546","DOIUrl":null,"url":null,"abstract":"The development and application of functionalized ionic liquids (ILs) are currently hot topics in chemical engineering. However, research on ILs toxicity is significantly lagging behind studies on their physical properties and applications. This study begins with the construction of ILs toxicity model, utilizing three types of descriptors to quantify ILs structures and developing four machine learning (ML) models for predicting toxicity to <i>Daphnia magna</i>. Guttmann coefficients are used to evaluate the diversity of ILs structures. Feature engineering is employed to optimize the inputs to the quantitative structure–activity relationship (QSAR) models, enhancing their ability to capture the relationship between ILs structures and toxicity. Grid search and cross-validation ensure model robustness and prevent overfitting. Results indicate that the random forest model based on RDKit descriptors performs best (<i>R</i><sup>2</sup> = 0.975, RMSE = 0.222). SHAP analysis identifies key molecular features contributing to ILs toxicity, revealing that substructures around carbon atoms are crucial for ILs toxicity, while structures containing oxygen atoms can reduce toxicity. These findings offer insights for designing low-toxicity, environmentally friendly ILs and highlight the value of machine learning models in green chemistry and sustainability research.","PeriodicalId":25,"journal":{"name":"ACS Sustainable Chemistry & Engineering","volume":"33 1","pages":""},"PeriodicalIF":7.3000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Sustainable Chemistry & Engineering","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acssuschemeng.4c06546","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The development and application of functionalized ionic liquids (ILs) are currently hot topics in chemical engineering. However, research on ILs toxicity is significantly lagging behind studies on their physical properties and applications. This study begins with the construction of ILs toxicity model, utilizing three types of descriptors to quantify ILs structures and developing four machine learning (ML) models for predicting toxicity to Daphnia magna. Guttmann coefficients are used to evaluate the diversity of ILs structures. Feature engineering is employed to optimize the inputs to the quantitative structure–activity relationship (QSAR) models, enhancing their ability to capture the relationship between ILs structures and toxicity. Grid search and cross-validation ensure model robustness and prevent overfitting. Results indicate that the random forest model based on RDKit descriptors performs best (R2 = 0.975, RMSE = 0.222). SHAP analysis identifies key molecular features contributing to ILs toxicity, revealing that substructures around carbon atoms are crucial for ILs toxicity, while structures containing oxygen atoms can reduce toxicity. These findings offer insights for designing low-toxicity, environmentally friendly ILs and highlight the value of machine learning models in green chemistry and sustainability research.

Abstract Image

基于分子结构描述符预测离子液体毒性的机器学习模型机理透视
功能化离子液体(ILs)的开发和应用是目前化学工程领域的热门话题。然而,对离子液体毒性的研究明显落后于对其物理性质和应用的研究。本研究从构建离子液体毒性模型入手,利用三种描述符量化离子液体结构,并开发了四种机器学习(ML)模型来预测对大型蚤的毒性。Guttmann 系数用于评估 ILs 结构的多样性。特征工程用于优化定量结构-活性关系(QSAR)模型的输入,从而提高其捕捉 ILs 结构与毒性之间关系的能力。网格搜索和交叉验证确保了模型的稳健性并防止了过度拟合。结果表明,基于 RDKit 描述符的随机森林模型表现最佳(R2 = 0.975,RMSE = 0.222)。SHAP分析确定了导致ILs毒性的关键分子特征,揭示了碳原子周围的亚结构对ILs毒性至关重要,而含有氧原子的结构可以降低毒性。这些发现为设计低毒性、环境友好型 IL 提供了启示,并凸显了机器学习模型在绿色化学和可持续发展研究中的价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Sustainable Chemistry & Engineering
ACS Sustainable Chemistry & Engineering CHEMISTRY, MULTIDISCIPLINARY-ENGINEERING, CHEMICAL
CiteScore
13.80
自引率
4.80%
发文量
1470
审稿时长
1.7 months
期刊介绍: ACS Sustainable Chemistry & Engineering is a prestigious weekly peer-reviewed scientific journal published by the American Chemical Society. Dedicated to advancing the principles of green chemistry and green engineering, it covers a wide array of research topics including green chemistry, green engineering, biomass, alternative energy, and life cycle assessment. The journal welcomes submissions in various formats, including Letters, Articles, Features, and Perspectives (Reviews), that address the challenges of sustainability in the chemical enterprise and contribute to the advancement of sustainable practices. Join us in shaping the future of sustainable chemistry and engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信