毒理学中化学结构表征学习的新趋势:机器学习模型方法综述

IF 11.4 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Jiabin Zhang, Lei Zhao, Wei Wang, De-Feng Xing, Zhen-Xing Wang, Jun Ma, Aijie Wang, Nan-Qi Ren, Duu-Jong Lee, Chuan Chen
{"title":"毒理学中化学结构表征学习的新趋势:机器学习模型方法综述","authors":"Jiabin Zhang, Lei Zhao, Wei Wang, De-Feng Xing, Zhen-Xing Wang, Jun Ma, Aijie Wang, Nan-Qi Ren, Duu-Jong Lee, Chuan Chen","doi":"10.1080/10643389.2025.2469868","DOIUrl":null,"url":null,"abstract":"Computer-assisted virtual screening using structure-activity relationship (QSAR) models is a surrogate method for reducing the need for costly animal experiments. However, traditional QSAR models face significant challenges, such as the ‘activity cliff’ phenomenon and small datasets, which limit their ability to generalize and predict toxicity. This review examines transistion of digital encodings form in molecules and its corresponding models, introducing from molecule descriptors to three advanced types of molecular representations based on deep learning techniques. We highlight the importance of deep learning models that can not only capture molecular similarity in chemical space to address the ‘activity cliff’ problem but also improve model performance through feature fusion. As alternative solutions to reduce reliance on feature engineering potentially, graph neural network, convolutional neural network and large lanuage model and their related training paradigm such as transfer learning could give another opportunity for toxicity model setting in terms of data insuffient dealing etc. This work could help potential deep learning modelers to build robust model, setting the stage for groundbreaking advancements in further development and application of toxicity prediction models.","PeriodicalId":10823,"journal":{"name":"Critical Reviews in Environmental Science and Technology","volume":"543 1","pages":"1-26"},"PeriodicalIF":11.4000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"New trend on chemical structure representation learning in toxicology: In reviews of machine learning model methodology\",\"authors\":\"Jiabin Zhang, Lei Zhao, Wei Wang, De-Feng Xing, Zhen-Xing Wang, Jun Ma, Aijie Wang, Nan-Qi Ren, Duu-Jong Lee, Chuan Chen\",\"doi\":\"10.1080/10643389.2025.2469868\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer-assisted virtual screening using structure-activity relationship (QSAR) models is a surrogate method for reducing the need for costly animal experiments. However, traditional QSAR models face significant challenges, such as the ‘activity cliff’ phenomenon and small datasets, which limit their ability to generalize and predict toxicity. This review examines transistion of digital encodings form in molecules and its corresponding models, introducing from molecule descriptors to three advanced types of molecular representations based on deep learning techniques. We highlight the importance of deep learning models that can not only capture molecular similarity in chemical space to address the ‘activity cliff’ problem but also improve model performance through feature fusion. As alternative solutions to reduce reliance on feature engineering potentially, graph neural network, convolutional neural network and large lanuage model and their related training paradigm such as transfer learning could give another opportunity for toxicity model setting in terms of data insuffient dealing etc. This work could help potential deep learning modelers to build robust model, setting the stage for groundbreaking advancements in further development and application of toxicity prediction models.\",\"PeriodicalId\":10823,\"journal\":{\"name\":\"Critical Reviews in Environmental Science and Technology\",\"volume\":\"543 1\",\"pages\":\"1-26\"},\"PeriodicalIF\":11.4000,\"publicationDate\":\"2025-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Critical Reviews in Environmental Science and Technology\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1080/10643389.2025.2469868\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Critical Reviews in Environmental Science and Technology","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1080/10643389.2025.2469868","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

使用结构-活性关系(QSAR)模型的计算机辅助虚拟筛选是减少昂贵的动物实验需求的替代方法。然而,传统的QSAR模型面临着重大挑战,例如“活动悬崖”现象和小数据集,这限制了它们概括和预测毒性的能力。本文研究了数字编码形式在分子及其相应模型中的转换,介绍了从分子描述符到基于深度学习技术的三种高级分子表示类型。我们强调了深度学习模型的重要性,它不仅可以捕获化学空间中的分子相似性来解决“活动悬崖”问题,还可以通过特征融合提高模型性能。图神经网络、卷积神经网络和大型语言模型及其相关的训练范式(如迁移学习)作为潜在的减少对特征工程依赖的替代解决方案,可以在数据处理不足等方面为毒性模型设置提供另一个机会。这项工作可以帮助潜在的深度学习建模者建立稳健的模型,为进一步开发和应用毒性预测模型奠定突破性的进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
New trend on chemical structure representation learning in toxicology: In reviews of machine learning model methodology
Computer-assisted virtual screening using structure-activity relationship (QSAR) models is a surrogate method for reducing the need for costly animal experiments. However, traditional QSAR models face significant challenges, such as the ‘activity cliff’ phenomenon and small datasets, which limit their ability to generalize and predict toxicity. This review examines transistion of digital encodings form in molecules and its corresponding models, introducing from molecule descriptors to three advanced types of molecular representations based on deep learning techniques. We highlight the importance of deep learning models that can not only capture molecular similarity in chemical space to address the ‘activity cliff’ problem but also improve model performance through feature fusion. As alternative solutions to reduce reliance on feature engineering potentially, graph neural network, convolutional neural network and large lanuage model and their related training paradigm such as transfer learning could give another opportunity for toxicity model setting in terms of data insuffient dealing etc. This work could help potential deep learning modelers to build robust model, setting the stage for groundbreaking advancements in further development and application of toxicity prediction models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
27.30
自引率
1.60%
发文量
64
审稿时长
2 months
期刊介绍: Two of the most pressing global challenges of our era involve understanding and addressing the multitude of environmental problems we face. In order to tackle them effectively, it is essential to devise logical strategies and methods for their control. Critical Reviews in Environmental Science and Technology serves as a valuable international platform for the comprehensive assessment of current knowledge across a wide range of environmental science topics. Environmental science is a field that encompasses the intricate and fluid interactions between various scientific disciplines. These include earth and agricultural sciences, chemistry, biology, medicine, and engineering. Furthermore, new disciplines such as environmental toxicology and risk assessment have emerged in response to the increasing complexity of environmental challenges. The purpose of Critical Reviews in Environmental Science and Technology is to provide a space for critical analysis and evaluation of existing knowledge in environmental science. By doing so, it encourages the advancement of our understanding and the development of effective solutions. This journal plays a crucial role in fostering international cooperation and collaboration in addressing the pressing environmental issues of our time.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信