A comparative study of deep learning models and classification algorithms for chemical compound identification and Tox21 prediction

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
{"title":"A comparative study of deep learning models and classification algorithms for chemical compound identification and Tox21 prediction","authors":"","doi":"10.1016/j.compchemeng.2024.108805","DOIUrl":null,"url":null,"abstract":"<div><p>Chemical compound classification, toxicity prediction, and environmental risk assessments are critically important in various applications within the field of chemistry. Deep learning models provide highly effective tools for extracting features from complex large datasets and performing classification tasks. Four different deep learning models, namely ResNet50V2, VGG19, InceptionV3, and MobileNetV2, have been compared with the random forest (RF) and k-nearest neighbors (KNN) algorithms. The results obtained from experiments conducted using QRCODE images of the Tox21SMILES dataset demonstrate the effectiveness of deep learning models for classifying chemical compounds and showcase the performance of different classification algorithms. The findings of the study thoroughly evaluate the performance of deep learning models and classification algorithms in the task of chemical classification. While ResNet50V2 and VGG19 models achieve high accuracy and precision, InceptionV3 and MobileNetV2 models provide more balanced results. Additionally, in terms of classification algorithms, the k-nearest neighbors (KNN) algorithm generally outperforms the Random Forest (RF) algorithm. Although the RF algorithm achieves good accuracy, the KNN algorithm proves to be more effective in terms of sensitivity and F1 score. These results emphasize the factors to consider when choosing which deep learning model or classification algorithm to use in chemical classification tasks. In conclusion, this study presents a comprehensive analysis comparing the performance of deep learning models and classification algorithms in chemical classification tasks. The selection of the most suitable model and algorithm for a specific task supports achieving better results in the classification of chemical compounds and related applications.</p></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":null,"pages":null},"PeriodicalIF":3.9000,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424002230","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Chemical compound classification, toxicity prediction, and environmental risk assessments are critically important in various applications within the field of chemistry. Deep learning models provide highly effective tools for extracting features from complex large datasets and performing classification tasks. Four different deep learning models, namely ResNet50V2, VGG19, InceptionV3, and MobileNetV2, have been compared with the random forest (RF) and k-nearest neighbors (KNN) algorithms. The results obtained from experiments conducted using QRCODE images of the Tox21SMILES dataset demonstrate the effectiveness of deep learning models for classifying chemical compounds and showcase the performance of different classification algorithms. The findings of the study thoroughly evaluate the performance of deep learning models and classification algorithms in the task of chemical classification. While ResNet50V2 and VGG19 models achieve high accuracy and precision, InceptionV3 and MobileNetV2 models provide more balanced results. Additionally, in terms of classification algorithms, the k-nearest neighbors (KNN) algorithm generally outperforms the Random Forest (RF) algorithm. Although the RF algorithm achieves good accuracy, the KNN algorithm proves to be more effective in terms of sensitivity and F1 score. These results emphasize the factors to consider when choosing which deep learning model or classification algorithm to use in chemical classification tasks. In conclusion, this study presents a comprehensive analysis comparing the performance of deep learning models and classification algorithms in chemical classification tasks. The selection of the most suitable model and algorithm for a specific task supports achieving better results in the classification of chemical compounds and related applications.

Abstract Image

用于化合物识别和 Tox21 预测的深度学习模型和分类算法比较研究
化合物分类、毒性预测和环境风险评估在化学领域的各种应用中至关重要。深度学习模型为从复杂的大型数据集中提取特征和执行分类任务提供了高效的工具。我们将四种不同的深度学习模型,即 ResNet50V2、VGG19、InceptionV3 和 MobileNetV2,与随机森林(RF)和 k-nearest neighbors(KNN)算法进行了比较。使用 Tox21SMILES 数据集的 QRCODE 图像进行的实验结果证明了深度学习模型在化合物分类方面的有效性,并展示了不同分类算法的性能。研究结果全面评估了深度学习模型和分类算法在化学分类任务中的性能。ResNet50V2 和 VGG19 模型实现了较高的准确度和精确度,而 InceptionV3 和 MobileNetV2 模型则提供了更均衡的结果。此外,在分类算法方面,k-近邻(KNN)算法普遍优于随机森林(RF)算法。虽然 RF 算法取得了不错的准确率,但事实证明 KNN 算法在灵敏度和 F1 分数方面更为有效。这些结果强调了在化学分类任务中选择使用哪种深度学习模型或分类算法时需要考虑的因素。总之,本研究全面分析比较了深度学习模型和分类算法在化学分类任务中的表现。为特定任务选择最合适的模型和算法有助于在化合物分类和相关应用中取得更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信