{"title":"Development of Convolutional Neural Network-Based Models for Efficient and Reliable Flashpoint Prediction","authors":"Jiaxing Zhu, Lin Hao, Hao Zhang, Hongyuan Wei","doi":"10.1021/acs.iecr.4c04373","DOIUrl":null,"url":null,"abstract":"The initial step in quantitative structure–property relationship (QSPR) modeling typically involves selecting relevant descriptors for predicting flammability characteristics. However, traditional selection methods face challenges and inefficiencies when dealing with large pools of molecular descriptors. To address these issues, this study proposes a convolutional neural network (CNN)-based method to automatically identify relevant descriptors, eliminating the need for preselection. This method is applied to predict the flashpoints of pure compounds and mixtures using two different sets of molecular descriptors. Bayesian Optimization is employed to tune the hyperparameters of the CNN models through 5-fold cross-validation. The results demonstrate that the developed CNN models achieve high accuracy, with <i>R</i><sup>2</sup> values of 0.9723 for pure compounds and 0.9939/0.9951 for mixtures in the test data set, outperforming previously established models. Notably, the CNN models maintain high accuracy even when the number of valid molecular descriptors is increased from 125 to 861. Overall, the proposed approach leads to robust performance and accelerates QSPR model development through automated molecular descriptor selection.","PeriodicalId":39,"journal":{"name":"Industrial & Engineering Chemistry Research","volume":"663 1","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Industrial & Engineering Chemistry Research","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1021/acs.iecr.4c04373","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The initial step in quantitative structure–property relationship (QSPR) modeling typically involves selecting relevant descriptors for predicting flammability characteristics. However, traditional selection methods face challenges and inefficiencies when dealing with large pools of molecular descriptors. To address these issues, this study proposes a convolutional neural network (CNN)-based method to automatically identify relevant descriptors, eliminating the need for preselection. This method is applied to predict the flashpoints of pure compounds and mixtures using two different sets of molecular descriptors. Bayesian Optimization is employed to tune the hyperparameters of the CNN models through 5-fold cross-validation. The results demonstrate that the developed CNN models achieve high accuracy, with R2 values of 0.9723 for pure compounds and 0.9939/0.9951 for mixtures in the test data set, outperforming previously established models. Notably, the CNN models maintain high accuracy even when the number of valid molecular descriptors is increased from 125 to 861. Overall, the proposed approach leads to robust performance and accelerates QSPR model development through automated molecular descriptor selection.
期刊介绍:
ndustrial & Engineering Chemistry, with variations in title and format, has been published since 1909 by the American Chemical Society. Industrial & Engineering Chemistry Research is a weekly publication that reports industrial and academic research in the broad fields of applied chemistry and chemical engineering with special focus on fundamentals, processes, and products.