{"title":"推进乙苯生产装置故障诊断:一种机器学习和可解释的人工智能方法","authors":"Somnath Chowdhury , Sumana Roy , Bitopama Modak , Fahim Ahmed , Aditi Mahto , Abhiram Hens , Sandip Kumar Lahiri","doi":"10.1016/j.jics.2025.102157","DOIUrl":null,"url":null,"abstract":"<div><div>Fault diagnosis in chemical process plants is critical to ensuring operational safety, product quality, and economic performance. In complex units such as ethyl benzene production plant, early fault detection is challenging due to nonlinear process dynamics, strong variable interactions, and the scarcity of labelled fault data. To address this gap, this study presents a simulation-driven, explainable machine learning-based fault diagnosis system (FDS) capable of identifying multiple fault types without reliance on extensive historical plant data. A dynamic process simulation was conducted to generate datasets under both normal and faulty operating conditions. The system was first operated for 20 h in normal closed-loop mode with a 0.6-min sampling interval to form the baseline dataset. For each fault case, the simulation was reset and run for 5 h to allow stabilization, followed by 15 h of faulty operation, producing 2001 samples across 28 process variables per case. Fourteen representative faults, including sensor bias, actuator malfunctions, and process disturbances, were introduced. After feature selection, multiple classifiers were trained, and three high-performing yet diverse models, Ensemble Bagged Trees (tree-based), Support Vector Machine (kernel-based), and Fine KNN (distance-based), were integrated using a majority-voting mechanism to enhance robustness. EBT being best performing model achieved 99.98 % accuracy and an F1 score of 0.9988. Local Interpretable Model-agnostic Explanations (LIME) provided transparent insight into variable importance for each fault class, aiding interpretability. The novelty of this work lies in generating realistic labelled datasets entirely from dynamic simulation, combining heterogeneous algorithms for improved generalization, and embedding explainable AI (XAI) to create a scalable, transparent FDS framework applicable to other chemical processes with limited fault history.</div></div>","PeriodicalId":17276,"journal":{"name":"Journal of the Indian Chemical Society","volume":"102 11","pages":"Article 102157"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advancing fault diagnosis for ethyl benzene production plant: A machine learning and explainable AI approach\",\"authors\":\"Somnath Chowdhury , Sumana Roy , Bitopama Modak , Fahim Ahmed , Aditi Mahto , Abhiram Hens , Sandip Kumar Lahiri\",\"doi\":\"10.1016/j.jics.2025.102157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Fault diagnosis in chemical process plants is critical to ensuring operational safety, product quality, and economic performance. In complex units such as ethyl benzene production plant, early fault detection is challenging due to nonlinear process dynamics, strong variable interactions, and the scarcity of labelled fault data. To address this gap, this study presents a simulation-driven, explainable machine learning-based fault diagnosis system (FDS) capable of identifying multiple fault types without reliance on extensive historical plant data. A dynamic process simulation was conducted to generate datasets under both normal and faulty operating conditions. The system was first operated for 20 h in normal closed-loop mode with a 0.6-min sampling interval to form the baseline dataset. For each fault case, the simulation was reset and run for 5 h to allow stabilization, followed by 15 h of faulty operation, producing 2001 samples across 28 process variables per case. Fourteen representative faults, including sensor bias, actuator malfunctions, and process disturbances, were introduced. After feature selection, multiple classifiers were trained, and three high-performing yet diverse models, Ensemble Bagged Trees (tree-based), Support Vector Machine (kernel-based), and Fine KNN (distance-based), were integrated using a majority-voting mechanism to enhance robustness. EBT being best performing model achieved 99.98 % accuracy and an F1 score of 0.9988. Local Interpretable Model-agnostic Explanations (LIME) provided transparent insight into variable importance for each fault class, aiding interpretability. The novelty of this work lies in generating realistic labelled datasets entirely from dynamic simulation, combining heterogeneous algorithms for improved generalization, and embedding explainable AI (XAI) to create a scalable, transparent FDS framework applicable to other chemical processes with limited fault history.</div></div>\",\"PeriodicalId\":17276,\"journal\":{\"name\":\"Journal of the Indian Chemical Society\",\"volume\":\"102 11\",\"pages\":\"Article 102157\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Indian Chemical Society\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0019452225005928\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Indian Chemical Society","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0019452225005928","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Advancing fault diagnosis for ethyl benzene production plant: A machine learning and explainable AI approach
Fault diagnosis in chemical process plants is critical to ensuring operational safety, product quality, and economic performance. In complex units such as ethyl benzene production plant, early fault detection is challenging due to nonlinear process dynamics, strong variable interactions, and the scarcity of labelled fault data. To address this gap, this study presents a simulation-driven, explainable machine learning-based fault diagnosis system (FDS) capable of identifying multiple fault types without reliance on extensive historical plant data. A dynamic process simulation was conducted to generate datasets under both normal and faulty operating conditions. The system was first operated for 20 h in normal closed-loop mode with a 0.6-min sampling interval to form the baseline dataset. For each fault case, the simulation was reset and run for 5 h to allow stabilization, followed by 15 h of faulty operation, producing 2001 samples across 28 process variables per case. Fourteen representative faults, including sensor bias, actuator malfunctions, and process disturbances, were introduced. After feature selection, multiple classifiers were trained, and three high-performing yet diverse models, Ensemble Bagged Trees (tree-based), Support Vector Machine (kernel-based), and Fine KNN (distance-based), were integrated using a majority-voting mechanism to enhance robustness. EBT being best performing model achieved 99.98 % accuracy and an F1 score of 0.9988. Local Interpretable Model-agnostic Explanations (LIME) provided transparent insight into variable importance for each fault class, aiding interpretability. The novelty of this work lies in generating realistic labelled datasets entirely from dynamic simulation, combining heterogeneous algorithms for improved generalization, and embedding explainable AI (XAI) to create a scalable, transparent FDS framework applicable to other chemical processes with limited fault history.
期刊介绍:
The Journal of the Indian Chemical Society publishes original, fundamental, theorical, experimental research work of highest quality in all areas of chemistry, biochemistry, medicinal chemistry, electrochemistry, agrochemistry, chemical engineering and technology, food chemistry, environmental chemistry, etc.