{"title":"利用机器学习进行数据驱动建模,研究沸石吸附剂的脱硫性能","authors":"Mahyar Mansouri, Mohsen Shayanmehr, Ahad Ghaemi","doi":"10.1016/j.clet.2025.101073","DOIUrl":null,"url":null,"abstract":"<div><div>This work introduces an experimentally validated, data-driven machine learning (ML) framework for predicting the adsorptive desulfurization (ADS) performance of zeolite-based materials. A curated dataset of 700 entries was compiled from diverse sources, incorporating key structural and operational parameters such as Brunauer–Emmett–Teller (BET) surface area, total pore volume (TPV), temperature, contact time, and molecular weight of sulfur compounds (MW-S). Seven ML models were developed and compared, with Extra Trees Regressor (ETR) achieving the best performance (R<sup>2</sup> = 0.9979, MAE = 0.0308), followed by Random Forest (RF) (R<sup>2</sup> = 0.9932, MAE = 0.0524). Feature importance analysis and shapley additive explanations (SHAP) identified molecular weight and BET surface area as the most influential descriptors. For better interpretability and generalizability, the zeolite type was excluded as an input feature and replaced by physicochemical properties. Furthermore, the top-performing model was integrated with a genetic algorithm (GA) to optimize operating conditions, resulting in a predicted maximum adsorption capacity of 131.63 mg S/g. Model robustness was also confirmed using an independent test set. Overall, this study provides a reliable and interpretable framework for accelerating ADS system design and can be extended to other adsorption-based separation processes.</div></div>","PeriodicalId":34618,"journal":{"name":"Cleaner Engineering and Technology","volume":"28 ","pages":"Article 101073"},"PeriodicalIF":6.5000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-driven modeling using machine learning to investigate the desulfurization performance by zeolitic adsorbents\",\"authors\":\"Mahyar Mansouri, Mohsen Shayanmehr, Ahad Ghaemi\",\"doi\":\"10.1016/j.clet.2025.101073\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This work introduces an experimentally validated, data-driven machine learning (ML) framework for predicting the adsorptive desulfurization (ADS) performance of zeolite-based materials. A curated dataset of 700 entries was compiled from diverse sources, incorporating key structural and operational parameters such as Brunauer–Emmett–Teller (BET) surface area, total pore volume (TPV), temperature, contact time, and molecular weight of sulfur compounds (MW-S). Seven ML models were developed and compared, with Extra Trees Regressor (ETR) achieving the best performance (R<sup>2</sup> = 0.9979, MAE = 0.0308), followed by Random Forest (RF) (R<sup>2</sup> = 0.9932, MAE = 0.0524). Feature importance analysis and shapley additive explanations (SHAP) identified molecular weight and BET surface area as the most influential descriptors. For better interpretability and generalizability, the zeolite type was excluded as an input feature and replaced by physicochemical properties. Furthermore, the top-performing model was integrated with a genetic algorithm (GA) to optimize operating conditions, resulting in a predicted maximum adsorption capacity of 131.63 mg S/g. Model robustness was also confirmed using an independent test set. Overall, this study provides a reliable and interpretable framework for accelerating ADS system design and can be extended to other adsorption-based separation processes.</div></div>\",\"PeriodicalId\":34618,\"journal\":{\"name\":\"Cleaner Engineering and Technology\",\"volume\":\"28 \",\"pages\":\"Article 101073\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cleaner Engineering and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S266679082500196X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cleaner Engineering and Technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266679082500196X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
摘要
这项工作介绍了一个实验验证的,数据驱动的机器学习(ML)框架,用于预测沸石基材料的吸附脱硫(ADS)性能。收集了来自不同来源的700个条目,包括关键的结构和操作参数,如brunauer - emmet - teller (BET)表面积、总孔隙体积(TPV)、温度、接触时间和硫化合物分子量(MW-S)。建立7个ML模型进行比较,其中Extra Trees regression (ETR)模型表现最佳(R2 = 0.9979, MAE = 0.0308),其次是Random Forest (RF)模型(R2 = 0.9932, MAE = 0.0524)。特征重要性分析和shapley加性解释(SHAP)确定分子量和BET表面积是最具影响力的描述符。为了更好地解释和推广,沸石类型被排除在输入特征之外,取而代之的是物理化学性质。此外,将最佳模型与遗传算法(GA)相结合,对操作条件进行优化,预测最大吸附容量为131.63 mg S/g。模型的稳健性也通过一个独立的测试集得到证实。总的来说,本研究为加速ADS系统的设计提供了一个可靠和可解释的框架,并可扩展到其他基于吸附的分离过程。
Data-driven modeling using machine learning to investigate the desulfurization performance by zeolitic adsorbents
This work introduces an experimentally validated, data-driven machine learning (ML) framework for predicting the adsorptive desulfurization (ADS) performance of zeolite-based materials. A curated dataset of 700 entries was compiled from diverse sources, incorporating key structural and operational parameters such as Brunauer–Emmett–Teller (BET) surface area, total pore volume (TPV), temperature, contact time, and molecular weight of sulfur compounds (MW-S). Seven ML models were developed and compared, with Extra Trees Regressor (ETR) achieving the best performance (R2 = 0.9979, MAE = 0.0308), followed by Random Forest (RF) (R2 = 0.9932, MAE = 0.0524). Feature importance analysis and shapley additive explanations (SHAP) identified molecular weight and BET surface area as the most influential descriptors. For better interpretability and generalizability, the zeolite type was excluded as an input feature and replaced by physicochemical properties. Furthermore, the top-performing model was integrated with a genetic algorithm (GA) to optimize operating conditions, resulting in a predicted maximum adsorption capacity of 131.63 mg S/g. Model robustness was also confirmed using an independent test set. Overall, this study provides a reliable and interpretable framework for accelerating ADS system design and can be extended to other adsorption-based separation processes.