{"title":"Data-driven modeling using machine learning to investigate the desulfurization performance by zeolitic adsorbents","authors":"Mahyar Mansouri, Mohsen Shayanmehr, Ahad Ghaemi","doi":"10.1016/j.clet.2025.101073","DOIUrl":null,"url":null,"abstract":"<div><div>This work introduces an experimentally validated, data-driven machine learning (ML) framework for predicting the adsorptive desulfurization (ADS) performance of zeolite-based materials. A curated dataset of 700 entries was compiled from diverse sources, incorporating key structural and operational parameters such as Brunauer–Emmett–Teller (BET) surface area, total pore volume (TPV), temperature, contact time, and molecular weight of sulfur compounds (MW-S). Seven ML models were developed and compared, with Extra Trees Regressor (ETR) achieving the best performance (R<sup>2</sup> = 0.9979, MAE = 0.0308), followed by Random Forest (RF) (R<sup>2</sup> = 0.9932, MAE = 0.0524). Feature importance analysis and shapley additive explanations (SHAP) identified molecular weight and BET surface area as the most influential descriptors. For better interpretability and generalizability, the zeolite type was excluded as an input feature and replaced by physicochemical properties. Furthermore, the top-performing model was integrated with a genetic algorithm (GA) to optimize operating conditions, resulting in a predicted maximum adsorption capacity of 131.63 mg S/g. Model robustness was also confirmed using an independent test set. Overall, this study provides a reliable and interpretable framework for accelerating ADS system design and can be extended to other adsorption-based separation processes.</div></div>","PeriodicalId":34618,"journal":{"name":"Cleaner Engineering and Technology","volume":"28 ","pages":"Article 101073"},"PeriodicalIF":6.5000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cleaner Engineering and Technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266679082500196X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
This work introduces an experimentally validated, data-driven machine learning (ML) framework for predicting the adsorptive desulfurization (ADS) performance of zeolite-based materials. A curated dataset of 700 entries was compiled from diverse sources, incorporating key structural and operational parameters such as Brunauer–Emmett–Teller (BET) surface area, total pore volume (TPV), temperature, contact time, and molecular weight of sulfur compounds (MW-S). Seven ML models were developed and compared, with Extra Trees Regressor (ETR) achieving the best performance (R2 = 0.9979, MAE = 0.0308), followed by Random Forest (RF) (R2 = 0.9932, MAE = 0.0524). Feature importance analysis and shapley additive explanations (SHAP) identified molecular weight and BET surface area as the most influential descriptors. For better interpretability and generalizability, the zeolite type was excluded as an input feature and replaced by physicochemical properties. Furthermore, the top-performing model was integrated with a genetic algorithm (GA) to optimize operating conditions, resulting in a predicted maximum adsorption capacity of 131.63 mg S/g. Model robustness was also confirmed using an independent test set. Overall, this study provides a reliable and interpretable framework for accelerating ADS system design and can be extended to other adsorption-based separation processes.