{"title":"以数据为中心的方法模拟小气体分子在颗粒活性炭上的吸附。","authors":"Kai Zhang,Huichun Zhang","doi":"10.1021/acs.est.4c12168","DOIUrl":null,"url":null,"abstract":"Data-driven models are increasingly employed in gas adsorption studies, optimizing adsorption and elucidating mechanisms. Yet, the importance of high-quality data sets, which is crucial for modeling, is often underexplored. Focusing on small gas molecule adsorption, we showcased novel data-centric methods to improve data set quality for adsorption modeling. First, we for the first time showed that solute descriptors could be used as features for gas molecules to increase data quality by merging smaller data sets into larger ones. This approach enabled the development of satisfactory predictive models for chlorofluorocarbons/hydrochlorofluorocarbons and hydrocarbon gas molecules for the first time. Then, we showed that mostly overlooked experimental measurements (Brunauer-Emmett-Teller, BET adsorption curves) enriched the data set quality by providing more detailed characterizations for adsorbents. New models including these curves for CO2 and CH4 reduced mean-squared errors (MSE) by approximately 18%. We also raised attention to data skewness's impact on model performance. Last, we developed a new method for \"actively\" building suitable data sets for modeling, which aligned with results by the posterior method but without requiring training models in advance. Overall, these new techniques and findings will greatly contribute to future modeling from a data-centric perspective.","PeriodicalId":36,"journal":{"name":"环境科学与技术","volume":"23 1","pages":""},"PeriodicalIF":10.8000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-Centric Approach for Modeling the Adsorption of Small Gas Molecules on Granular Activated Carbons.\",\"authors\":\"Kai Zhang,Huichun Zhang\",\"doi\":\"10.1021/acs.est.4c12168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data-driven models are increasingly employed in gas adsorption studies, optimizing adsorption and elucidating mechanisms. Yet, the importance of high-quality data sets, which is crucial for modeling, is often underexplored. Focusing on small gas molecule adsorption, we showcased novel data-centric methods to improve data set quality for adsorption modeling. First, we for the first time showed that solute descriptors could be used as features for gas molecules to increase data quality by merging smaller data sets into larger ones. This approach enabled the development of satisfactory predictive models for chlorofluorocarbons/hydrochlorofluorocarbons and hydrocarbon gas molecules for the first time. Then, we showed that mostly overlooked experimental measurements (Brunauer-Emmett-Teller, BET adsorption curves) enriched the data set quality by providing more detailed characterizations for adsorbents. New models including these curves for CO2 and CH4 reduced mean-squared errors (MSE) by approximately 18%. We also raised attention to data skewness's impact on model performance. Last, we developed a new method for \\\"actively\\\" building suitable data sets for modeling, which aligned with results by the posterior method but without requiring training models in advance. Overall, these new techniques and findings will greatly contribute to future modeling from a data-centric perspective.\",\"PeriodicalId\":36,\"journal\":{\"name\":\"环境科学与技术\",\"volume\":\"23 1\",\"pages\":\"\"},\"PeriodicalIF\":10.8000,\"publicationDate\":\"2025-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"环境科学与技术\",\"FirstCategoryId\":\"1\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.est.4c12168\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学与技术","FirstCategoryId":"1","ListUrlMain":"https://doi.org/10.1021/acs.est.4c12168","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
Data-Centric Approach for Modeling the Adsorption of Small Gas Molecules on Granular Activated Carbons.
Data-driven models are increasingly employed in gas adsorption studies, optimizing adsorption and elucidating mechanisms. Yet, the importance of high-quality data sets, which is crucial for modeling, is often underexplored. Focusing on small gas molecule adsorption, we showcased novel data-centric methods to improve data set quality for adsorption modeling. First, we for the first time showed that solute descriptors could be used as features for gas molecules to increase data quality by merging smaller data sets into larger ones. This approach enabled the development of satisfactory predictive models for chlorofluorocarbons/hydrochlorofluorocarbons and hydrocarbon gas molecules for the first time. Then, we showed that mostly overlooked experimental measurements (Brunauer-Emmett-Teller, BET adsorption curves) enriched the data set quality by providing more detailed characterizations for adsorbents. New models including these curves for CO2 and CH4 reduced mean-squared errors (MSE) by approximately 18%. We also raised attention to data skewness's impact on model performance. Last, we developed a new method for "actively" building suitable data sets for modeling, which aligned with results by the posterior method but without requiring training models in advance. Overall, these new techniques and findings will greatly contribute to future modeling from a data-centric perspective.
期刊介绍:
Environmental Science & Technology (ES&T) is a co-sponsored academic and technical magazine by the Hubei Provincial Environmental Protection Bureau and the Hubei Provincial Academy of Environmental Sciences.
Environmental Science & Technology (ES&T) holds the status of Chinese core journals, scientific papers source journals of China, Chinese Science Citation Database source journals, and Chinese Academic Journal Comprehensive Evaluation Database source journals. This publication focuses on the academic field of environmental protection, featuring articles related to environmental protection and technical advancements.