Viejay Z. Ordillo, Koji Shimizu, D. Putungan, A. Santos-Putungan, Satoshi Watanabe, Rizalinda de Leon, Joey D. Ocon, K. Pilario, A. A. Padama
{"title":"基于机器学习辅助 DFT 的单原子合金表面反应研究的两阶段特征选择","authors":"Viejay Z. Ordillo, Koji Shimizu, D. Putungan, A. Santos-Putungan, Satoshi Watanabe, Rizalinda de Leon, Joey D. Ocon, K. Pilario, A. A. Padama","doi":"10.1088/1361-651x/ad53ee","DOIUrl":null,"url":null,"abstract":"\n This paper presents a feature-centric strategy for predicting adsorption energies of key CO2 reduction reaction (CO2RR) adsorbates, CO and H species, utilizing DFT-based calculations for eight (8) adsorption sites and considering alloying effects of nine (9) transition metals at single-atom concentrations. Here, we explore a class of materials consisting of a majority host metal where individual atoms of a different element are dispersed called single-atom alloys (SAA). A total of eight (8) feature selection methods are assessed within Gradient Boosting Regression and Linear Regression models. This study proposes a practical and effective two-stage approach that narrows down the initial 86 features to subsets of 10 and 7 for CO and H adsorption energy predictions, respectively, with the arithmetic mean of valence electrons (VE-am) feature consistently emerging as highly influential, validated through permutation and Shapley additive explanations (SHAP)-based feature importance analyses. The models exhibit robust performance on unseen data, indicating their generalization capability. The findings emphasize VE-am as a potential key machine learning feature for CO2RR on SAA surfaces and underline the effectiveness of the feature-centric approach in understanding feature impacts in machine learning models for CO2RR on SAA systems. Additionally, while other features based on structural, electronic and elemental properties may not individually impact the model significantly, their collective contribution plays a vital role in achieving more accurate adsorption energy predictions.","PeriodicalId":18648,"journal":{"name":"Modelling and Simulation in Materials Science and Engineering","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Two-Stage Feature Selection for Machine Learning-aided DFT-based Surface Reactivity Study on Single-Atom Alloys\",\"authors\":\"Viejay Z. Ordillo, Koji Shimizu, D. Putungan, A. Santos-Putungan, Satoshi Watanabe, Rizalinda de Leon, Joey D. Ocon, K. Pilario, A. A. Padama\",\"doi\":\"10.1088/1361-651x/ad53ee\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n This paper presents a feature-centric strategy for predicting adsorption energies of key CO2 reduction reaction (CO2RR) adsorbates, CO and H species, utilizing DFT-based calculations for eight (8) adsorption sites and considering alloying effects of nine (9) transition metals at single-atom concentrations. Here, we explore a class of materials consisting of a majority host metal where individual atoms of a different element are dispersed called single-atom alloys (SAA). A total of eight (8) feature selection methods are assessed within Gradient Boosting Regression and Linear Regression models. This study proposes a practical and effective two-stage approach that narrows down the initial 86 features to subsets of 10 and 7 for CO and H adsorption energy predictions, respectively, with the arithmetic mean of valence electrons (VE-am) feature consistently emerging as highly influential, validated through permutation and Shapley additive explanations (SHAP)-based feature importance analyses. The models exhibit robust performance on unseen data, indicating their generalization capability. The findings emphasize VE-am as a potential key machine learning feature for CO2RR on SAA surfaces and underline the effectiveness of the feature-centric approach in understanding feature impacts in machine learning models for CO2RR on SAA systems. Additionally, while other features based on structural, electronic and elemental properties may not individually impact the model significantly, their collective contribution plays a vital role in achieving more accurate adsorption energy predictions.\",\"PeriodicalId\":18648,\"journal\":{\"name\":\"Modelling and Simulation in Materials Science and Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Modelling and Simulation in Materials Science and Engineering\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-651x/ad53ee\",\"RegionNum\":4,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATERIALS SCIENCE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Modelling and Simulation in Materials Science and Engineering","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1088/1361-651x/ad53ee","RegionNum":4,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种以特征为中心的策略,利用基于 DFT 计算的八(8)个吸附位点,并考虑九(9)种过渡金属在单原子浓度下的合金效应,预测二氧化碳还原反应(CO2RR)关键吸附剂(CO 和 H 物种)的吸附能。在此,我们探讨了一类由多数主金属组成的材料,其中分散着不同元素的单个原子,这种材料被称为单原子合金(SAA)。在梯度提升回归(Gradient Boosting Regression)和线性回归(Linear Regression)模型中,共评估了八(8)种特征选择方法。本研究提出了一种实用有效的两阶段方法,可将最初的 86 个特征缩小到 10 个子集和 7 个子集,分别用于 CO 和 H 吸附能预测,其中价电子算术平均值(VE-am)特征始终具有很高的影响力,这一点通过基于置换和夏普利加法解释(SHAP)的特征重要性分析得到了验证。这些模型在未见过的数据上表现出稳健的性能,表明了它们的泛化能力。研究结果强调,VE-am 是机器学习对 SAA 表面 CO2RR 的潜在关键特征,并强调了以特征为中心的方法在理解机器学习模型对 SAA 系统 CO2RR 的特征影响方面的有效性。此外,虽然基于结构、电子和元素特性的其他特征可能不会单独对模型产生重大影响,但它们的集体贡献在实现更准确的吸附能预测方面发挥着至关重要的作用。
Two-Stage Feature Selection for Machine Learning-aided DFT-based Surface Reactivity Study on Single-Atom Alloys
This paper presents a feature-centric strategy for predicting adsorption energies of key CO2 reduction reaction (CO2RR) adsorbates, CO and H species, utilizing DFT-based calculations for eight (8) adsorption sites and considering alloying effects of nine (9) transition metals at single-atom concentrations. Here, we explore a class of materials consisting of a majority host metal where individual atoms of a different element are dispersed called single-atom alloys (SAA). A total of eight (8) feature selection methods are assessed within Gradient Boosting Regression and Linear Regression models. This study proposes a practical and effective two-stage approach that narrows down the initial 86 features to subsets of 10 and 7 for CO and H adsorption energy predictions, respectively, with the arithmetic mean of valence electrons (VE-am) feature consistently emerging as highly influential, validated through permutation and Shapley additive explanations (SHAP)-based feature importance analyses. The models exhibit robust performance on unseen data, indicating their generalization capability. The findings emphasize VE-am as a potential key machine learning feature for CO2RR on SAA surfaces and underline the effectiveness of the feature-centric approach in understanding feature impacts in machine learning models for CO2RR on SAA systems. Additionally, while other features based on structural, electronic and elemental properties may not individually impact the model significantly, their collective contribution plays a vital role in achieving more accurate adsorption energy predictions.
期刊介绍:
Serving the multidisciplinary materials community, the journal aims to publish new research work that advances the understanding and prediction of material behaviour at scales from atomistic to macroscopic through modelling and simulation.
Subject coverage:
Modelling and/or simulation across materials science that emphasizes fundamental materials issues advancing the understanding and prediction of material behaviour. Interdisciplinary research that tackles challenging and complex materials problems where the governing phenomena may span different scales of materials behaviour, with an emphasis on the development of quantitative approaches to explain and predict experimental observations. Material processing that advances the fundamental materials science and engineering underpinning the connection between processing and properties. Covering all classes of materials, and mechanical, microstructural, electronic, chemical, biological, and optical properties.