Jingjin He , Ruowei Yin , Changxin Wang , Chuanbao Liu , Dezhen Xue , Yanjing Su , Lijie Qiao , Turab Lookman , Yang Bai
{"title":"Compositional design of compounds with elements not in training data using supervised learning","authors":"Jingjin He , Ruowei Yin , Changxin Wang , Chuanbao Liu , Dezhen Xue , Yanjing Su , Lijie Qiao , Turab Lookman , Yang Bai","doi":"10.1016/j.jmat.2024.06.008","DOIUrl":null,"url":null,"abstract":"<div><div>An issue of current interest in the use of machine learning models to predict compositions of materials is their reliability in predicting outcomes with elements not included in the training data. We show that the phase diagram of the ceramic (Ba<sub>1−<em>x</em>−<em>y</em></sub>Ca<sub><em>x</em></sub>Sr<sub><em>y</em></sub>)(Ti<sub>1−<em>u</em>−<em>v</em>−<em>w</em></sub>Zr<sub><em>u</em></sub>Sn<sub><em>v</em></sub>Hf<sub><em>w</em></sub>)O<sub>3</sub> can be accurately predicted if the feature values of unknown elements do not exceed the range of values for existing elements in the training data. In particular, we employ physical features as descriptors and compositions as weights to show that by excluding an element, such as Zr, Sn or Hf from the training set and treating it as an unknown element, the machine learning model accurately predicts the property only if the feature values of the unknown element does not exceed the range of values of existing elements in the training set. By adding a small amount of data for the unknown element restores the prediction accuracy. We demonstrate this for BaTiO<sub>3</sub> ceramics doped with rare earth elements where the prediction accuracy is restored if the physical feature space is suitably enlarged with training data. The prediction error increases with the Euclidean distance of the testing sample relative to the nearest training sample in the physical feature space. Our work provides an effective strategy for extending machine learning models for material compositions beyond the scope of available data.</div></div>","PeriodicalId":16173,"journal":{"name":"Journal of Materiomics","volume":"11 3","pages":"Article 100913"},"PeriodicalIF":8.4000,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Materiomics","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352847824001527","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
An issue of current interest in the use of machine learning models to predict compositions of materials is their reliability in predicting outcomes with elements not included in the training data. We show that the phase diagram of the ceramic (Ba1−x−yCaxSry)(Ti1−u−v−wZruSnvHfw)O3 can be accurately predicted if the feature values of unknown elements do not exceed the range of values for existing elements in the training data. In particular, we employ physical features as descriptors and compositions as weights to show that by excluding an element, such as Zr, Sn or Hf from the training set and treating it as an unknown element, the machine learning model accurately predicts the property only if the feature values of the unknown element does not exceed the range of values of existing elements in the training set. By adding a small amount of data for the unknown element restores the prediction accuracy. We demonstrate this for BaTiO3 ceramics doped with rare earth elements where the prediction accuracy is restored if the physical feature space is suitably enlarged with training data. The prediction error increases with the Euclidean distance of the testing sample relative to the nearest training sample in the physical feature space. Our work provides an effective strategy for extending machine learning models for material compositions beyond the scope of available data.
期刊介绍:
The Journal of Materiomics is a peer-reviewed open-access journal that aims to serve as a forum for the continuous dissemination of research within the field of materials science. It particularly emphasizes systematic studies on the relationships between composition, processing, structure, property, and performance of advanced materials. The journal is supported by the Chinese Ceramic Society and is indexed in SCIE and Scopus. It is commonly referred to as J Materiomics.