Yong Li , Chenchong Wang , Yu Zhang , Yuqi Zhang , Lingyu Wang , Yizhuang Li , Wei Xu
{"title":"Thermodynamically informed graph for interpretable and extensible machine learning: Martensite start temperature prediction","authors":"Yong Li , Chenchong Wang , Yu Zhang , Yuqi Zhang , Lingyu Wang , Yizhuang Li , Wei Xu","doi":"10.1016/j.calphad.2024.102710","DOIUrl":null,"url":null,"abstract":"<div><p>A common challenge in accelerated material design is to apply machine learning (ML) methods that can handle data with different structures and dimensions, and also provide physical interpretability. Unfortunately, most existing ML methods are ‘black box’ models incapable of providing physical interpretation or dealing with missing dimensions data that are often encountered in materials science. To overcome this challenge, we propose an interpretable and extensible machine learning framework based on thermodynamically informed graphs and deep data mining from graph neural networks. We demonstrate our framework on the problem of predicting the martensite start (M<sub>s</sub>) temperature, which depends on various factors (composition, austenite grain size, and outfield conditions). We construct a thermodynamically informed graph that captures the quantitative relationships between these factors and the M<sub>s</sub> temperature using limited and incomplete data. The prediction results indicate that our framework provides clear physical insights because the thermodynamic mechanisms are embedded in the thermodynamic representation graph. Our framework has several advantages: 1) it incorporates thermodynamic mechanisms into the graph structure, 2) it can handle missing dimensions data by filling in the gaps with graph information, and 3) it can be easily extended to new features without requiring much additional data for training. Moreover, we derive a general empirical equation for the M<sub>s</sub> temperature prediction from the trained graph neural networks for practical applications.</p></div>","PeriodicalId":9436,"journal":{"name":"Calphad-computer Coupling of Phase Diagrams and Thermochemistry","volume":"85 ","pages":"Article 102710"},"PeriodicalIF":1.9000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Calphad-computer Coupling of Phase Diagrams and Thermochemistry","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S036459162400052X","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
A common challenge in accelerated material design is to apply machine learning (ML) methods that can handle data with different structures and dimensions, and also provide physical interpretability. Unfortunately, most existing ML methods are ‘black box’ models incapable of providing physical interpretation or dealing with missing dimensions data that are often encountered in materials science. To overcome this challenge, we propose an interpretable and extensible machine learning framework based on thermodynamically informed graphs and deep data mining from graph neural networks. We demonstrate our framework on the problem of predicting the martensite start (Ms) temperature, which depends on various factors (composition, austenite grain size, and outfield conditions). We construct a thermodynamically informed graph that captures the quantitative relationships between these factors and the Ms temperature using limited and incomplete data. The prediction results indicate that our framework provides clear physical insights because the thermodynamic mechanisms are embedded in the thermodynamic representation graph. Our framework has several advantages: 1) it incorporates thermodynamic mechanisms into the graph structure, 2) it can handle missing dimensions data by filling in the gaps with graph information, and 3) it can be easily extended to new features without requiring much additional data for training. Moreover, we derive a general empirical equation for the Ms temperature prediction from the trained graph neural networks for practical applications.
加速材料设计中的一个常见挑战是如何应用机器学习(ML)方法来处理具有不同结构和维度的数据,并提供物理解释性。遗憾的是,大多数现有的机器学习方法都是 "黑盒 "模型,无法提供物理解释或处理材料科学中经常遇到的缺失维度数据。为了克服这一挑战,我们提出了一种可解释、可扩展的机器学习框架,该框架基于热力学信息图以及图神经网络的深度数据挖掘。我们在预测马氏体起始(Ms)温度的问题上演示了我们的框架,该温度取决于各种因素(成分、奥氏体晶粒尺寸和外场条件)。我们利用有限和不完整的数据构建了一个热力学信息图,该图捕捉了这些因素与 Ms 温度之间的定量关系。预测结果表明,我们的框架提供了清晰的物理洞察力,因为热力学机制已嵌入热力学表示图中。我们的框架有几个优点1)它将热力学机制纳入了图结构;2)它可以通过用图信息填补空白来处理缺失的维度数据;3)它可以很容易地扩展到新的特征,而不需要很多额外的训练数据。此外,我们还为实际应用中通过训练有素的图神经网络预测 Ms 温度推导出了一个通用经验方程。
期刊介绍:
The design of industrial processes requires reliable thermodynamic data. CALPHAD (Computer Coupling of Phase Diagrams and Thermochemistry) aims to promote computational thermodynamics through development of models to represent thermodynamic properties for various phases which permit prediction of properties of multicomponent systems from those of binary and ternary subsystems, critical assessment of data and their incorporation into self-consistent databases, development of software to optimize and derive thermodynamic parameters and the development and use of databanks for calculations to improve understanding of various industrial and technological processes. This work is disseminated through the CALPHAD journal and its annual conference.