{"title":"火箭与电子之间的高效材料信息学","authors":"Adam M. Krajewski","doi":"arxiv-2407.04648","DOIUrl":null,"url":null,"abstract":"The true power of computational research typically can lay in either what it\naccomplishes or what it enables others to accomplish. In this work, both\navenues are simultaneously embraced across several distinct efforts existing at\nthree general scales of abstractions of what a material is - atomistic,\nphysical, and design. At each, an efficient materials informatics\ninfrastructure is being built from the ground up based on (1) the fundamental\nunderstanding of the underlying prior knowledge, including the data, (2)\ndeployment routes that take advantage of it, and (3) pathways to extend it in\nan autonomous or semi-autonomous fashion, while heavily relying on artificial\nintelligence (AI) to guide well-established DFT-based ab initio and\nCALPHAD-based thermodynamic methods. The resulting multi-level discovery infrastructure is highly generalizable as\nit focuses on encoding problems to solve them easily rather than looking for an\nexisting solution. To showcase it, this dissertation discusses the design of\nmulti-alloy functionally graded materials (FGMs) incorporating ultra-high\ntemperature refractory high entropy alloys (RHEAs) towards gas turbine and jet\nengine efficiency increase reducing CO2 emissions, as well as hypersonic\nvehicles. It leverages a new graph representation of underlying mathematical\nspace using a newly developed algorithm based on combinatorics, not subject to\nmany problems troubling the community. Underneath, property models and phase\nrelations are learned from optimized samplings of the largest and highest\nquality dataset of HEA in the world, called ULTERA. At the atomistic level, a\ndata ecosystem optimized for machine learning (ML) from over 4.5 million\nrelaxed structures, called MPDD, is used to inform experimental observations\nand improve thermodynamic models by providing stability data enabled by a new\nefficient featurization framework.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Materials Informatics between Rockets and Electrons\",\"authors\":\"Adam M. Krajewski\",\"doi\":\"arxiv-2407.04648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The true power of computational research typically can lay in either what it\\naccomplishes or what it enables others to accomplish. In this work, both\\navenues are simultaneously embraced across several distinct efforts existing at\\nthree general scales of abstractions of what a material is - atomistic,\\nphysical, and design. At each, an efficient materials informatics\\ninfrastructure is being built from the ground up based on (1) the fundamental\\nunderstanding of the underlying prior knowledge, including the data, (2)\\ndeployment routes that take advantage of it, and (3) pathways to extend it in\\nan autonomous or semi-autonomous fashion, while heavily relying on artificial\\nintelligence (AI) to guide well-established DFT-based ab initio and\\nCALPHAD-based thermodynamic methods. The resulting multi-level discovery infrastructure is highly generalizable as\\nit focuses on encoding problems to solve them easily rather than looking for an\\nexisting solution. To showcase it, this dissertation discusses the design of\\nmulti-alloy functionally graded materials (FGMs) incorporating ultra-high\\ntemperature refractory high entropy alloys (RHEAs) towards gas turbine and jet\\nengine efficiency increase reducing CO2 emissions, as well as hypersonic\\nvehicles. It leverages a new graph representation of underlying mathematical\\nspace using a newly developed algorithm based on combinatorics, not subject to\\nmany problems troubling the community. Underneath, property models and phase\\nrelations are learned from optimized samplings of the largest and highest\\nquality dataset of HEA in the world, called ULTERA. At the atomistic level, a\\ndata ecosystem optimized for machine learning (ML) from over 4.5 million\\nrelaxed structures, called MPDD, is used to inform experimental observations\\nand improve thermodynamic models by providing stability data enabled by a new\\nefficient featurization framework.\",\"PeriodicalId\":501065,\"journal\":{\"name\":\"arXiv - PHYS - Data Analysis, Statistics and Probability\",\"volume\":\"18 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - PHYS - Data Analysis, Statistics and Probability\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.04648\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Data Analysis, Statistics and Probability","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.04648","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
计算研究的真正威力通常在于它所完成的工作或它能帮助他人完成的工作。在这项工作中,这两个方面同时贯穿于几项不同的工作中,这些工作存在于对材料的原子、物理和设计这三个一般抽象尺度上。每一项工作都从头开始构建高效的材料信息学基础设施,其基础是:(1) 对包括数据在内的先验知识的基本理解;(2) 利用先验知识的部署路线;(3) 以自主或半自主方式扩展先验知识的途径,同时在很大程度上依赖人工智能(AI)来指导成熟的基于 DFT 的 ab initio 和基于 CALPHAD 的热力学方法。由此产生的多层次发现基础架构具有很强的通用性,因为它侧重于对问题进行编码以轻松解决问题,而不是寻找已有的解决方案。为了展示这一点,本论文讨论了结合超高温难熔高熵合金(RHEAs)的多合金功能分级材料(FGMs)的设计,以提高燃气轮机和喷气发动机的效率,减少二氧化碳排放,以及高超音速飞行器。它利用新开发的基于组合学的算法,对底层数学空间进行了全新的图形表示,从而避免了许多困扰业界的问题。在此基础上,通过对世界上最大、质量最高的 HEA 数据集(ULTERA)进行优化采样,学习属性模型和相位关系。在原子水平上,从 450 多万个松弛结构中优化出的机器学习(ML)数据生态系统(称为 MPDD)被用来为实验观察提供信息,并通过新的高效特征化框架提供稳定性数据来改进热力学模型。
Efficient Materials Informatics between Rockets and Electrons
The true power of computational research typically can lay in either what it
accomplishes or what it enables others to accomplish. In this work, both
avenues are simultaneously embraced across several distinct efforts existing at
three general scales of abstractions of what a material is - atomistic,
physical, and design. At each, an efficient materials informatics
infrastructure is being built from the ground up based on (1) the fundamental
understanding of the underlying prior knowledge, including the data, (2)
deployment routes that take advantage of it, and (3) pathways to extend it in
an autonomous or semi-autonomous fashion, while heavily relying on artificial
intelligence (AI) to guide well-established DFT-based ab initio and
CALPHAD-based thermodynamic methods. The resulting multi-level discovery infrastructure is highly generalizable as
it focuses on encoding problems to solve them easily rather than looking for an
existing solution. To showcase it, this dissertation discusses the design of
multi-alloy functionally graded materials (FGMs) incorporating ultra-high
temperature refractory high entropy alloys (RHEAs) towards gas turbine and jet
engine efficiency increase reducing CO2 emissions, as well as hypersonic
vehicles. It leverages a new graph representation of underlying mathematical
space using a newly developed algorithm based on combinatorics, not subject to
many problems troubling the community. Underneath, property models and phase
relations are learned from optimized samplings of the largest and highest
quality dataset of HEA in the world, called ULTERA. At the atomistic level, a
data ecosystem optimized for machine learning (ML) from over 4.5 million
relaxed structures, called MPDD, is used to inform experimental observations
and improve thermodynamic models by providing stability data enabled by a new
efficient featurization framework.