Efficient structure-informed featurization and property prediction of ordered, dilute, and random atomic structures

IF 3.1 3区 材料科学 Q2 MATERIALS SCIENCE, MULTIDISCIPLINARY
Adam M. Krajewski , Jonathan W. Siegel , Zi-Kui Liu
{"title":"Efficient structure-informed featurization and property prediction of ordered, dilute, and random atomic structures","authors":"Adam M. Krajewski ,&nbsp;Jonathan W. Siegel ,&nbsp;Zi-Kui Liu","doi":"10.1016/j.commatsci.2024.113495","DOIUrl":null,"url":null,"abstract":"<div><div>Structure-informed materials informatics is a rapidly evolving discipline of materials science relying on the featurization of atomic structures or configurations to construct vector, voxel, graph, graphlet, and other representations useful for machine learning prediction of properties, fingerprinting, and generative design. This work discusses how current featurizers typically perform redundant calculations and how their efficiency could be improved by considering (1) fundamentals of crystallographic (orbits) equivalency to optimize ordered structures and (2) representation-dependent equivalency to optimize dilute, doped, and defect structures with broken symmetry. It also discusses and contrasts ways of (3) approximating random solid solutions occupying arbitrary lattices under such representations. Efficiency improvements discussed in this work were implemented within <figure><img></figure> or <em>python toolset for Structure-Informed Property and Feature Engineering with Neural Networks</em> developed by authors since 2019 and shown to increase performance from 2 to 10 times for typical inputs. Throughout this work, the authors explicitly discuss how these advances can be applied to different kinds of similar tools in the community.</div></div>","PeriodicalId":10650,"journal":{"name":"Computational Materials Science","volume":"247 ","pages":"Article 113495"},"PeriodicalIF":3.1000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Materials Science","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092702562400716X","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Structure-informed materials informatics is a rapidly evolving discipline of materials science relying on the featurization of atomic structures or configurations to construct vector, voxel, graph, graphlet, and other representations useful for machine learning prediction of properties, fingerprinting, and generative design. This work discusses how current featurizers typically perform redundant calculations and how their efficiency could be improved by considering (1) fundamentals of crystallographic (orbits) equivalency to optimize ordered structures and (2) representation-dependent equivalency to optimize dilute, doped, and defect structures with broken symmetry. It also discusses and contrasts ways of (3) approximating random solid solutions occupying arbitrary lattices under such representations. Efficiency improvements discussed in this work were implemented within
or python toolset for Structure-Informed Property and Feature Engineering with Neural Networks developed by authors since 2019 and shown to increase performance from 2 to 10 times for typical inputs. Throughout this work, the authors explicitly discuss how these advances can be applied to different kinds of similar tools in the community.

Abstract Image

有序、稀释和随机原子结构的高效结构信息特征化和特性预测
结构信息材料信息学是一门快速发展的材料科学学科,它依赖于原子结构或构型的特征化来构建矢量、体素、图、小图和其他表征,这些表征对机器学习预测特性、指纹识别和生成设计非常有用。这项研究讨论了目前的特征化器通常是如何进行冗余计算的,以及如何通过考虑(1)晶体学(轨道)等效性的基本原理来优化有序结构,以及(2)依赖于表征的等效性来优化稀释、掺杂和具有破缺对称性的缺陷结构,从而提高它们的效率。它还讨论并对比了 (3) 在此类表示下近似占据任意晶格的随机固溶体的方法。作者自 2019 年以来开发的 "利用神经网络进行结构信息属性和特征工程 "python 工具集已实现了本论文中讨论的效率改进,并证明在典型输入情况下可将性能提高 2 至 10 倍。在整个工作中,作者明确讨论了如何将这些进展应用于社区中不同类型的类似工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computational Materials Science
Computational Materials Science 工程技术-材料科学:综合
CiteScore
6.50
自引率
6.10%
发文量
665
审稿时长
26 days
期刊介绍: The goal of Computational Materials Science is to report on results that provide new or unique insights into, or significantly expand our understanding of, the properties of materials or phenomena associated with their design, synthesis, processing, characterization, and utilization. To be relevant to the journal, the results should be applied or applicable to specific material systems that are discussed within the submission.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信