Prediction of the space group and cell volume by training a convolutional neural network with primitive 'ideal' diffraction profiles and its application to 'real' experimental data.

IF 2.8 3区 材料科学 Q1 Biochemistry, Genetics and Molecular Biology
Journal of Applied Crystallography Pub Date : 2025-04-25 eCollection Date: 2025-06-01 DOI:10.1107/S1600576725002419
Hiroyuki Ozaki, Naoya Ishida, Tetsu Kiyobayashi
{"title":"Prediction of the space group and cell volume by training a convolutional neural network with primitive 'ideal' diffraction profiles and its application to 'real' experimental data.","authors":"Hiroyuki Ozaki, Naoya Ishida, Tetsu Kiyobayashi","doi":"10.1107/S1600576725002419","DOIUrl":null,"url":null,"abstract":"<p><p>This study describes a deep learning approach to predict the space group and unit-cell volume of inorganic crystals from their powder X-ray diffraction profiles. Using an inorganic crystallographic database, convolutional neural network (CNN) models were successfully constructed with the δ-function-like 'ideal' X-ray diffraction profiles derived solely from the intrinsic properties of the crystal structure, which are dependent on neither the incident X-ray wavelength nor the line shape of the profiles. We examined how the statistical metrics (<i>e.g.</i> the prediction accuracy, precision and recall) are influenced by the ensemble averaging technique and the multi-task learning approach; six CNN models were created from an identical data set for the former, and the space group classification was coupled with the unit-cell volume prediction in a CNN architecture for the latter. The CNN models trained in the 'ideal' world were tested with 'real' X-ray profiles for eleven materials such as TiO<sub>2</sub>, LiNiO<sub>2</sub> and LiMnO<sub>2</sub>. While the models mostly fared well in the 'real' world, the cases at odds were scrutinized to elucidate the causes of the mismatch. Specifically for Li<sub>2</sub>MnO<sub>3</sub>, detailed crystallographic considerations revealed that the mismatch can stem from the state of the specific material and/or from the quality of the experimental data, and not from the CNN models. The present study demonstrates that we can obviate the need for emulating experimental diffraction profiles in training CNN models to elicit structural information, thereby focusing efforts on further improvements.</p>","PeriodicalId":14950,"journal":{"name":"Journal of Applied Crystallography","volume":"58 Pt 3","pages":"718-730"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12135985/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Crystallography","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1107/S1600576725002419","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0

Abstract

This study describes a deep learning approach to predict the space group and unit-cell volume of inorganic crystals from their powder X-ray diffraction profiles. Using an inorganic crystallographic database, convolutional neural network (CNN) models were successfully constructed with the δ-function-like 'ideal' X-ray diffraction profiles derived solely from the intrinsic properties of the crystal structure, which are dependent on neither the incident X-ray wavelength nor the line shape of the profiles. We examined how the statistical metrics (e.g. the prediction accuracy, precision and recall) are influenced by the ensemble averaging technique and the multi-task learning approach; six CNN models were created from an identical data set for the former, and the space group classification was coupled with the unit-cell volume prediction in a CNN architecture for the latter. The CNN models trained in the 'ideal' world were tested with 'real' X-ray profiles for eleven materials such as TiO2, LiNiO2 and LiMnO2. While the models mostly fared well in the 'real' world, the cases at odds were scrutinized to elucidate the causes of the mismatch. Specifically for Li2MnO3, detailed crystallographic considerations revealed that the mismatch can stem from the state of the specific material and/or from the quality of the experimental data, and not from the CNN models. The present study demonstrates that we can obviate the need for emulating experimental diffraction profiles in training CNN models to elicit structural information, thereby focusing efforts on further improvements.

通过训练具有原始“理想”衍射剖面的卷积神经网络来预测空间群和细胞体积,并将其应用于“真实”实验数据。
本研究描述了一种深度学习方法,通过粉末x射线衍射剖面预测无机晶体的空间群和单位胞体积。利用无机晶体学数据库,利用晶体结构的固有特性获得的δ函数型“理想”x射线衍射谱成功构建了卷积神经网络(CNN)模型,该模型既不依赖于入射x射线波长,也不依赖于谱线形状。我们研究了集合平均技术和多任务学习方法对统计指标(如预测准确度、精度和召回率)的影响;前者使用相同的数据集创建了6个CNN模型,后者使用CNN架构中的空间群分类与单位细胞体积预测相结合。在“理想”世界中训练的CNN模型用“真实”x射线剖面测试了11种材料,如TiO2、LiNiO2和LiMnO2。虽然这些模型大多在“真实”世界中表现良好,但为了阐明不匹配的原因,研究人员仔细审查了存在分歧的案例。特别是对于Li2MnO3,详细的晶体学考虑表明,不匹配可能源于特定材料的状态和/或实验数据的质量,而不是来自CNN模型。本研究表明,我们可以在训练CNN模型时不需要模拟实验衍射曲线来获取结构信息,从而集中精力进行进一步的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.00
自引率
3.30%
发文量
178
审稿时长
4.7 months
期刊介绍: Many research topics in condensed matter research, materials science and the life sciences make use of crystallographic methods to study crystalline and non-crystalline matter with neutrons, X-rays and electrons. Articles published in the Journal of Applied Crystallography focus on these methods and their use in identifying structural and diffusion-controlled phase transformations, structure-property relationships, structural changes of defects, interfaces and surfaces, etc. Developments of instrumentation and crystallographic apparatus, theory and interpretation, numerical analysis and other related subjects are also covered. The journal is the primary place where crystallographic computer program information is published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信