利用生成式机器学习从粉末衍射图样确定晶体结构

IF 15.6 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Eric A. Riesel, Tsach Mackey, Hamed Nilforoshan, Minkai Xu, Catherine K. Badding, Alison B. Altman, Jure Leskovec* and Danna E. Freedman*, 
{"title":"利用生成式机器学习从粉末衍射图样确定晶体结构","authors":"Eric A. Riesel,&nbsp;Tsach Mackey,&nbsp;Hamed Nilforoshan,&nbsp;Minkai Xu,&nbsp;Catherine K. Badding,&nbsp;Alison B. Altman,&nbsp;Jure Leskovec* and Danna E. Freedman*,&nbsp;","doi":"10.1021/jacs.4c1024410.1021/jacs.4c10244","DOIUrl":null,"url":null,"abstract":"<p >Powder X-ray diffraction (PXRD) is a cornerstone technique in materials characterization. However, complete structure determination from PXRD patterns alone remains time-consuming and is often intractable, especially for novel materials. Current machine learning (ML) approaches to PXRD analysis predict only a subset of the total information that comprises a crystal structure. We developed a pioneering generative ML model designed to solve crystal structures from real-world experimental PXRD data. In addition to strong performance on simulated diffraction patterns, we demonstrate full structure solutions over a large set of experimental diffraction patterns. Benchmarking our model, we predicted the structure for 134 experimental patterns from the RRUFF database and thousands of simulated patterns from the Materials Project on which our model achieves state-of-the-art 42 and 67% match rate, respectively. Further, we applied our model to determine the unreported structures of materials such as NaCu<sub>2</sub>P<sub>2</sub>, Ca<sub>2</sub>MnTeO<sub>6</sub>, ZrGe<sub>6</sub>Ni<sub>6</sub>, LuOF, and HoNdV<sub>2</sub>O<sub>8</sub> from the Powder Diffraction File database. We extended this methodology to new materials created in our lab at high pressure with previously unsolved structures and found the new binary compounds Rh<sub>3</sub>Bi, RuBi<sub>2</sub>, and KBi<sub>3</sub>. We expect that our model will open avenues toward materials discovery under conditions which preclude single crystal growth and toward automated materials discovery pipelines, opening the door to new domains of chemistry.</p>","PeriodicalId":49,"journal":{"name":"Journal of the American Chemical Society","volume":"146 44","pages":"30340–30348 30340–30348"},"PeriodicalIF":15.6000,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Crystal Structure Determination from Powder Diffraction Patterns with Generative Machine Learning\",\"authors\":\"Eric A. Riesel,&nbsp;Tsach Mackey,&nbsp;Hamed Nilforoshan,&nbsp;Minkai Xu,&nbsp;Catherine K. Badding,&nbsp;Alison B. Altman,&nbsp;Jure Leskovec* and Danna E. Freedman*,&nbsp;\",\"doi\":\"10.1021/jacs.4c1024410.1021/jacs.4c10244\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Powder X-ray diffraction (PXRD) is a cornerstone technique in materials characterization. However, complete structure determination from PXRD patterns alone remains time-consuming and is often intractable, especially for novel materials. Current machine learning (ML) approaches to PXRD analysis predict only a subset of the total information that comprises a crystal structure. We developed a pioneering generative ML model designed to solve crystal structures from real-world experimental PXRD data. In addition to strong performance on simulated diffraction patterns, we demonstrate full structure solutions over a large set of experimental diffraction patterns. Benchmarking our model, we predicted the structure for 134 experimental patterns from the RRUFF database and thousands of simulated patterns from the Materials Project on which our model achieves state-of-the-art 42 and 67% match rate, respectively. Further, we applied our model to determine the unreported structures of materials such as NaCu<sub>2</sub>P<sub>2</sub>, Ca<sub>2</sub>MnTeO<sub>6</sub>, ZrGe<sub>6</sub>Ni<sub>6</sub>, LuOF, and HoNdV<sub>2</sub>O<sub>8</sub> from the Powder Diffraction File database. We extended this methodology to new materials created in our lab at high pressure with previously unsolved structures and found the new binary compounds Rh<sub>3</sub>Bi, RuBi<sub>2</sub>, and KBi<sub>3</sub>. We expect that our model will open avenues toward materials discovery under conditions which preclude single crystal growth and toward automated materials discovery pipelines, opening the door to new domains of chemistry.</p>\",\"PeriodicalId\":49,\"journal\":{\"name\":\"Journal of the American Chemical Society\",\"volume\":\"146 44\",\"pages\":\"30340–30348 30340–30348\"},\"PeriodicalIF\":15.6000,\"publicationDate\":\"2024-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the American Chemical Society\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/jacs.4c10244\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Chemical Society","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/jacs.4c10244","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

粉末 X 射线衍射 (PXRD) 是材料表征的基础技术。然而,仅从 PXRD 图样确定完整的结构仍然耗时,而且往往难以实现,尤其是对于新型材料而言。目前用于 PXRD 分析的机器学习(ML)方法只能预测晶体结构总信息的一个子集。我们开发了一种开创性的生成式 ML 模型,旨在从真实世界的 PXRD 实验数据中求解晶体结构。除了在模拟衍射图样上表现出色外,我们还展示了大量实验衍射图样的完整结构解决方案。作为模型的基准,我们预测了来自 RRUFF 数据库的 134 个实验图样和来自材料项目的数千个模拟图样的结构,在这些图样上,我们的模型分别实现了最先进的 42% 和 67% 的匹配率。此外,我们还应用我们的模型确定了粉末衍射文件数据库中 NaCu2P2、Ca2MnTeO6、ZrGe6Ni6、LuOF 和 HoNdV2O8 等材料的未报告结构。我们将这一方法扩展到我们实验室在高压下创造的具有以前未解决结构的新材料,并发现了新的二元化合物 Rh3Bi、RuBi2 和 KBi3。我们希望我们的模型能够为在不允许单晶生长的条件下发现材料和自动发现材料的管道开辟道路,从而为化学的新领域打开大门。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Crystal Structure Determination from Powder Diffraction Patterns with Generative Machine Learning

Crystal Structure Determination from Powder Diffraction Patterns with Generative Machine Learning

Powder X-ray diffraction (PXRD) is a cornerstone technique in materials characterization. However, complete structure determination from PXRD patterns alone remains time-consuming and is often intractable, especially for novel materials. Current machine learning (ML) approaches to PXRD analysis predict only a subset of the total information that comprises a crystal structure. We developed a pioneering generative ML model designed to solve crystal structures from real-world experimental PXRD data. In addition to strong performance on simulated diffraction patterns, we demonstrate full structure solutions over a large set of experimental diffraction patterns. Benchmarking our model, we predicted the structure for 134 experimental patterns from the RRUFF database and thousands of simulated patterns from the Materials Project on which our model achieves state-of-the-art 42 and 67% match rate, respectively. Further, we applied our model to determine the unreported structures of materials such as NaCu2P2, Ca2MnTeO6, ZrGe6Ni6, LuOF, and HoNdV2O8 from the Powder Diffraction File database. We extended this methodology to new materials created in our lab at high pressure with previously unsolved structures and found the new binary compounds Rh3Bi, RuBi2, and KBi3. We expect that our model will open avenues toward materials discovery under conditions which preclude single crystal growth and toward automated materials discovery pipelines, opening the door to new domains of chemistry.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
24.40
自引率
6.00%
发文量
2398
审稿时长
1.6 months
期刊介绍: The flagship journal of the American Chemical Society, known as the Journal of the American Chemical Society (JACS), has been a prestigious publication since its establishment in 1879. It holds a preeminent position in the field of chemistry and related interdisciplinary sciences. JACS is committed to disseminating cutting-edge research papers, covering a wide range of topics, and encompasses approximately 19,000 pages of Articles, Communications, and Perspectives annually. With a weekly publication frequency, JACS plays a vital role in advancing the field of chemistry by providing essential research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信