{"title":"x射线衍射中用于材料发现和表征的机器学习","authors":"Connor Davel , Nazanin Bassiri-Gharb , Juan-Pablo Correa-Baena","doi":"10.1016/j.matt.2025.102272","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning (ML) is a promising analytical method for large high-throughput, <em>in situ</em>, and <em>operando</em> X-ray diffraction (XRD) datasets. However, ML methods are, by default, physics agnostic and must therefore be interpreted carefully. In this review, we survey how supervised ML methods are used to predict symmetries and phases in pure and mixed-composition materials, and we highlight challenges related to experimental artifacts and model interpretation. We also review recent uses of unsupervised ML methods in the extraction of patterns hidden in high-dimensional data, such as in <em>in situ</em> and microscopic studies. Finally, we discuss the importance of problem formulation, data transferability, and reporting, leveraging examples from the literature, and we provide various resources throughout to expedite the learning curve for readers new to XRD or ML. We advocate for greater scrutiny of ML methods and how they are reported in the literature, and we explain how to conduct data-driven research responsibly.</div></div>","PeriodicalId":388,"journal":{"name":"Matter","volume":"8 9","pages":"Article 102272"},"PeriodicalIF":17.5000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning in X-ray diffraction for materials discovery and characterization\",\"authors\":\"Connor Davel , Nazanin Bassiri-Gharb , Juan-Pablo Correa-Baena\",\"doi\":\"10.1016/j.matt.2025.102272\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Machine learning (ML) is a promising analytical method for large high-throughput, <em>in situ</em>, and <em>operando</em> X-ray diffraction (XRD) datasets. However, ML methods are, by default, physics agnostic and must therefore be interpreted carefully. In this review, we survey how supervised ML methods are used to predict symmetries and phases in pure and mixed-composition materials, and we highlight challenges related to experimental artifacts and model interpretation. We also review recent uses of unsupervised ML methods in the extraction of patterns hidden in high-dimensional data, such as in <em>in situ</em> and microscopic studies. Finally, we discuss the importance of problem formulation, data transferability, and reporting, leveraging examples from the literature, and we provide various resources throughout to expedite the learning curve for readers new to XRD or ML. We advocate for greater scrutiny of ML methods and how they are reported in the literature, and we explain how to conduct data-driven research responsibly.</div></div>\",\"PeriodicalId\":388,\"journal\":{\"name\":\"Matter\",\"volume\":\"8 9\",\"pages\":\"Article 102272\"},\"PeriodicalIF\":17.5000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Matter\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2590238525003157\",\"RegionNum\":1,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATERIALS SCIENCE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Matter","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590238525003157","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
Machine learning in X-ray diffraction for materials discovery and characterization
Machine learning (ML) is a promising analytical method for large high-throughput, in situ, and operando X-ray diffraction (XRD) datasets. However, ML methods are, by default, physics agnostic and must therefore be interpreted carefully. In this review, we survey how supervised ML methods are used to predict symmetries and phases in pure and mixed-composition materials, and we highlight challenges related to experimental artifacts and model interpretation. We also review recent uses of unsupervised ML methods in the extraction of patterns hidden in high-dimensional data, such as in in situ and microscopic studies. Finally, we discuss the importance of problem formulation, data transferability, and reporting, leveraging examples from the literature, and we provide various resources throughout to expedite the learning curve for readers new to XRD or ML. We advocate for greater scrutiny of ML methods and how they are reported in the literature, and we explain how to conduct data-driven research responsibly.
期刊介绍:
Matter, a monthly journal affiliated with Cell, spans the broad field of materials science from nano to macro levels,covering fundamentals to applications. Embracing groundbreaking technologies,it includes full-length research articles,reviews, perspectives,previews, opinions, personnel stories, and general editorial content.
Matter aims to be the primary resource for researchers in academia and industry, inspiring the next generation of materials scientists.