Hao Wang, Jiajun Zhong, Yikun Li, Junrong Zhang, Rong Du
{"title":"为卷积神经网络设计一个数据集来预测符合消光规律的空间群","authors":"Hao Wang, Jiajun Zhong, Yikun Li, Junrong Zhang, Rong Du","doi":"10.1107/S1600576725002316","DOIUrl":null,"url":null,"abstract":"<p>A dataset of one-dimensional powder diffraction patterns has been designed with a new strategy to train convolutional neural networks for predicting space groups. The diffraction patterns were calculated from lattice parameters and extinction laws, instead of the traditional approach of generating them from a crystallographic database. The new strategy is shown to be more effective than the conventional method. As a result, the model trained on the cubic and tetragonal training sets from the newly designed dataset achieves a prediction accuracy that matches the theoretical maximum calculated on the basis of extinction laws. These results demonstrate that machine-learning-based prediction can be both physically reasonable and reliable. The model trained on this newly designed dataset shows excellent generalization capability, much better than for one trained on a traditionally designed dataset.</p>","PeriodicalId":48737,"journal":{"name":"Journal of Applied Crystallography","volume":"58 3","pages":"711-717"},"PeriodicalIF":5.2000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Designing a dataset for convolutional neural networks to predict space groups consistent with extinction laws\",\"authors\":\"Hao Wang, Jiajun Zhong, Yikun Li, Junrong Zhang, Rong Du\",\"doi\":\"10.1107/S1600576725002316\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>A dataset of one-dimensional powder diffraction patterns has been designed with a new strategy to train convolutional neural networks for predicting space groups. The diffraction patterns were calculated from lattice parameters and extinction laws, instead of the traditional approach of generating them from a crystallographic database. The new strategy is shown to be more effective than the conventional method. As a result, the model trained on the cubic and tetragonal training sets from the newly designed dataset achieves a prediction accuracy that matches the theoretical maximum calculated on the basis of extinction laws. These results demonstrate that machine-learning-based prediction can be both physically reasonable and reliable. The model trained on this newly designed dataset shows excellent generalization capability, much better than for one trained on a traditionally designed dataset.</p>\",\"PeriodicalId\":48737,\"journal\":{\"name\":\"Journal of Applied Crystallography\",\"volume\":\"58 3\",\"pages\":\"711-717\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Applied Crystallography\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1107/S1600576725002316\",\"RegionNum\":3,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Crystallography","FirstCategoryId":"88","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1107/S1600576725002316","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Designing a dataset for convolutional neural networks to predict space groups consistent with extinction laws
A dataset of one-dimensional powder diffraction patterns has been designed with a new strategy to train convolutional neural networks for predicting space groups. The diffraction patterns were calculated from lattice parameters and extinction laws, instead of the traditional approach of generating them from a crystallographic database. The new strategy is shown to be more effective than the conventional method. As a result, the model trained on the cubic and tetragonal training sets from the newly designed dataset achieves a prediction accuracy that matches the theoretical maximum calculated on the basis of extinction laws. These results demonstrate that machine-learning-based prediction can be both physically reasonable and reliable. The model trained on this newly designed dataset shows excellent generalization capability, much better than for one trained on a traditionally designed dataset.
期刊介绍:
Many research topics in condensed matter research, materials science and the life sciences make use of crystallographic methods to study crystalline and non-crystalline matter with neutrons, X-rays and electrons. Articles published in the Journal of Applied Crystallography focus on these methods and their use in identifying structural and diffusion-controlled phase transformations, structure-property relationships, structural changes of defects, interfaces and surfaces, etc. Developments of instrumentation and crystallographic apparatus, theory and interpretation, numerical analysis and other related subjects are also covered. The journal is the primary place where crystallographic computer program information is published.