Automated selection of nanoparticle models for small-angle X-ray scattering data analysis using machine learning.

IF 1.9 4区 材料科学 Q3 CHEMISTRY, MULTIDISCIPLINARY
Nicolas Monge, Alexis Deschamps, Massih Reza Amini
{"title":"Automated selection of nanoparticle models for small-angle X-ray scattering data analysis using machine learning.","authors":"Nicolas Monge, Alexis Deschamps, Massih Reza Amini","doi":"10.1107/S2053273324000950","DOIUrl":null,"url":null,"abstract":"<p><p>Small-angle X-ray scattering (SAXS) is widely used to analyze the shape and size of nanoparticles in solution. A multitude of models, describing the SAXS intensity resulting from nanoparticles of various shapes, have been developed by the scientific community and are used for data analysis. Choosing the optimal model is a crucial step in data analysis, which can be difficult and time-consuming, especially for non-expert users. An algorithm is proposed, based on machine learning, representation learning and SAXS-specific preprocessing methods, which instantly selects the nanoparticle model best suited to describe SAXS data. The different algorithms compared are trained and evaluated on a simulated database. This database includes 75 000 scattering spectra from nine nanoparticle models, and realistically simulates two distinct device configurations. It will be made freely available to serve as a basis of comparison for future work. Deploying a universal solution for automatic nanoparticle model selection is a challenge made more difficult by the diversity of SAXS instruments and their flexible settings. The poor transferability of classification rules learned on one device configuration to another is highlighted. It is shown that training on several device configurations enables the algorithm to be generalized, without degrading performance compared with configuration-specific training. Finally, the classification algorithm is evaluated on a real data set obtained by performing SAXS experiments on nanoparticles for each of the instrumental configurations, which have been characterized by transmission electron microscopy. This data set, although very limited, allows estimation of the transferability of the classification rules learned on simulated data to real data.</p>","PeriodicalId":106,"journal":{"name":"Acta Crystallographica Section A: Foundations and Advances","volume":" ","pages":"202-212"},"PeriodicalIF":1.9000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10913671/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Crystallographica Section A: Foundations and Advances","FirstCategoryId":"1","ListUrlMain":"https://doi.org/10.1107/S2053273324000950","RegionNum":4,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/29 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Small-angle X-ray scattering (SAXS) is widely used to analyze the shape and size of nanoparticles in solution. A multitude of models, describing the SAXS intensity resulting from nanoparticles of various shapes, have been developed by the scientific community and are used for data analysis. Choosing the optimal model is a crucial step in data analysis, which can be difficult and time-consuming, especially for non-expert users. An algorithm is proposed, based on machine learning, representation learning and SAXS-specific preprocessing methods, which instantly selects the nanoparticle model best suited to describe SAXS data. The different algorithms compared are trained and evaluated on a simulated database. This database includes 75 000 scattering spectra from nine nanoparticle models, and realistically simulates two distinct device configurations. It will be made freely available to serve as a basis of comparison for future work. Deploying a universal solution for automatic nanoparticle model selection is a challenge made more difficult by the diversity of SAXS instruments and their flexible settings. The poor transferability of classification rules learned on one device configuration to another is highlighted. It is shown that training on several device configurations enables the algorithm to be generalized, without degrading performance compared with configuration-specific training. Finally, the classification algorithm is evaluated on a real data set obtained by performing SAXS experiments on nanoparticles for each of the instrumental configurations, which have been characterized by transmission electron microscopy. This data set, although very limited, allows estimation of the transferability of the classification rules learned on simulated data to real data.

Abstract Image

利用机器学习自动选择用于小角 X 射线散射数据分析的纳米粒子模型。
小角 X 射线散射(SAXS)被广泛用于分析溶液中纳米粒子的形状和大小。科学界开发了大量模型来描述各种形状的纳米粒子产生的 SAXS 强度,并用于数据分析。选择最佳模型是数据分析的关键步骤,但这一步骤可能既困难又耗时,尤其是对于非专业用户而言。本文提出了一种基于机器学习、表征学习和 SAXS 特定预处理方法的算法,可即时选择最适合描述 SAXS 数据的纳米粒子模型。所比较的不同算法在模拟数据库上进行了训练和评估。该数据库包括来自九种纳米粒子模型的 75000 个散射光谱,真实地模拟了两种不同的设备配置。该数据库将免费提供,作为未来工作的比较基础。由于 SAXS 仪器的多样性及其灵活的设置,为自动选择纳米粒子模型部署通用解决方案变得更加困难。在一种设备配置上学习到的分类规则在另一种设备配置上的可移植性很差,这一点得到了强调。结果表明,与针对特定配置的训练相比,在多个设备配置上进行训练可使算法通用化,而不会降低性能。最后,通过对透射电子显微镜表征的每种仪器配置的纳米粒子进行 SAXS 实验,在实际数据集上对分类算法进行了评估。这个数据集虽然非常有限,但可以估计从模拟数据中学到的分类规则对真实数据的可移植性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Acta Crystallographica Section A: Foundations and Advances
Acta Crystallographica Section A: Foundations and Advances CHEMISTRY, MULTIDISCIPLINARYCRYSTALLOGRAPH-CRYSTALLOGRAPHY
CiteScore
2.60
自引率
11.10%
发文量
419
期刊介绍: Acta Crystallographica Section A: Foundations and Advances publishes articles reporting advances in the theory and practice of all areas of crystallography in the broadest sense. As well as traditional crystallography, this includes nanocrystals, metacrystals, amorphous materials, quasicrystals, synchrotron and XFEL studies, coherent scattering, diffraction imaging, time-resolved studies and the structure of strain and defects in materials. The journal has two parts, a rapid-publication Advances section and the traditional Foundations section. Articles for the Advances section are of particularly high value and impact. They receive expedited treatment and may be highlighted by an accompanying scientific commentary article and a press release. Further details are given in the November 2013 Editorial. The central themes of the journal are, on the one hand, experimental and theoretical studies of the properties and arrangements of atoms, ions and molecules in condensed matter, periodic, quasiperiodic or amorphous, ideal or real, and, on the other, the theoretical and experimental aspects of the various methods to determine these properties and arrangements.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信