电子分子电离截面的数据驱动机器学习方法

IF 1.5 4区 物理与天体物理 Q3 OPTICS
A L Harris, J Nepomuceno
{"title":"电子分子电离截面的数据驱动机器学习方法","authors":"A L Harris, J Nepomuceno","doi":"10.1088/1361-6455/ad2185","DOIUrl":null,"url":null,"abstract":"Despite their importance in a wide variety of applications, the estimation of ionization cross sections for large molecules continues to present challenges for both experiment and theory. Machine learning (ML) algorithms have been shown to be an effective mechanism for estimating cross section data for atomic targets and a select number of molecular targets. We present an efficient ML model for predicting ionization cross sections for a broad array of molecular targets. Our model is a 3-layer neural network that is trained using published experimental datasets. There is minimal input to the network, making it widely applicable. We show that with training on as few as 10 molecular datasets, the network is able to predict the experimental cross sections of additional molecules with an accuracy similar to experimental uncertainties in existing data. As the number of training molecular datasets increased, the network’s predictions became more accurate and, in the worst case, were within 30% of accepted experimental values. In many cases, predictions were within 10% of accepted values. Using a network trained on datasets for 25 different molecules, we present predictions for an additional 27 molecules, including alkanes, alkenes, molecules with ring structures, and DNA nucleotide bases.","PeriodicalId":16826,"journal":{"name":"Journal of Physics B: Atomic, Molecular and Optical Physics","volume":"10 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A data-driven machine learning approach for electron-molecule ionization cross sections\",\"authors\":\"A L Harris, J Nepomuceno\",\"doi\":\"10.1088/1361-6455/ad2185\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite their importance in a wide variety of applications, the estimation of ionization cross sections for large molecules continues to present challenges for both experiment and theory. Machine learning (ML) algorithms have been shown to be an effective mechanism for estimating cross section data for atomic targets and a select number of molecular targets. We present an efficient ML model for predicting ionization cross sections for a broad array of molecular targets. Our model is a 3-layer neural network that is trained using published experimental datasets. There is minimal input to the network, making it widely applicable. We show that with training on as few as 10 molecular datasets, the network is able to predict the experimental cross sections of additional molecules with an accuracy similar to experimental uncertainties in existing data. As the number of training molecular datasets increased, the network’s predictions became more accurate and, in the worst case, were within 30% of accepted experimental values. In many cases, predictions were within 10% of accepted values. Using a network trained on datasets for 25 different molecules, we present predictions for an additional 27 molecules, including alkanes, alkenes, molecules with ring structures, and DNA nucleotide bases.\",\"PeriodicalId\":16826,\"journal\":{\"name\":\"Journal of Physics B: Atomic, Molecular and Optical Physics\",\"volume\":\"10 1\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2024-02-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Physics B: Atomic, Molecular and Optical Physics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6455/ad2185\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Physics B: Atomic, Molecular and Optical Physics","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1088/1361-6455/ad2185","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

摘要

尽管大分子电离截面在各种应用中都很重要,但其估算仍然是实验和理论的挑战。机器学习(ML)算法已被证明是估算原子目标和部分分子目标截面数据的有效机制。我们提出了一种高效的 ML 模型,用于预测各种分子目标的电离截面。我们的模型是一个 3 层神经网络,使用已发布的实验数据集进行训练。该网络的输入量极小,因此适用范围很广。我们的研究表明,只需在 10 个分子数据集上进行训练,该网络就能预测更多分子的实验截面,其准确度与现有数据的实验不确定性相似。随着训练分子数据集数量的增加,网络预测的准确度也越来越高,在最糟糕的情况下,预测值在公认实验值的 30% 以内。在许多情况下,预测值在公认值的 10%以内。利用在 25 种不同分子数据集上训练的网络,我们对另外 27 种分子进行了预测,其中包括烷、烯、具有环状结构的分子和 DNA 核苷酸碱基。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A data-driven machine learning approach for electron-molecule ionization cross sections
Despite their importance in a wide variety of applications, the estimation of ionization cross sections for large molecules continues to present challenges for both experiment and theory. Machine learning (ML) algorithms have been shown to be an effective mechanism for estimating cross section data for atomic targets and a select number of molecular targets. We present an efficient ML model for predicting ionization cross sections for a broad array of molecular targets. Our model is a 3-layer neural network that is trained using published experimental datasets. There is minimal input to the network, making it widely applicable. We show that with training on as few as 10 molecular datasets, the network is able to predict the experimental cross sections of additional molecules with an accuracy similar to experimental uncertainties in existing data. As the number of training molecular datasets increased, the network’s predictions became more accurate and, in the worst case, were within 30% of accepted experimental values. In many cases, predictions were within 10% of accepted values. Using a network trained on datasets for 25 different molecules, we present predictions for an additional 27 molecules, including alkanes, alkenes, molecules with ring structures, and DNA nucleotide bases.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.60
自引率
6.20%
发文量
182
审稿时长
2.8 months
期刊介绍: Published twice-monthly (24 issues per year), Journal of Physics B: Atomic, Molecular and Optical Physics covers the study of atoms, ions, molecules and clusters, and their structure and interactions with particles, photons or fields. The journal also publishes articles dealing with those aspects of spectroscopy, quantum optics and non-linear optics, laser physics, astrophysics, plasma physics, chemical physics, optical cooling and trapping and other investigations where the objects of study are the elementary atomic, ionic or molecular properties of processes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信