Prediction of the Transit Time of Coronal Mass Ejections with an Ensemble Machine-learning Method

IF 8.6 1区 物理与天体物理 Q1 ASTRONOMY & ASTROPHYSICS
Y. Yang, J. J. Liu, X. S. Feng, P. F. Chen, B. Zhang
{"title":"Prediction of the Transit Time of Coronal Mass Ejections with an Ensemble Machine-learning Method","authors":"Y. Yang, J. J. Liu, X. S. Feng, P. F. Chen, B. Zhang","doi":"10.3847/1538-4365/acf218","DOIUrl":null,"url":null,"abstract":"Abstract Coronal mass ejections (CMEs), a kind of violent solar eruptive activity, can exert a significant impact on space weather. When arriving at the Earth, they interact with the geomagnetic field, which can boost the energy supply to the geomagnetic field and may further result in geomagnetic storms, thus having potentially catastrophic effects on human activities. Therefore, accurate forecasting of the transit time of CMEs from the Sun to the Earth is vital for mitigating the relevant losses brought by them. XGBoost, an ensemble model that has better performance in some other fields, is applied to the space weather forecast for the first time. During multiple tests with random data splits, the best mean absolute error (MAE) of ∼5.72 hr was obtained, and in this test, 62% of the test CMEs had absolute arrival time error of less than 5.72 hr. The average MAE over all random tests was ∼10 hr. It indicates that our method has a better predictive potential and baseline. Moreover, we introduce two effective feature importance ranking methods. One is the information gain method, a built-in method of ensemble models. The other is the permutation method. These two methods combine the learning process of the model and its performance to rank the CME features, respectively. Compared with the direct correlation analysis on the sample data set, they can help select the important features that closely match the model. These two methods can assist researchers to process large sample data sets, which often require feature selection in advance.","PeriodicalId":8588,"journal":{"name":"Astrophysical Journal Supplement Series","volume":"43 1","pages":"0"},"PeriodicalIF":8.6000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astrophysical Journal Supplement Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3847/1538-4365/acf218","RegionNum":1,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Coronal mass ejections (CMEs), a kind of violent solar eruptive activity, can exert a significant impact on space weather. When arriving at the Earth, they interact with the geomagnetic field, which can boost the energy supply to the geomagnetic field and may further result in geomagnetic storms, thus having potentially catastrophic effects on human activities. Therefore, accurate forecasting of the transit time of CMEs from the Sun to the Earth is vital for mitigating the relevant losses brought by them. XGBoost, an ensemble model that has better performance in some other fields, is applied to the space weather forecast for the first time. During multiple tests with random data splits, the best mean absolute error (MAE) of ∼5.72 hr was obtained, and in this test, 62% of the test CMEs had absolute arrival time error of less than 5.72 hr. The average MAE over all random tests was ∼10 hr. It indicates that our method has a better predictive potential and baseline. Moreover, we introduce two effective feature importance ranking methods. One is the information gain method, a built-in method of ensemble models. The other is the permutation method. These two methods combine the learning process of the model and its performance to rank the CME features, respectively. Compared with the direct correlation analysis on the sample data set, they can help select the important features that closely match the model. These two methods can assist researchers to process large sample data sets, which often require feature selection in advance.
用集成机器学习方法预测日冕物质抛射过境时间
日冕物质抛射(cme)是一种剧烈的太阳喷发活动,对空间天气产生重大影响。当到达地球时,它们与地磁场相互作用,可以增加地磁场的能量供应,并可能进一步导致地磁风暴,从而对人类活动产生潜在的灾难性影响。因此,准确预测日冕物质抛射从太阳到地球的穿越时间,对于减轻日冕物质抛射给地球带来的损失至关重要。XGBoost是一个在其他领域具有较好性能的集成模型,首次应用于空间天气预报。在随机数据分割的多次测试中,获得的最佳平均绝对误差(MAE)为~ 5.72小时,在该测试中,62%的测试cme的绝对到达时间误差小于5.72小时。所有随机试验的平均MAE为~ 10小时。结果表明,该方法具有较好的预测潜力和基线。此外,我们还介绍了两种有效的特征重要性排序方法。一种是信息增益法,一种集成模型的内置方法。另一种是排列法。这两种方法分别结合模型的学习过程及其性能对CME特征进行排序。与对样本数据集的直接相关分析相比,它们可以帮助选择与模型密切匹配的重要特征。这两种方法可以帮助研究人员处理通常需要提前选择特征的大样本数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Astrophysical Journal Supplement Series
Astrophysical Journal Supplement Series 地学天文-天文与天体物理
CiteScore
14.50
自引率
5.70%
发文量
264
审稿时长
2 months
期刊介绍: The Astrophysical Journal Supplement (ApJS) serves as an open-access journal that publishes significant articles featuring extensive data or calculations in the field of astrophysics. It also facilitates Special Issues, presenting thematically related papers simultaneously in a single volume.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信