Timing of the solar wind propagation delay between L1 and Earth based on machine learning

IF 3.4 2区 物理与天体物理 Q2 ASTRONOMY & ASTROPHYSICS
C. Baumann, A. McCloskey
{"title":"Timing of the solar wind propagation delay between L1 and Earth based on machine learning","authors":"C. Baumann, A. McCloskey","doi":"10.1051/swsc/2021026","DOIUrl":null,"url":null,"abstract":"Erroneous GNSS positioning, failures in spacecraft operations and power outages due to geomagnetically induced currents are severe threats originating from space weather. Knowing the potential impacts on modern society in advance is key for many end-user applications. This covers not only the timing of severe geomagnetic storms but also predictions of substorm onsets at polar latitudes. In this study, we aim at contributing to the timing problem of space weather impacts and propose a new method to predict the solar wind propagation delay between Lagrangian point L1 and the Earth based on machine learning, specifically decision tree models. The propagation delay is measured from the identification of interplanetary discontinuities detected by the advanced composition explorer (ACE) and their subsequent sudden commencements in the magnetosphere recorded by ground-based magnetometers. A database of the propagation delay has been constructed on this principle including 380 interplanetary shocks with data ranging from 1998 to 2018. The feature set of the machine learning approach consists of six features, namely the three components of each the solar wind speed and position of ACE around L1. The performance assessment of the machine learning model is examined based on of 10-fold cross-validation. The machine learning results are compared to physics-based models, i.e., the flat propagation delay and the more sophisticated method based on the normal vector of solar wind discontinuities (vector delay). After hyperparameter optimization, the trained gradient boosting (GB) model is the best machine learning model among the tested ones. The GB model achieves an RMSE of 4.5 min concerning the measured solar wind propagation delay and also outperforms the physical flat and vector delay models by 50% and 15% respectively. To increase the confidence in the predictions of the trained GB model, we perform a performance validation, provide drop-column feature importance and analyze the feature impact on the model output with Shapley values. The major advantage of the machine learning approach is its simplicity when it comes to its application. After training, values for the solar wind speed and spacecraft position from only one datapoint have to be fed into the algorithm for a good prediction.","PeriodicalId":17034,"journal":{"name":"Journal of Space Weather and Space Climate","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2021-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Space Weather and Space Climate","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1051/swsc/2021026","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 8

Abstract

Erroneous GNSS positioning, failures in spacecraft operations and power outages due to geomagnetically induced currents are severe threats originating from space weather. Knowing the potential impacts on modern society in advance is key for many end-user applications. This covers not only the timing of severe geomagnetic storms but also predictions of substorm onsets at polar latitudes. In this study, we aim at contributing to the timing problem of space weather impacts and propose a new method to predict the solar wind propagation delay between Lagrangian point L1 and the Earth based on machine learning, specifically decision tree models. The propagation delay is measured from the identification of interplanetary discontinuities detected by the advanced composition explorer (ACE) and their subsequent sudden commencements in the magnetosphere recorded by ground-based magnetometers. A database of the propagation delay has been constructed on this principle including 380 interplanetary shocks with data ranging from 1998 to 2018. The feature set of the machine learning approach consists of six features, namely the three components of each the solar wind speed and position of ACE around L1. The performance assessment of the machine learning model is examined based on of 10-fold cross-validation. The machine learning results are compared to physics-based models, i.e., the flat propagation delay and the more sophisticated method based on the normal vector of solar wind discontinuities (vector delay). After hyperparameter optimization, the trained gradient boosting (GB) model is the best machine learning model among the tested ones. The GB model achieves an RMSE of 4.5 min concerning the measured solar wind propagation delay and also outperforms the physical flat and vector delay models by 50% and 15% respectively. To increase the confidence in the predictions of the trained GB model, we perform a performance validation, provide drop-column feature importance and analyze the feature impact on the model output with Shapley values. The major advantage of the machine learning approach is its simplicity when it comes to its application. After training, values for the solar wind speed and spacecraft position from only one datapoint have to be fed into the algorithm for a good prediction.
基于机器学习的L1与地球间太阳风传播延迟时间
全球导航卫星系统定位错误、航天器运行故障以及地磁感应电流导致的停电都是源自太空天气的严重威胁。提前了解对现代社会的潜在影响是许多最终用户应用程序的关键。这不仅包括严重地磁风暴的时间,还包括极纬度亚风暴爆发的预测。在这项研究中,我们旨在为空间天气影响的时间问题做出贡献,并提出了一种基于机器学习的新方法来预测拉格朗日点L1和地球之间的太阳风传播延迟,特别是决策树模型。传播延迟是通过高级成分探测器(ACE)探测到的行星际不连续性的识别以及地面磁力计记录的磁层中随后的突然开始来测量的。根据这一原理,建立了一个传播延迟数据库,其中包括380次行星际撞击,数据范围从1998年到2018年。机器学习方法的特征集由六个特征组成,即每个特征的三个分量——L1周围的太阳风速和ACE的位置。基于10倍交叉验证对机器学习模型的性能评估进行了检验。将机器学习结果与基于物理的模型进行比较,即平面传播延迟和基于太阳风不连续性的法向量(向量延迟)的更复杂的方法。经过超参数优化后,训练的梯度提升(GB)模型是测试模型中最好的机器学习模型。GB模型在测量的太阳风传播延迟方面实现了4.5分钟的RMSE,并且还分别比物理平面和矢量延迟模型好50%和15%。为了提高训练后的GB模型预测的可信度,我们进行了性能验证,提供了下降列特征的重要性,并用Shapley值分析了特征对模型输出的影响。机器学习方法的主要优点是在应用方面简单。训练后,只有一个数据点的太阳风速和航天器位置的值必须输入到算法中,才能进行良好的预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Space Weather and Space Climate
Journal of Space Weather and Space Climate ASTRONOMY & ASTROPHYSICS-GEOCHEMISTRY & GEOPHYSICS
CiteScore
6.90
自引率
6.10%
发文量
40
审稿时长
8 weeks
期刊介绍: The Journal of Space Weather and Space Climate (SWSC) is an international multi-disciplinary and interdisciplinary peer-reviewed open access journal which publishes papers on all aspects of space weather and space climate from a broad range of scientific and technical fields including solar physics, space plasma physics, aeronomy, planetology, radio science, geophysics, biology, medicine, astronautics, aeronautics, electrical engineering, meteorology, climatology, mathematics, economy, informatics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信