{"title":"Predicting CME speed at 20\\(R_{\\odot }\\) using machine learning approaches","authors":"M. Hegde","doi":"10.1007/s10509-025-04482-z","DOIUrl":null,"url":null,"abstract":"<div><p>Coronal mass ejections (CMEs) are significant drivers of space weather, and accurately predicting their propagation speed is crucial for mitigating their impact on Earth’s environment. In this study, we leverage machine learning techniques to model and predict CME speed at 20<span>\\(R_{\\odot }\\)</span> utilizing data from the Coordinated Data Analysis Workshop catalog. We considered data from Solar Cycles 23 and 24, divided into their rising, maxima, decline, and minima phases, to train multivariate linear regression, Random Forest, and XGBoost machine learning models aimed at predicting CME speeds at 20<span>\\(R_{\\odot }\\)</span>. The machine learning models use linear speed, acceleration, width, and kinetic energy as input features to estimate CME speeds at 20<span>\\(R_{\\odot }\\)</span>. Our results indicate that Random Forest and XGBoost models significantly outperform linear regression model across all datasets, achieving high R<sup>2</sup> values (≈0.97) and low relative errors (6%) for most phases, especially during high solar activity. Feature importance analysis identifies CME linear speed and acceleration as the dominant predictors of CME speed at 20<span>\\(R_{\\odot }\\)</span>. This result is consistent with physical models, which describe CME propagation as being influenced primarily by initial speed and the drag force acting through acceleration or deceleration in the interplanetary medium. The trained models were applied to available events from Solar Cycle 25, to predict CME speeds at 20<span>\\(R_{\\odot }\\)</span>. The predicted values showed very good agreement with the actual speeds reported in the CDAW catalog. This successful application demonstrates the models’ generalizability and potential for forecasting future CME dynamics. Furthermore, such data-driven predictions can complement physics-based models—such as the Drag-Based Model—by providing reliable speed estimates at specific heliocentric distances, thereby enhancing the accuracy of space weather forecasts.</p></div>","PeriodicalId":8644,"journal":{"name":"Astrophysics and Space Science","volume":"370 9","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astrophysics and Space Science","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1007/s10509-025-04482-z","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Coronal mass ejections (CMEs) are significant drivers of space weather, and accurately predicting their propagation speed is crucial for mitigating their impact on Earth’s environment. In this study, we leverage machine learning techniques to model and predict CME speed at 20\(R_{\odot }\) utilizing data from the Coordinated Data Analysis Workshop catalog. We considered data from Solar Cycles 23 and 24, divided into their rising, maxima, decline, and minima phases, to train multivariate linear regression, Random Forest, and XGBoost machine learning models aimed at predicting CME speeds at 20\(R_{\odot }\). The machine learning models use linear speed, acceleration, width, and kinetic energy as input features to estimate CME speeds at 20\(R_{\odot }\). Our results indicate that Random Forest and XGBoost models significantly outperform linear regression model across all datasets, achieving high R2 values (≈0.97) and low relative errors (6%) for most phases, especially during high solar activity. Feature importance analysis identifies CME linear speed and acceleration as the dominant predictors of CME speed at 20\(R_{\odot }\). This result is consistent with physical models, which describe CME propagation as being influenced primarily by initial speed and the drag force acting through acceleration or deceleration in the interplanetary medium. The trained models were applied to available events from Solar Cycle 25, to predict CME speeds at 20\(R_{\odot }\). The predicted values showed very good agreement with the actual speeds reported in the CDAW catalog. This successful application demonstrates the models’ generalizability and potential for forecasting future CME dynamics. Furthermore, such data-driven predictions can complement physics-based models—such as the Drag-Based Model—by providing reliable speed estimates at specific heliocentric distances, thereby enhancing the accuracy of space weather forecasts.
日冕物质抛射(cme)是空间天气的重要驱动因素,准确预测其传播速度对于减轻其对地球环境的影响至关重要。在这项研究中,我们利用机器学习技术来模拟和预测20 \(R_{\odot }\)的CME速度,利用协调数据分析研讨会目录中的数据。我们考虑了太阳周期23和24的数据,将其分为上升,最大,下降和最小阶段,以训练多元线性回归,随机森林和XGBoost机器学习模型,旨在预测20日CME速度\(R_{\odot }\)。机器学习模型使用线性速度、加速度、宽度和动能作为输入特征来估计CME速度为20 \(R_{\odot }\)。我们的研究结果表明,随机森林和XGBoost模型在所有数据集上都明显优于线性回归模型,实现了高R2值(≈0.97)和低相对误差(6%) for most phases, especially during high solar activity. Feature importance analysis identifies CME linear speed and acceleration as the dominant predictors of CME speed at 20\(R_{\odot }\). This result is consistent with physical models, which describe CME propagation as being influenced primarily by initial speed and the drag force acting through acceleration or deceleration in the interplanetary medium. The trained models were applied to available events from Solar Cycle 25, to predict CME speeds at 20\(R_{\odot }\). The predicted values showed very good agreement with the actual speeds reported in the CDAW catalog. This successful application demonstrates the models’ generalizability and potential for forecasting future CME dynamics. Furthermore, such data-driven predictions can complement physics-based models—such as the Drag-Based Model—by providing reliable speed estimates at specific heliocentric distances, thereby enhancing the accuracy of space weather forecasts.
期刊介绍:
Astrophysics and Space Science publishes original contributions and invited reviews covering the entire range of astronomy, astrophysics, astrophysical cosmology, planetary and space science and the astrophysical aspects of astrobiology. This includes both observational and theoretical research, the techniques of astronomical instrumentation and data analysis and astronomical space instrumentation. We particularly welcome papers in the general fields of high-energy astrophysics, astrophysical and astrochemical studies of the interstellar medium including star formation, planetary astrophysics, the formation and evolution of galaxies and the evolution of large scale structure in the Universe. Papers in mathematical physics or in general relativity which do not establish clear astrophysical applications will no longer be considered.
The journal also publishes topically selected special issues in research fields of particular scientific interest. These consist of both invited reviews and original research papers. Conference proceedings will not be considered. All papers published in the journal are subject to thorough and strict peer-reviewing.
Astrophysics and Space Science features short publication times after acceptance and colour printing free of charge.