I. M. Gadzhiev, O. G. Barinov, I. N. Myagkova, S. A. Dolenko
{"title":"Using Classification Methods in Forecasting the Level of Geomagnetic Field Disturbance Based on the Kp-Index","authors":"I. M. Gadzhiev, O. G. Barinov, I. N. Myagkova, S. A. Dolenko","doi":"10.1134/S0016793224600140","DOIUrl":null,"url":null,"abstract":"<p>The paper explores the possibilities of using data classification methods when forecasting time series of the geomagnetic <i>Kp</i>-index by machine learning methods. To classify categories of the <i>Kp</i>-index based on the degree of disturbance, linear and logistic regression, random forest, gradient boosting on top of decision trees, and artificial neural networks of various architectures are used. The results of these methods are compared with a trivial inertial forecast (the statistical indicators of which for problems of this type are always high) at horizons from 3 h to 1 day in 3-h increments. The problem of choosing a cross-validation scheme for selecting the model hyperparameters, ways to overcome the imbalance of categories, the relative importance of input features, as well as the dependence of the results on the test sample (beginning of the 25th solar activity cycle) on inclusion in the training sample of data from the 23rd and 24th cycles or only the 24th cycles are studied. Based on the results, conclusions are drawn about the preferred methods for classifying values of the <i>Kp</i>-index based on the level of geomagnetic disturbance. Ways for further research and possible improvement of the classification quality are outlined, including for determining the characteristic hidden states of Earth’s magnetosphere as a dynamic system in order to improve the quality of forecasting geomagnetic indices.</p>","PeriodicalId":55597,"journal":{"name":"Geomagnetism and Aeronomy","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geomagnetism and Aeronomy","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1134/S0016793224600140","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
The paper explores the possibilities of using data classification methods when forecasting time series of the geomagnetic Kp-index by machine learning methods. To classify categories of the Kp-index based on the degree of disturbance, linear and logistic regression, random forest, gradient boosting on top of decision trees, and artificial neural networks of various architectures are used. The results of these methods are compared with a trivial inertial forecast (the statistical indicators of which for problems of this type are always high) at horizons from 3 h to 1 day in 3-h increments. The problem of choosing a cross-validation scheme for selecting the model hyperparameters, ways to overcome the imbalance of categories, the relative importance of input features, as well as the dependence of the results on the test sample (beginning of the 25th solar activity cycle) on inclusion in the training sample of data from the 23rd and 24th cycles or only the 24th cycles are studied. Based on the results, conclusions are drawn about the preferred methods for classifying values of the Kp-index based on the level of geomagnetic disturbance. Ways for further research and possible improvement of the classification quality are outlined, including for determining the characteristic hidden states of Earth’s magnetosphere as a dynamic system in order to improve the quality of forecasting geomagnetic indices.
期刊介绍:
Geomagnetism and Aeronomy is a bimonthly periodical that covers the fields of interplanetary space; geoeffective solar events; the magnetosphere; the ionosphere; the upper and middle atmosphere; the action of solar variability and activity on atmospheric parameters and climate; the main magnetic field and its secular variations, excursion, and inversion; and other related topics.