Chaorong Mi , An Yi , Jingyuan Xue , Changming Dong , Haixia Shan
{"title":"Forecasting summer marine heatwaves in the South China Sea using explainable machine learning models","authors":"Chaorong Mi , An Yi , Jingyuan Xue , Changming Dong , Haixia Shan","doi":"10.1016/j.dsr.2025.104457","DOIUrl":null,"url":null,"abstract":"<div><div>Marine Heatwaves (MHWs), classified as extreme oceanographic meteorological phenomena, have profound effects on ecological systems and socio-economic activities within the South China Sea (SCS). Multiple machine learning (ML) algorithms, specifically Random Forest (RF) and Ridge models, are employed to predict summer Sea Surface Temperature Anomalies (SSTA) and occurrence of MHWs in the SCS. The ML forecast results are also compared with climatology forecasts, persistence forecasts, and the European Centre for Medium-Range Weather Forecasts Sub-seasonal to Seasonal (ECMWF S2S) hindcasts. The results reveal that for regression forecasting with a one-week lead time, the Ridge model performs the best among all models. However, for longer lead times, all models tend towards climatology forecasts. When it comes to the classification forecasting, the RF model demonstrates superior performance to predict extreme MHWs compared to typical MHWs. Moreover, the feature importance is measured via the mean decrease in impurity (MDI) incorporated in the RF model to identify the predictors that contribute to predicting the MHWs. The SHapley Additive exPlanation (SHAP) algorithm is additionally employed to evaluate both the positive and negative impacts of these predictors on the model. SSTA and the Indian Ocean Basin-Wide (IOBW) index are the most two critical forecasting factors influencing the occurrence of MHWs, exhibiting a pronounced positive contribution to the model. The insights derived from this study are expected to provide strong support for the MHWs early warning system in the SCS and provide essential information for the conservation of marine ecosystems and the management of fisheries resources.</div></div>","PeriodicalId":51009,"journal":{"name":"Deep-Sea Research Part I-Oceanographic Research Papers","volume":"218 ","pages":"Article 104457"},"PeriodicalIF":2.3000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Deep-Sea Research Part I-Oceanographic Research Papers","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0967063725000160","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OCEANOGRAPHY","Score":null,"Total":0}
引用次数: 0
Abstract
Marine Heatwaves (MHWs), classified as extreme oceanographic meteorological phenomena, have profound effects on ecological systems and socio-economic activities within the South China Sea (SCS). Multiple machine learning (ML) algorithms, specifically Random Forest (RF) and Ridge models, are employed to predict summer Sea Surface Temperature Anomalies (SSTA) and occurrence of MHWs in the SCS. The ML forecast results are also compared with climatology forecasts, persistence forecasts, and the European Centre for Medium-Range Weather Forecasts Sub-seasonal to Seasonal (ECMWF S2S) hindcasts. The results reveal that for regression forecasting with a one-week lead time, the Ridge model performs the best among all models. However, for longer lead times, all models tend towards climatology forecasts. When it comes to the classification forecasting, the RF model demonstrates superior performance to predict extreme MHWs compared to typical MHWs. Moreover, the feature importance is measured via the mean decrease in impurity (MDI) incorporated in the RF model to identify the predictors that contribute to predicting the MHWs. The SHapley Additive exPlanation (SHAP) algorithm is additionally employed to evaluate both the positive and negative impacts of these predictors on the model. SSTA and the Indian Ocean Basin-Wide (IOBW) index are the most two critical forecasting factors influencing the occurrence of MHWs, exhibiting a pronounced positive contribution to the model. The insights derived from this study are expected to provide strong support for the MHWs early warning system in the SCS and provide essential information for the conservation of marine ecosystems and the management of fisheries resources.
期刊介绍:
Deep-Sea Research Part I: Oceanographic Research Papers is devoted to the publication of the results of original scientific research, including theoretical work of evident oceanographic applicability; and the solution of instrumental or methodological problems with evidence of successful use. The journal is distinguished by its interdisciplinary nature and its breadth, covering the geological, physical, chemical and biological aspects of the ocean and its boundaries with the sea floor and the atmosphere. In addition to regular "Research Papers" and "Instruments and Methods" papers, briefer communications may be published as "Notes". Supplemental matter, such as extensive data tables or graphs and multimedia content, may be published as electronic appendices.