{"title":"利用CMIP6模拟增强基于机器学习的季节降水预报","authors":"Enzo Pinheiro, Taha B.M.J. Ouarda","doi":"10.1016/j.atmosres.2025.108463","DOIUrl":null,"url":null,"abstract":"<div><div>The limited availability of observational and reanalysis data presents a significant challenge in training machine learning (ML) models for seasonal climate forecasting. Here, we show that training ML-based seasonal forecasting models with a larger number of individual simulations from CMIP6 models enhances their generalization ability and improves precipitation forecasts over South America. Using TelNet, a sequence-to-sequence machine learning model, we assess the performance of models trained with different numbers of CMIP6 simulations compared to those trained with ERA5 reanalysis and the CMIP6 ensemble mean. The results reveal that models trained with only a few CMIP6 simulations perform worse than those trained with ERA5, primarily due to instability during ML model tuning and reduced generalization ability. However, as the number of CMIP6 models increases, performance improves and surpasses both ERA5- and ensemble-mean-based ML models. Reliability and sharpness diagrams analysis further demonstrate that ML models trained with more CMIP6 simulations yield more confident and calibrated forecasts. Moreover, CMIP6-based TelNet constantly outperformed state-of-the-art dynamical models across different initialization months and lead times. This study underscores the potential of leveraging large multi-model dynamical simulations for robust ML-based seasonal climate forecasting.</div></div>","PeriodicalId":8600,"journal":{"name":"Atmospheric Research","volume":"329 ","pages":"Article 108463"},"PeriodicalIF":4.4000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing machine learning-based seasonal precipitation forecasting using CMIP6 simulations\",\"authors\":\"Enzo Pinheiro, Taha B.M.J. Ouarda\",\"doi\":\"10.1016/j.atmosres.2025.108463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The limited availability of observational and reanalysis data presents a significant challenge in training machine learning (ML) models for seasonal climate forecasting. Here, we show that training ML-based seasonal forecasting models with a larger number of individual simulations from CMIP6 models enhances their generalization ability and improves precipitation forecasts over South America. Using TelNet, a sequence-to-sequence machine learning model, we assess the performance of models trained with different numbers of CMIP6 simulations compared to those trained with ERA5 reanalysis and the CMIP6 ensemble mean. The results reveal that models trained with only a few CMIP6 simulations perform worse than those trained with ERA5, primarily due to instability during ML model tuning and reduced generalization ability. However, as the number of CMIP6 models increases, performance improves and surpasses both ERA5- and ensemble-mean-based ML models. Reliability and sharpness diagrams analysis further demonstrate that ML models trained with more CMIP6 simulations yield more confident and calibrated forecasts. Moreover, CMIP6-based TelNet constantly outperformed state-of-the-art dynamical models across different initialization months and lead times. This study underscores the potential of leveraging large multi-model dynamical simulations for robust ML-based seasonal climate forecasting.</div></div>\",\"PeriodicalId\":8600,\"journal\":{\"name\":\"Atmospheric Research\",\"volume\":\"329 \",\"pages\":\"Article 108463\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Atmospheric Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169809525005551\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"METEOROLOGY & ATMOSPHERIC SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169809525005551","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
Enhancing machine learning-based seasonal precipitation forecasting using CMIP6 simulations
The limited availability of observational and reanalysis data presents a significant challenge in training machine learning (ML) models for seasonal climate forecasting. Here, we show that training ML-based seasonal forecasting models with a larger number of individual simulations from CMIP6 models enhances their generalization ability and improves precipitation forecasts over South America. Using TelNet, a sequence-to-sequence machine learning model, we assess the performance of models trained with different numbers of CMIP6 simulations compared to those trained with ERA5 reanalysis and the CMIP6 ensemble mean. The results reveal that models trained with only a few CMIP6 simulations perform worse than those trained with ERA5, primarily due to instability during ML model tuning and reduced generalization ability. However, as the number of CMIP6 models increases, performance improves and surpasses both ERA5- and ensemble-mean-based ML models. Reliability and sharpness diagrams analysis further demonstrate that ML models trained with more CMIP6 simulations yield more confident and calibrated forecasts. Moreover, CMIP6-based TelNet constantly outperformed state-of-the-art dynamical models across different initialization months and lead times. This study underscores the potential of leveraging large multi-model dynamical simulations for robust ML-based seasonal climate forecasting.
期刊介绍:
The journal publishes scientific papers (research papers, review articles, letters and notes) dealing with the part of the atmosphere where meteorological events occur. Attention is given to all processes extending from the earth surface to the tropopause, but special emphasis continues to be devoted to the physics of clouds, mesoscale meteorology and air pollution, i.e. atmospheric aerosols; microphysical processes; cloud dynamics and thermodynamics; numerical simulation, climatology, climate change and weather modification.