利用CMIP6模拟增强基于机器学习的季节降水预报

IF 4.4 2区 地球科学 Q1 METEOROLOGY & ATMOSPHERIC SCIENCES
Enzo Pinheiro, Taha B.M.J. Ouarda
{"title":"利用CMIP6模拟增强基于机器学习的季节降水预报","authors":"Enzo Pinheiro,&nbsp;Taha B.M.J. Ouarda","doi":"10.1016/j.atmosres.2025.108463","DOIUrl":null,"url":null,"abstract":"<div><div>The limited availability of observational and reanalysis data presents a significant challenge in training machine learning (ML) models for seasonal climate forecasting. Here, we show that training ML-based seasonal forecasting models with a larger number of individual simulations from CMIP6 models enhances their generalization ability and improves precipitation forecasts over South America. Using TelNet, a sequence-to-sequence machine learning model, we assess the performance of models trained with different numbers of CMIP6 simulations compared to those trained with ERA5 reanalysis and the CMIP6 ensemble mean. The results reveal that models trained with only a few CMIP6 simulations perform worse than those trained with ERA5, primarily due to instability during ML model tuning and reduced generalization ability. However, as the number of CMIP6 models increases, performance improves and surpasses both ERA5- and ensemble-mean-based ML models. Reliability and sharpness diagrams analysis further demonstrate that ML models trained with more CMIP6 simulations yield more confident and calibrated forecasts. Moreover, CMIP6-based TelNet constantly outperformed state-of-the-art dynamical models across different initialization months and lead times. This study underscores the potential of leveraging large multi-model dynamical simulations for robust ML-based seasonal climate forecasting.</div></div>","PeriodicalId":8600,"journal":{"name":"Atmospheric Research","volume":"329 ","pages":"Article 108463"},"PeriodicalIF":4.4000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing machine learning-based seasonal precipitation forecasting using CMIP6 simulations\",\"authors\":\"Enzo Pinheiro,&nbsp;Taha B.M.J. Ouarda\",\"doi\":\"10.1016/j.atmosres.2025.108463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The limited availability of observational and reanalysis data presents a significant challenge in training machine learning (ML) models for seasonal climate forecasting. Here, we show that training ML-based seasonal forecasting models with a larger number of individual simulations from CMIP6 models enhances their generalization ability and improves precipitation forecasts over South America. Using TelNet, a sequence-to-sequence machine learning model, we assess the performance of models trained with different numbers of CMIP6 simulations compared to those trained with ERA5 reanalysis and the CMIP6 ensemble mean. The results reveal that models trained with only a few CMIP6 simulations perform worse than those trained with ERA5, primarily due to instability during ML model tuning and reduced generalization ability. However, as the number of CMIP6 models increases, performance improves and surpasses both ERA5- and ensemble-mean-based ML models. Reliability and sharpness diagrams analysis further demonstrate that ML models trained with more CMIP6 simulations yield more confident and calibrated forecasts. Moreover, CMIP6-based TelNet constantly outperformed state-of-the-art dynamical models across different initialization months and lead times. This study underscores the potential of leveraging large multi-model dynamical simulations for robust ML-based seasonal climate forecasting.</div></div>\",\"PeriodicalId\":8600,\"journal\":{\"name\":\"Atmospheric Research\",\"volume\":\"329 \",\"pages\":\"Article 108463\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Atmospheric Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169809525005551\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"METEOROLOGY & ATMOSPHERIC SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169809525005551","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

观测和再分析数据的有限可用性对训练用于季节气候预报的机器学习(ML)模型提出了重大挑战。本研究表明,使用CMIP6模式的大量个体模拟训练基于ml的季节预报模型提高了其泛化能力,并改善了南美洲的降水预报。使用TelNet(一种序列到序列的机器学习模型),我们评估了使用不同数量的CMIP6模拟训练的模型的性能,并将其与使用ERA5再分析和CMIP6集成平均值训练的模型进行了比较。结果表明,仅使用少量CMIP6模拟训练的模型比使用ERA5训练的模型表现更差,主要是由于ML模型调优过程中的不稳定性和泛化能力降低。然而,随着CMIP6模型数量的增加,性能提高并超过了基于ERA5和集成均值的ML模型。可靠性和清晰度图分析进一步表明,使用更多CMIP6模拟训练的ML模型可以产生更自信和校准的预测。此外,基于cmip6的TelNet在不同的初始化月份和交付时间内不断优于最先进的动态模型。这项研究强调了利用大型多模式动力学模拟进行稳健的基于ml的季节气候预报的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enhancing machine learning-based seasonal precipitation forecasting using CMIP6 simulations
The limited availability of observational and reanalysis data presents a significant challenge in training machine learning (ML) models for seasonal climate forecasting. Here, we show that training ML-based seasonal forecasting models with a larger number of individual simulations from CMIP6 models enhances their generalization ability and improves precipitation forecasts over South America. Using TelNet, a sequence-to-sequence machine learning model, we assess the performance of models trained with different numbers of CMIP6 simulations compared to those trained with ERA5 reanalysis and the CMIP6 ensemble mean. The results reveal that models trained with only a few CMIP6 simulations perform worse than those trained with ERA5, primarily due to instability during ML model tuning and reduced generalization ability. However, as the number of CMIP6 models increases, performance improves and surpasses both ERA5- and ensemble-mean-based ML models. Reliability and sharpness diagrams analysis further demonstrate that ML models trained with more CMIP6 simulations yield more confident and calibrated forecasts. Moreover, CMIP6-based TelNet constantly outperformed state-of-the-art dynamical models across different initialization months and lead times. This study underscores the potential of leveraging large multi-model dynamical simulations for robust ML-based seasonal climate forecasting.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Atmospheric Research
Atmospheric Research 地学-气象与大气科学
CiteScore
9.40
自引率
10.90%
发文量
460
审稿时长
47 days
期刊介绍: The journal publishes scientific papers (research papers, review articles, letters and notes) dealing with the part of the atmosphere where meteorological events occur. Attention is given to all processes extending from the earth surface to the tropopause, but special emphasis continues to be devoted to the physics of clouds, mesoscale meteorology and air pollution, i.e. atmospheric aerosols; microphysical processes; cloud dynamics and thermodynamics; numerical simulation, climatology, climate change and weather modification.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信