{"title":"基于新型概率双机器学习方法的子季节到季节降水预报的集合后处理","authors":"Shengsheng Zhan , Aizhong Ye , Lingyun Wu , Chenguang Zhao","doi":"10.1016/j.jhydrol.2025.133484","DOIUrl":null,"url":null,"abstract":"<div><div>Subseasonal-to-seasonal (S2S) precipitation forecasting is crucial for hydrological modeling; however, its accuracy often falls short of the requirements for hydrological forecasts, necessitating post-processing. A novel improved version of the Double Machine Learning (DML) method, termed Probabilistic Double Machine Learning (PDML), is proposed for ensemble post-processing of S2S forecasts. The new PDML method extends the classifier from binary classification to multi-class classification, improves the regressor from single-value output to probability distribution output, and combines the classifier and regressor based on total probability theorem. PDML not only quantifies uncertainty through ensemble output but also provides additional consideration for extreme precipitation events in the classification and regression progress. Various machine learning methods are compared within the PDML framework, including the state-of-the-art Kolmogorov-Arnold Networks. The results indicate that deep learning models based on Recurrent Neural Networks (RNN) and the U-NET architecture perform the best within the PDML framework. It achieves post-processing of S2S forecasts across different timescales and outperforms the statistical Ensemble Pre-Processor (EPP) method. On average, it improves the original forecast’s correlation coefficient, critical success index, and root mean square error by 85.8 %, 294.6 %, and 45.3 %, respectively, and achieves an 8.6 % improvement on the continuous ranked probability score compared to EPP. The results demonstrate that PDML can effectively perform ensemble post-processing of precipitation forecasts across different timescales, quantify uncertainty, and facilitate further hydrological modeling.</div></div>","PeriodicalId":362,"journal":{"name":"Journal of Hydrology","volume":"660 ","pages":"Article 133484"},"PeriodicalIF":6.3000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ensemble post-processing of sub-seasonal to seasonal precipitation forecasts based on a novel probabilistic double machine learning method\",\"authors\":\"Shengsheng Zhan , Aizhong Ye , Lingyun Wu , Chenguang Zhao\",\"doi\":\"10.1016/j.jhydrol.2025.133484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Subseasonal-to-seasonal (S2S) precipitation forecasting is crucial for hydrological modeling; however, its accuracy often falls short of the requirements for hydrological forecasts, necessitating post-processing. A novel improved version of the Double Machine Learning (DML) method, termed Probabilistic Double Machine Learning (PDML), is proposed for ensemble post-processing of S2S forecasts. The new PDML method extends the classifier from binary classification to multi-class classification, improves the regressor from single-value output to probability distribution output, and combines the classifier and regressor based on total probability theorem. PDML not only quantifies uncertainty through ensemble output but also provides additional consideration for extreme precipitation events in the classification and regression progress. Various machine learning methods are compared within the PDML framework, including the state-of-the-art Kolmogorov-Arnold Networks. The results indicate that deep learning models based on Recurrent Neural Networks (RNN) and the U-NET architecture perform the best within the PDML framework. It achieves post-processing of S2S forecasts across different timescales and outperforms the statistical Ensemble Pre-Processor (EPP) method. On average, it improves the original forecast’s correlation coefficient, critical success index, and root mean square error by 85.8 %, 294.6 %, and 45.3 %, respectively, and achieves an 8.6 % improvement on the continuous ranked probability score compared to EPP. The results demonstrate that PDML can effectively perform ensemble post-processing of precipitation forecasts across different timescales, quantify uncertainty, and facilitate further hydrological modeling.</div></div>\",\"PeriodicalId\":362,\"journal\":{\"name\":\"Journal of Hydrology\",\"volume\":\"660 \",\"pages\":\"Article 133484\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hydrology\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0022169425008224\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022169425008224","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Ensemble post-processing of sub-seasonal to seasonal precipitation forecasts based on a novel probabilistic double machine learning method
Subseasonal-to-seasonal (S2S) precipitation forecasting is crucial for hydrological modeling; however, its accuracy often falls short of the requirements for hydrological forecasts, necessitating post-processing. A novel improved version of the Double Machine Learning (DML) method, termed Probabilistic Double Machine Learning (PDML), is proposed for ensemble post-processing of S2S forecasts. The new PDML method extends the classifier from binary classification to multi-class classification, improves the regressor from single-value output to probability distribution output, and combines the classifier and regressor based on total probability theorem. PDML not only quantifies uncertainty through ensemble output but also provides additional consideration for extreme precipitation events in the classification and regression progress. Various machine learning methods are compared within the PDML framework, including the state-of-the-art Kolmogorov-Arnold Networks. The results indicate that deep learning models based on Recurrent Neural Networks (RNN) and the U-NET architecture perform the best within the PDML framework. It achieves post-processing of S2S forecasts across different timescales and outperforms the statistical Ensemble Pre-Processor (EPP) method. On average, it improves the original forecast’s correlation coefficient, critical success index, and root mean square error by 85.8 %, 294.6 %, and 45.3 %, respectively, and achieves an 8.6 % improvement on the continuous ranked probability score compared to EPP. The results demonstrate that PDML can effectively perform ensemble post-processing of precipitation forecasts across different timescales, quantify uncertainty, and facilitate further hydrological modeling.
期刊介绍:
The Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology and hydrogeology. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, hydraulics, agrohydrology, geomorphology, soil science, instrumentation and remote sensing, civil and environmental engineering are included. Social science perspectives on hydrological problems such as resource and ecological economics, environmental sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site.