{"title":"Synergizing GNSS, MODIS, and ERA5 for high-resolution PWV retrieval: a two-stage machine learning approach over Hong Kong","authors":"Guanmei Chen , Aigong Xu , Zongqiu Xu , Zhiguo Deng , Longjiang Tang , Congying Shao , Nannan Yang , Xiang Gao , Meiqi Zhang","doi":"10.1016/j.asr.2026.01.035","DOIUrl":null,"url":null,"abstract":"<div><div>Atmospheric water vapor is a fundamental driver of the Earth’s energy balance and climate system, making precipitable water vapor (PWV) a key indicator for meteorological monitoring and extreme weather forecasting. This study introduces an innovative two-stage machine learning framework, named the Decision Tree and Random Forest PWV Fusion (DTRP) framework, to generate a high-precision, spatiotemporally continuous PWV product over Hong Kong by fusing multi-source data, including GNSS, Moderate Resolution Imaging Spectroradiometer (MODIS), and ERA5. GNSS PWV derived from 18 Continuously Operating Reference Stations (CORS) served as the benchmark. In the initial stage of the DTRP framework, a regression tree model is employed to correct systematic deviations in MODIS Near-Infrared (NIR) PWV. The model exploits the differences between ERA5 and MYD NIR PWV, together with spatiotemporal features, to estimate GNSS–MYD deviations, thereby yielding an enhanced PWV product (EMYD). Building on this, the second stage of DTRP applies a random forest framework to integrate EMYD with original ERA5 PWV along with meteorological covariates, generating the final fused product (RF PWV). Statistical evaluation confirms the initial correction successfully eliminated the systematic negative bias in MYD NIR PWV. The RMSE decreased to 3.45 mm with an R<sup>2</sup> of 0.95, corresponding to improvements of 55.8% in accuracy and 0.17 in explanatory power over the original data. The subsequent fusion stage achieved a further refined RMSE of 3.29 mm, outperforming the MYD, ERA5, and EMYD products by 57.9%, 30.4%, and 4.6%. The final RF PWV product demonstrates nearly unbiased estimation capability and maintains strong agreement with reference data, achieving an R<sup>2</sup> of 0.96. Spatiotemporal and correlation analyses confirmed the superior consistency and reliability of the RF PWV under diverse conditions and its effectiveness in decoupling errors from geographic and atmospheric influences. The proposed DTRP algorithm presents a valuable framework for predicting severe convective weather.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"77 6","pages":"Pages 6607-6628"},"PeriodicalIF":2.8000,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0273117726000608","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/19 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Atmospheric water vapor is a fundamental driver of the Earth’s energy balance and climate system, making precipitable water vapor (PWV) a key indicator for meteorological monitoring and extreme weather forecasting. This study introduces an innovative two-stage machine learning framework, named the Decision Tree and Random Forest PWV Fusion (DTRP) framework, to generate a high-precision, spatiotemporally continuous PWV product over Hong Kong by fusing multi-source data, including GNSS, Moderate Resolution Imaging Spectroradiometer (MODIS), and ERA5. GNSS PWV derived from 18 Continuously Operating Reference Stations (CORS) served as the benchmark. In the initial stage of the DTRP framework, a regression tree model is employed to correct systematic deviations in MODIS Near-Infrared (NIR) PWV. The model exploits the differences between ERA5 and MYD NIR PWV, together with spatiotemporal features, to estimate GNSS–MYD deviations, thereby yielding an enhanced PWV product (EMYD). Building on this, the second stage of DTRP applies a random forest framework to integrate EMYD with original ERA5 PWV along with meteorological covariates, generating the final fused product (RF PWV). Statistical evaluation confirms the initial correction successfully eliminated the systematic negative bias in MYD NIR PWV. The RMSE decreased to 3.45 mm with an R2 of 0.95, corresponding to improvements of 55.8% in accuracy and 0.17 in explanatory power over the original data. The subsequent fusion stage achieved a further refined RMSE of 3.29 mm, outperforming the MYD, ERA5, and EMYD products by 57.9%, 30.4%, and 4.6%. The final RF PWV product demonstrates nearly unbiased estimation capability and maintains strong agreement with reference data, achieving an R2 of 0.96. Spatiotemporal and correlation analyses confirmed the superior consistency and reliability of the RF PWV under diverse conditions and its effectiveness in decoupling errors from geographic and atmospheric influences. The proposed DTRP algorithm presents a valuable framework for predicting severe convective weather.
期刊介绍:
The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc.
NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR).
All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.