Synergizing GNSS, MODIS, and ERA5 for high-resolution PWV retrieval: a two-stage machine learning approach over Hong Kong

IF 2.8 3区 地球科学 Q2 ASTRONOMY & ASTROPHYSICS
Advances in Space Research Pub Date : 2026-03-15 Epub Date: 2026-01-19 DOI:10.1016/j.asr.2026.01.035
Guanmei Chen , Aigong Xu , Zongqiu Xu , Zhiguo Deng , Longjiang Tang , Congying Shao , Nannan Yang , Xiang Gao , Meiqi Zhang
{"title":"Synergizing GNSS, MODIS, and ERA5 for high-resolution PWV retrieval: a two-stage machine learning approach over Hong Kong","authors":"Guanmei Chen ,&nbsp;Aigong Xu ,&nbsp;Zongqiu Xu ,&nbsp;Zhiguo Deng ,&nbsp;Longjiang Tang ,&nbsp;Congying Shao ,&nbsp;Nannan Yang ,&nbsp;Xiang Gao ,&nbsp;Meiqi Zhang","doi":"10.1016/j.asr.2026.01.035","DOIUrl":null,"url":null,"abstract":"<div><div>Atmospheric water vapor is a fundamental driver of the Earth’s energy balance and climate system, making precipitable water vapor (PWV) a key indicator for meteorological monitoring and extreme weather forecasting. This study introduces an innovative two-stage machine learning framework, named the Decision Tree and Random Forest PWV Fusion (DTRP) framework, to generate a high-precision, spatiotemporally continuous PWV product over Hong Kong by fusing multi-source data, including GNSS, Moderate Resolution Imaging Spectroradiometer (MODIS), and ERA5. GNSS PWV derived from 18 Continuously Operating Reference Stations (CORS) served as the benchmark. In the initial stage of the DTRP framework, a regression tree model is employed to correct systematic deviations in MODIS Near-Infrared (NIR) PWV. The model exploits the differences between ERA5 and MYD NIR PWV, together with spatiotemporal features, to estimate GNSS–MYD deviations, thereby yielding an enhanced PWV product (EMYD). Building on this, the second stage of DTRP applies a random forest framework to integrate EMYD with original ERA5 PWV along with meteorological covariates, generating the final fused product (RF PWV). Statistical evaluation confirms the initial correction successfully eliminated the systematic negative bias in MYD NIR PWV. The RMSE decreased to 3.45 mm with an R<sup>2</sup> of 0.95, corresponding to improvements of 55.8% in accuracy and 0.17 in explanatory power over the original data. The subsequent fusion stage achieved a further refined RMSE of 3.29 mm, outperforming the MYD, ERA5, and EMYD products by 57.9%, 30.4%, and 4.6%. The final RF PWV product demonstrates nearly unbiased estimation capability and maintains strong agreement with reference data, achieving an R<sup>2</sup> of 0.96. Spatiotemporal and correlation analyses confirmed the superior consistency and reliability of the RF PWV under diverse conditions and its effectiveness in decoupling errors from geographic and atmospheric influences. The proposed DTRP algorithm presents a valuable framework for predicting severe convective weather.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"77 6","pages":"Pages 6607-6628"},"PeriodicalIF":2.8000,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0273117726000608","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/19 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

Atmospheric water vapor is a fundamental driver of the Earth’s energy balance and climate system, making precipitable water vapor (PWV) a key indicator for meteorological monitoring and extreme weather forecasting. This study introduces an innovative two-stage machine learning framework, named the Decision Tree and Random Forest PWV Fusion (DTRP) framework, to generate a high-precision, spatiotemporally continuous PWV product over Hong Kong by fusing multi-source data, including GNSS, Moderate Resolution Imaging Spectroradiometer (MODIS), and ERA5. GNSS PWV derived from 18 Continuously Operating Reference Stations (CORS) served as the benchmark. In the initial stage of the DTRP framework, a regression tree model is employed to correct systematic deviations in MODIS Near-Infrared (NIR) PWV. The model exploits the differences between ERA5 and MYD NIR PWV, together with spatiotemporal features, to estimate GNSS–MYD deviations, thereby yielding an enhanced PWV product (EMYD). Building on this, the second stage of DTRP applies a random forest framework to integrate EMYD with original ERA5 PWV along with meteorological covariates, generating the final fused product (RF PWV). Statistical evaluation confirms the initial correction successfully eliminated the systematic negative bias in MYD NIR PWV. The RMSE decreased to 3.45 mm with an R2 of 0.95, corresponding to improvements of 55.8% in accuracy and 0.17 in explanatory power over the original data. The subsequent fusion stage achieved a further refined RMSE of 3.29 mm, outperforming the MYD, ERA5, and EMYD products by 57.9%, 30.4%, and 4.6%. The final RF PWV product demonstrates nearly unbiased estimation capability and maintains strong agreement with reference data, achieving an R2 of 0.96. Spatiotemporal and correlation analyses confirmed the superior consistency and reliability of the RF PWV under diverse conditions and its effectiveness in decoupling errors from geographic and atmospheric influences. The proposed DTRP algorithm presents a valuable framework for predicting severe convective weather.
协同GNSS、MODIS和ERA5进行高分辨率PWV检索:香港的两阶段机器学习方法
大气水汽是地球能量平衡和气候系统的基本驱动力,可降水量(PWV)是气象监测和极端天气预报的重要指标。本研究引入了一个创新的两阶段机器学习框架,即决策树和随机森林PWV融合(DTRP)框架,通过融合多源数据,包括GNSS、中分辨率成像光谱仪(MODIS)和ERA5,在香港产生高精度、时空连续的PWV产品。以18个连续运行参考站(CORS)的GNSS PWV为基准。在DTRP框架的初始阶段,采用回归树模型对MODIS近红外(NIR) PWV的系统偏差进行校正。该模型利用ERA5和MYD近红外PWV之间的差异以及时空特征来估计GNSS-MYD偏差,从而产生增强的PWV产品(EMYD)。在此基础上,DTRP的第二阶段应用随机森林框架将EMYD与原始ERA5 PWV以及气象协变量相结合,生成最终的融合产物(RF PWV)。统计评估证实初始校正成功消除了MYD NIR PWV的系统负偏倚。RMSE降至3.45 mm, R2为0.95,与原始数据相比,精度提高了55.8%,解释力提高了0.17。随后的融合阶段实现了进一步细化的RMSE为3.29 mm,比MYD、ERA5和EMYD产品分别高出57.9%、30.4%和4.6%。最终的RF PWV产品展示了几乎无偏的估计能力,并与参考数据保持了很强的一致性,实现了0.96的R2。时空分析和相关分析证实了射频PWV在不同条件下的一致性和可靠性,以及它在解耦地理和大气影响误差方面的有效性。提出的DTRP算法为强对流天气预报提供了一个有价值的框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advances in Space Research
Advances in Space Research 地学天文-地球科学综合
CiteScore
5.20
自引率
11.50%
发文量
800
审稿时长
5.8 months
期刊介绍: The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc. NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR). All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书