Wenze Li , Wenchao Han , Jiachen Meng , Zipeng Dong , Jun Xu , Qimeng Wang , Lulu Yuan , Han Wang , Zhongzhi Zhang , Miaomiao Cheng
{"title":"利用多源数据,基于机器学习生成中国上空高分辨率三维全覆盖气溶胶分布数据","authors":"Wenze Li , Wenchao Han , Jiachen Meng , Zipeng Dong , Jun Xu , Qimeng Wang , Lulu Yuan , Han Wang , Zhongzhi Zhang , Miaomiao Cheng","doi":"10.1016/j.rse.2025.114772","DOIUrl":null,"url":null,"abstract":"<div><div>Aerosol pollution significantly influences the interaction between solar radiation and the earth's atmosphere and seriously threatens human health. Numerous studies have applied machine learning models such as Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) to estimate aerosol-related parameters, including aerosol optical depth and particulate matter concentrations (e.g., PM<sub>2.5</sub>). However, current aerosol products primarily provide horizontal or spatially discontinuous vertical data, lacking comprehensive three-dimensional (3D) coverage. To address this gap, we developed the XGBoost-LightGBM-Wavelet (XLW) model, integrating XGBoost, LightGBM, and wavelet transforms to merge multisource data. This approach, for the first time, produced high-resolution, three-dimensional, full-coverage aerosol distribution data for China in 2015. The model outputs a dataset of aerosol spatial distribution with a horizontal resolution of 0.05°, and 167 layers within 10 km in the vertical direction. The XLW model demonstrates excellent predictive ability, effectively filling gaps in aerosol distribution. It enhances signal continuity and strengthens lower-layer signals, closely matching ground LiDAR observations and providing a more accurate representation compared to the Cloud-Aerosol LiDAR and Infrared Pathfinder Satellite Observation (CALIPSO) data. The dataset accurately reveals the 3D distribution of aerosols, which is meaningful for a comprehensive study of aerosol distribution at different altitudes in various regions. At 300 m height above ground level, the most polluted regions are the North China Plain and the Yangtze River Delta region, with an average aerosol extinction coefficient (AEC) of 0.34 and 0.40 km<sup>−1</sup>, respectively. As the height increases to 1 km, the average AEC notably decreases to 0.23 and 0.24 km<sup>−1</sup> in the North China Plain and the Yangtze River Delta. By 3 km, aerosol distribution becomes sparse over most regions of China. For the vertical variations of aerosol distributions in typical cities, in the North China Plain and Yangtze River Delta, aerosol concentrations consistently decrease from the near-surface to 4 km. However, in the Pearl River Delta, aerosol concentrations decrease consistently from 0 to 2 km, with relatively stable between 2 and 3 km. Above 4 km, aerosol concentrations are nearly negligible in all typical cities. The XLW model can accurately produce a high-resolution, 3D, full-coverage aerosol spatial distribution dataset, which is vital for conducting thorough studies on aerosol transport, aerosol radiative effects, and climate change.</div></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"324 ","pages":"Article 114772"},"PeriodicalIF":11.1000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based generation of high-resolution 3D full-coverage aerosol distribution data over China using multisource data\",\"authors\":\"Wenze Li , Wenchao Han , Jiachen Meng , Zipeng Dong , Jun Xu , Qimeng Wang , Lulu Yuan , Han Wang , Zhongzhi Zhang , Miaomiao Cheng\",\"doi\":\"10.1016/j.rse.2025.114772\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Aerosol pollution significantly influences the interaction between solar radiation and the earth's atmosphere and seriously threatens human health. Numerous studies have applied machine learning models such as Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) to estimate aerosol-related parameters, including aerosol optical depth and particulate matter concentrations (e.g., PM<sub>2.5</sub>). However, current aerosol products primarily provide horizontal or spatially discontinuous vertical data, lacking comprehensive three-dimensional (3D) coverage. To address this gap, we developed the XGBoost-LightGBM-Wavelet (XLW) model, integrating XGBoost, LightGBM, and wavelet transforms to merge multisource data. This approach, for the first time, produced high-resolution, three-dimensional, full-coverage aerosol distribution data for China in 2015. The model outputs a dataset of aerosol spatial distribution with a horizontal resolution of 0.05°, and 167 layers within 10 km in the vertical direction. The XLW model demonstrates excellent predictive ability, effectively filling gaps in aerosol distribution. It enhances signal continuity and strengthens lower-layer signals, closely matching ground LiDAR observations and providing a more accurate representation compared to the Cloud-Aerosol LiDAR and Infrared Pathfinder Satellite Observation (CALIPSO) data. The dataset accurately reveals the 3D distribution of aerosols, which is meaningful for a comprehensive study of aerosol distribution at different altitudes in various regions. At 300 m height above ground level, the most polluted regions are the North China Plain and the Yangtze River Delta region, with an average aerosol extinction coefficient (AEC) of 0.34 and 0.40 km<sup>−1</sup>, respectively. As the height increases to 1 km, the average AEC notably decreases to 0.23 and 0.24 km<sup>−1</sup> in the North China Plain and the Yangtze River Delta. By 3 km, aerosol distribution becomes sparse over most regions of China. For the vertical variations of aerosol distributions in typical cities, in the North China Plain and Yangtze River Delta, aerosol concentrations consistently decrease from the near-surface to 4 km. However, in the Pearl River Delta, aerosol concentrations decrease consistently from 0 to 2 km, with relatively stable between 2 and 3 km. Above 4 km, aerosol concentrations are nearly negligible in all typical cities. The XLW model can accurately produce a high-resolution, 3D, full-coverage aerosol spatial distribution dataset, which is vital for conducting thorough studies on aerosol transport, aerosol radiative effects, and climate change.</div></div>\",\"PeriodicalId\":417,\"journal\":{\"name\":\"Remote Sensing of Environment\",\"volume\":\"324 \",\"pages\":\"Article 114772\"},\"PeriodicalIF\":11.1000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Remote Sensing of Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0034425725001762\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034425725001762","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Machine learning-based generation of high-resolution 3D full-coverage aerosol distribution data over China using multisource data
Aerosol pollution significantly influences the interaction between solar radiation and the earth's atmosphere and seriously threatens human health. Numerous studies have applied machine learning models such as Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) to estimate aerosol-related parameters, including aerosol optical depth and particulate matter concentrations (e.g., PM2.5). However, current aerosol products primarily provide horizontal or spatially discontinuous vertical data, lacking comprehensive three-dimensional (3D) coverage. To address this gap, we developed the XGBoost-LightGBM-Wavelet (XLW) model, integrating XGBoost, LightGBM, and wavelet transforms to merge multisource data. This approach, for the first time, produced high-resolution, three-dimensional, full-coverage aerosol distribution data for China in 2015. The model outputs a dataset of aerosol spatial distribution with a horizontal resolution of 0.05°, and 167 layers within 10 km in the vertical direction. The XLW model demonstrates excellent predictive ability, effectively filling gaps in aerosol distribution. It enhances signal continuity and strengthens lower-layer signals, closely matching ground LiDAR observations and providing a more accurate representation compared to the Cloud-Aerosol LiDAR and Infrared Pathfinder Satellite Observation (CALIPSO) data. The dataset accurately reveals the 3D distribution of aerosols, which is meaningful for a comprehensive study of aerosol distribution at different altitudes in various regions. At 300 m height above ground level, the most polluted regions are the North China Plain and the Yangtze River Delta region, with an average aerosol extinction coefficient (AEC) of 0.34 and 0.40 km−1, respectively. As the height increases to 1 km, the average AEC notably decreases to 0.23 and 0.24 km−1 in the North China Plain and the Yangtze River Delta. By 3 km, aerosol distribution becomes sparse over most regions of China. For the vertical variations of aerosol distributions in typical cities, in the North China Plain and Yangtze River Delta, aerosol concentrations consistently decrease from the near-surface to 4 km. However, in the Pearl River Delta, aerosol concentrations decrease consistently from 0 to 2 km, with relatively stable between 2 and 3 km. Above 4 km, aerosol concentrations are nearly negligible in all typical cities. The XLW model can accurately produce a high-resolution, 3D, full-coverage aerosol spatial distribution dataset, which is vital for conducting thorough studies on aerosol transport, aerosol radiative effects, and climate change.
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.