A. Salcedo-Bosch , F. Rocadenbosch , C.I. Argañaraz , G. Curci , S. Lolli
{"title":"Retrieval of planetary boundary layer height from CALIPSO satellite observations using a machine learning approach","authors":"A. Salcedo-Bosch , F. Rocadenbosch , C.I. Argañaraz , G. Curci , S. Lolli","doi":"10.1016/j.ecoinf.2025.103431","DOIUrl":null,"url":null,"abstract":"<div><div>The planetary boundary layer height (PBLH) is a key variable in air quality, climate modeling, and weather prediction. Traditional retrieval methods, such as radiosondes, provide high accuracy but lack spatial coverage. This study presents a Random Forest (RF) model based on Machine Learning (ML) to estimate PBLH from ten years of Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP) on board the Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observations (CALIPSO), using radiosonde measurements as a reference. The model achieves an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.67 and an RMSE of 278.02 m with a spatial resolution of <span><math><mo>≈</mo></math></span> 20 × 20 km<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> in a test set that covers mainly Europe and North America. Unlike previous methods, our approach does not require atmospheric typing and uses minimal data filtering, demonstrating robustness under diverse aerosol and cloud conditions. Although validation is currently limited to mid-latitude regions, the method offers a scalable approach to global monitoring and supports the management of climate and air quality. Future work will extend the validation to other geographic zones and explore deep learning models for further improvements.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"92 ","pages":"Article 103431"},"PeriodicalIF":7.3000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125004406","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The planetary boundary layer height (PBLH) is a key variable in air quality, climate modeling, and weather prediction. Traditional retrieval methods, such as radiosondes, provide high accuracy but lack spatial coverage. This study presents a Random Forest (RF) model based on Machine Learning (ML) to estimate PBLH from ten years of Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP) on board the Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observations (CALIPSO), using radiosonde measurements as a reference. The model achieves an of 0.67 and an RMSE of 278.02 m with a spatial resolution of 20 × 20 km in a test set that covers mainly Europe and North America. Unlike previous methods, our approach does not require atmospheric typing and uses minimal data filtering, demonstrating robustness under diverse aerosol and cloud conditions. Although validation is currently limited to mid-latitude regions, the method offers a scalable approach to global monitoring and supports the management of climate and air quality. Future work will extend the validation to other geographic zones and explore deep learning models for further improvements.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.