基于贝叶斯优化随机森林的微波激光雷达气溶胶消光系数预测

IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Hao Chen , Fei Gao , Zhimin Rao , Dengxin Hua
{"title":"基于贝叶斯优化随机森林的微波激光雷达气溶胶消光系数预测","authors":"Hao Chen ,&nbsp;Fei Gao ,&nbsp;Zhimin Rao ,&nbsp;Dengxin Hua","doi":"10.1016/j.asoc.2025.113130","DOIUrl":null,"url":null,"abstract":"<div><div>In continuous observation signals of lidar, the identification and selection of effective signals are crucial, especially for the aerosol extinction coefficient retrieval. In this study, the Bayesian Optimized Random Forest (BORF) model, a machine learning approach combining Random Forest regression with Bayesian optimization, was developed for predicting aerosol extinction coefficients. Built upon the foundation of the Random Forest (RF) regression method, this model leverages Bayesian optimization to adjust model parameters precisely, significantly enhancing the accuracy of aerosol extinction coefficient predictions. This approach offers a valuable means to identify and screen anomalous Lidar signals. We constructed a training dataset comprising continuously observed Mie Lidar signals and aerosol extinction coefficients retrieved using the Klett method. The dataset contains dimensions, including Mie Lidar signals, detection time, detection distance, pressure, and temperature. This paper provides a detailed description of the BORF model’s establishment process and the optimization of model parameters using Bayesian optimization. Through model assessments, significance tests, and comparative experiments, we demonstrate the effectiveness of the BORF model. Experimental results indicate that, compared to other relevant models, the BORF model excels in predicting aerosol extinction coefficients, closely aligning with the accuracy of the Klett method. Specifically, in datasets with better data quality, the BORF model exhibits an approximately 4% increase in <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> compared to the RF and BP neural network optimized by genetic algorithm (BPGA), accompanied by a 41% to 47% reduction in MSE and MAE. The Mean Squared Error (MSE) and Mean Absolute Error (MAE) decrease by approximately 40% to 90% in datasets with lower data quality and less apparent data variations. This study provides a robust technical solution to ensure the reliability of Lidar data, thereby contributing to an enhanced understanding of atmospheric aerosols and environmental monitoring.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"176 ","pages":"Article 113130"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"BORF: A Bayesian optimized random forest for prediction of aerosol extinction coefficient from Mie Lidar signal\",\"authors\":\"Hao Chen ,&nbsp;Fei Gao ,&nbsp;Zhimin Rao ,&nbsp;Dengxin Hua\",\"doi\":\"10.1016/j.asoc.2025.113130\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In continuous observation signals of lidar, the identification and selection of effective signals are crucial, especially for the aerosol extinction coefficient retrieval. In this study, the Bayesian Optimized Random Forest (BORF) model, a machine learning approach combining Random Forest regression with Bayesian optimization, was developed for predicting aerosol extinction coefficients. Built upon the foundation of the Random Forest (RF) regression method, this model leverages Bayesian optimization to adjust model parameters precisely, significantly enhancing the accuracy of aerosol extinction coefficient predictions. This approach offers a valuable means to identify and screen anomalous Lidar signals. We constructed a training dataset comprising continuously observed Mie Lidar signals and aerosol extinction coefficients retrieved using the Klett method. The dataset contains dimensions, including Mie Lidar signals, detection time, detection distance, pressure, and temperature. This paper provides a detailed description of the BORF model’s establishment process and the optimization of model parameters using Bayesian optimization. Through model assessments, significance tests, and comparative experiments, we demonstrate the effectiveness of the BORF model. Experimental results indicate that, compared to other relevant models, the BORF model excels in predicting aerosol extinction coefficients, closely aligning with the accuracy of the Klett method. Specifically, in datasets with better data quality, the BORF model exhibits an approximately 4% increase in <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> compared to the RF and BP neural network optimized by genetic algorithm (BPGA), accompanied by a 41% to 47% reduction in MSE and MAE. The Mean Squared Error (MSE) and Mean Absolute Error (MAE) decrease by approximately 40% to 90% in datasets with lower data quality and less apparent data variations. This study provides a robust technical solution to ensure the reliability of Lidar data, thereby contributing to an enhanced understanding of atmospheric aerosols and environmental monitoring.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"176 \",\"pages\":\"Article 113130\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625004417\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625004417","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

在激光雷达的连续观测信号中,有效信号的识别和选择至关重要,尤其是气溶胶消光系数的反演。在本研究中,贝叶斯优化随机森林(BORF)模型是一种将随机森林回归与贝叶斯优化相结合的机器学习方法,用于预测气溶胶消光系数。该模型在随机森林(Random Forest, RF)回归方法的基础上,利用贝叶斯优化对模型参数进行精确调整,显著提高了气溶胶消光系数预测的准确性。这种方法为识别和筛选异常激光雷达信号提供了一种有价值的手段。我们构建了一个训练数据集,包括连续观测到的Mie激光雷达信号和使用Klett方法检索的气溶胶消光系数。数据集包含维度,包括Mie Lidar信号、检测时间、检测距离、压力和温度。本文详细描述了BORF模型的建立过程以及利用贝叶斯优化方法对模型参数进行优化。通过模型评估、显著性检验和对比实验,我们证明了该模型的有效性。实验结果表明,与其他相关模型相比,BORF模型在预测气溶胶消光系数方面表现优异,与Klett方法的精度基本一致。具体来说,在数据质量更好的数据集中,与遗传算法(BPGA)优化的RF和BP神经网络相比,BORF模型的R2增加了约4%,MSE和MAE降低了41%至47%。在数据质量较低和数据变化不明显的数据集中,均方误差(MSE)和平均绝对误差(MAE)降低了约40%至90%。这项研究提供了一个强大的技术解决方案,以确保激光雷达数据的可靠性,从而有助于提高对大气气溶胶和环境监测的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
BORF: A Bayesian optimized random forest for prediction of aerosol extinction coefficient from Mie Lidar signal
In continuous observation signals of lidar, the identification and selection of effective signals are crucial, especially for the aerosol extinction coefficient retrieval. In this study, the Bayesian Optimized Random Forest (BORF) model, a machine learning approach combining Random Forest regression with Bayesian optimization, was developed for predicting aerosol extinction coefficients. Built upon the foundation of the Random Forest (RF) regression method, this model leverages Bayesian optimization to adjust model parameters precisely, significantly enhancing the accuracy of aerosol extinction coefficient predictions. This approach offers a valuable means to identify and screen anomalous Lidar signals. We constructed a training dataset comprising continuously observed Mie Lidar signals and aerosol extinction coefficients retrieved using the Klett method. The dataset contains dimensions, including Mie Lidar signals, detection time, detection distance, pressure, and temperature. This paper provides a detailed description of the BORF model’s establishment process and the optimization of model parameters using Bayesian optimization. Through model assessments, significance tests, and comparative experiments, we demonstrate the effectiveness of the BORF model. Experimental results indicate that, compared to other relevant models, the BORF model excels in predicting aerosol extinction coefficients, closely aligning with the accuracy of the Klett method. Specifically, in datasets with better data quality, the BORF model exhibits an approximately 4% increase in R2 compared to the RF and BP neural network optimized by genetic algorithm (BPGA), accompanied by a 41% to 47% reduction in MSE and MAE. The Mean Squared Error (MSE) and Mean Absolute Error (MAE) decrease by approximately 40% to 90% in datasets with lower data quality and less apparent data variations. This study provides a robust technical solution to ensure the reliability of Lidar data, thereby contributing to an enhanced understanding of atmospheric aerosols and environmental monitoring.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Soft Computing
Applied Soft Computing 工程技术-计算机:跨学科应用
CiteScore
15.80
自引率
6.90%
发文量
874
审稿时长
10.9 months
期刊介绍: Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信