应用机器学习算法,利用卫星数据和计量数据估算 PM 2.5

Ishwor Thapa, Bidur Devkota
{"title":"应用机器学习算法,利用卫星数据和计量数据估算 PM 2.5","authors":"Ishwor Thapa, Bidur Devkota","doi":"10.3126/jes2.v3i1.66239","DOIUrl":null,"url":null,"abstract":"Air pollution, particularly fine Particulate Matter (PM 2.5), poses significant health risks and environmental challenges worldwide. Therefore, it is essential to monitor air pollution to act on it. In this study, PM 2.5 was estimated using meteorological data and Sentinel-5P air pollution data using machine learning algorithms. The Sentinel-5P data are   and the meteorological data utilized are air temperature, Relative Humidity (RH), and Wind Speed (WS). The three Air Quality Monitoring (AQM) stations in Kathmandu, Nepal, were chosen as a study area for this research. The effectiveness of several machine learning methods, such as K-Nearest Neighbors (KNN), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF), were evaluated. Both RF and XGBoost consistently performed better than SVM and KNN in terms of PM 2.5 estimation accuracy. RF got the highest R2 value of 0.80 and SVM with the lowest R2 value of 0.62 in the Sentinel-5P dataset only. The addition of meteorological data further improved the model's performance. After including metrological data in Sentinel-5P data the RF demonstrated the maximum R2 score of 0.816 and XGBoost with R2 score of 0.814. Hence, this study demonstrated machine learning algorithms can be used to estimate PM 2.5 by utilizing satellite and meteorological data, providing important information for air quality monitoring and management.","PeriodicalId":510231,"journal":{"name":"Journal of Engineering and Sciences","volume":"72 19","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Applying machine learning algorithms to estimate PM 2.5 using satellite data and metrological data\",\"authors\":\"Ishwor Thapa, Bidur Devkota\",\"doi\":\"10.3126/jes2.v3i1.66239\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Air pollution, particularly fine Particulate Matter (PM 2.5), poses significant health risks and environmental challenges worldwide. Therefore, it is essential to monitor air pollution to act on it. In this study, PM 2.5 was estimated using meteorological data and Sentinel-5P air pollution data using machine learning algorithms. The Sentinel-5P data are   and the meteorological data utilized are air temperature, Relative Humidity (RH), and Wind Speed (WS). The three Air Quality Monitoring (AQM) stations in Kathmandu, Nepal, were chosen as a study area for this research. The effectiveness of several machine learning methods, such as K-Nearest Neighbors (KNN), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF), were evaluated. Both RF and XGBoost consistently performed better than SVM and KNN in terms of PM 2.5 estimation accuracy. RF got the highest R2 value of 0.80 and SVM with the lowest R2 value of 0.62 in the Sentinel-5P dataset only. The addition of meteorological data further improved the model's performance. After including metrological data in Sentinel-5P data the RF demonstrated the maximum R2 score of 0.816 and XGBoost with R2 score of 0.814. Hence, this study demonstrated machine learning algorithms can be used to estimate PM 2.5 by utilizing satellite and meteorological data, providing important information for air quality monitoring and management.\",\"PeriodicalId\":510231,\"journal\":{\"name\":\"Journal of Engineering and Sciences\",\"volume\":\"72 19\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Engineering and Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3126/jes2.v3i1.66239\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Engineering and Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3126/jes2.v3i1.66239","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

空气污染,尤其是微粒物质(PM 2.5),在全球范围内造成了重大的健康风险和环境挑战。因此,有必要对空气污染进行监测,以便采取应对措施。在本研究中,利用气象数据和 Sentinel-5P 空气污染数据,使用机器学习算法对 PM 2.5 进行了估算。哨兵-5P 数据和气象数据包括气温、相对湿度 (RH) 和风速 (WS)。本研究选择了尼泊尔加德满都的三个空气质量监测(AQM)站作为研究区域。研究评估了几种机器学习方法的有效性,如 K-Nearest Neighbors (KNN)、Support Vector Machine (SVM)、eXtreme Gradient Boosting (XGBoost) 和 Random Forest (RF)。在 PM 2.5 估算准确度方面,RF 和 XGBoost 的表现始终优于 SVM 和 KNN。仅在哨兵-5P 数据集中,RF 的 R2 值最高,为 0.80,而 SVM 的 R2 值最低,为 0.62。气象数据的加入进一步提高了模型的性能。在 Sentinel-5P 数据中加入气象数据后,RF 的 R2 值最高,为 0.816,XGBoost 的 R2 值为 0.814。因此,本研究表明,机器学习算法可用于利用卫星和气象数据估算 PM 2.5,为空气质量监测和管理提供重要信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Applying machine learning algorithms to estimate PM 2.5 using satellite data and metrological data
Air pollution, particularly fine Particulate Matter (PM 2.5), poses significant health risks and environmental challenges worldwide. Therefore, it is essential to monitor air pollution to act on it. In this study, PM 2.5 was estimated using meteorological data and Sentinel-5P air pollution data using machine learning algorithms. The Sentinel-5P data are   and the meteorological data utilized are air temperature, Relative Humidity (RH), and Wind Speed (WS). The three Air Quality Monitoring (AQM) stations in Kathmandu, Nepal, were chosen as a study area for this research. The effectiveness of several machine learning methods, such as K-Nearest Neighbors (KNN), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF), were evaluated. Both RF and XGBoost consistently performed better than SVM and KNN in terms of PM 2.5 estimation accuracy. RF got the highest R2 value of 0.80 and SVM with the lowest R2 value of 0.62 in the Sentinel-5P dataset only. The addition of meteorological data further improved the model's performance. After including metrological data in Sentinel-5P data the RF demonstrated the maximum R2 score of 0.816 and XGBoost with R2 score of 0.814. Hence, this study demonstrated machine learning algorithms can be used to estimate PM 2.5 by utilizing satellite and meteorological data, providing important information for air quality monitoring and management.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信