Global air quality index prediction using integrated spatial observation data and geographics machine learning

IF 5.7 Q1 ENVIRONMENTAL SCIENCES
Tania Septi Anggraini , Hitoshi Irie , Anjar Dimara Sakti , Ketut Wikantika
{"title":"Global air quality index prediction using integrated spatial observation data and geographics machine learning","authors":"Tania Septi Anggraini ,&nbsp;Hitoshi Irie ,&nbsp;Anjar Dimara Sakti ,&nbsp;Ketut Wikantika","doi":"10.1016/j.srs.2025.100197","DOIUrl":null,"url":null,"abstract":"<div><div>Air pollution can occur in the whole world, with each region having its unique driving factors that contribute to human's health. However, effective mitigation of air pollution is often hindered by the uneven distribution of air quality monitoring stations, which tend to be concentrated in potential hotspots like major cities. This study aims to detect and improve the accuracy of the Global Air Quality Index from Remote Sensing (AQI-RS) by integrating AQI from ground-based stations with driving factors such as meteorological, environmental, sources of air pollution, and air pollution magnitude from satellite observation parameters as independent variables using Geographics Machine Learning (GML). This study utilizes 425 air pollution stations and the driving factors data globally from 2013 to 2024. The GML considers geographical characteristics in the analysis by calculating the optimal bandwidth area in its algorithm. The study employs nine scenarios to identify which parameters significantly contribute to the model and determine the best parameter combinations. In determining the best scenario, this study considers the R<sup>2</sup> value, Root Mean Square Error (RMSE), and uncertainty in each of the scenarios. This study produced an AQI-RS model with an average R<sup>2</sup>, RMSE, and uncertainty in the best scenario of 0.89, 5.58, and 5.69 (AQI unit), respectively. The results indicate that GML significantly improves the accuracy of global AQI-RS over previous studies. By considering geographical characteristics using GML, this research is expected to gain an accurate prediction of AQI globally especially in regions without ground-based air pollution stations for the worldwide mitigation.</div></div>","PeriodicalId":101147,"journal":{"name":"Science of Remote Sensing","volume":"11 ","pages":"Article 100197"},"PeriodicalIF":5.7000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science of Remote Sensing","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666017225000033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Air pollution can occur in the whole world, with each region having its unique driving factors that contribute to human's health. However, effective mitigation of air pollution is often hindered by the uneven distribution of air quality monitoring stations, which tend to be concentrated in potential hotspots like major cities. This study aims to detect and improve the accuracy of the Global Air Quality Index from Remote Sensing (AQI-RS) by integrating AQI from ground-based stations with driving factors such as meteorological, environmental, sources of air pollution, and air pollution magnitude from satellite observation parameters as independent variables using Geographics Machine Learning (GML). This study utilizes 425 air pollution stations and the driving factors data globally from 2013 to 2024. The GML considers geographical characteristics in the analysis by calculating the optimal bandwidth area in its algorithm. The study employs nine scenarios to identify which parameters significantly contribute to the model and determine the best parameter combinations. In determining the best scenario, this study considers the R2 value, Root Mean Square Error (RMSE), and uncertainty in each of the scenarios. This study produced an AQI-RS model with an average R2, RMSE, and uncertainty in the best scenario of 0.89, 5.58, and 5.69 (AQI unit), respectively. The results indicate that GML significantly improves the accuracy of global AQI-RS over previous studies. By considering geographical characteristics using GML, this research is expected to gain an accurate prediction of AQI globally especially in regions without ground-based air pollution stations for the worldwide mitigation.
基于空间观测数据和地理机器学习的全球空气质量指数预测
空气污染可能发生在全世界,每个地区都有其独特的驱动因素,有助于人类的健康。然而,空气质量监测站分布不均往往阻碍了空气污染的有效缓解,这些监测站往往集中在大城市等潜在热点地区。本研究旨在利用地理机器学习(GML)技术,将地面站的空气质量指数与气象、环境、空气污染源和卫星观测参数的空气污染程度等驱动因素作为自变量相结合,检测并提高遥感全球空气质量指数(AQI- rs)的准确性。本研究利用了2013 - 2024年全球425个大气污染站点及其驱动因子数据。GML算法通过计算最优带宽面积,在分析中考虑了地理特征。该研究采用了9个场景来确定哪些参数对模型有重要贡献,并确定最佳参数组合。在确定最佳方案时,本研究考虑了每个方案中的R2值、均方根误差(RMSE)和不确定性。本研究建立的AQI- rs模型在最佳情景下的平均R2、RMSE和不确定性分别为0.89、5.58和5.69 (AQI单位)。结果表明,GML显著提高了全球AQI-RS的准确性。通过考虑地理特征,本研究有望在全球范围内获得准确的AQI预测,特别是在没有地面空气污染站的地区,为全球缓解提供帮助。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
12.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信