使用卫星数据集估算颗粒物:一种机器学习方法。

IF 5.8 3区 环境科学与生态学 0 ENVIRONMENTAL SCIENCES
Sunita Verma, Ajay Sharma, Swagata Payra, Neelam Chaudhary, Manoj Mishra
{"title":"使用卫星数据集估算颗粒物:一种机器学习方法。","authors":"Sunita Verma,&nbsp;Ajay Sharma,&nbsp;Swagata Payra,&nbsp;Neelam Chaudhary,&nbsp;Manoj Mishra","doi":"10.1007/s11356-024-35564-0","DOIUrl":null,"url":null,"abstract":"<div><p>In the present work, it is the first time an interpretable machine learning model has been developed for the estimation of Particulate Matter 10 µm (PM<sub>10</sub>) concentrations over India using Aerosol Optical Depth (AOD) from two different satellites, i.e. INSAT-3D and Moderate Resolution Imaging Spectroradiometer (MODIS) for the period of 7 years (2014 to 2020). Ground datasets of AOD are taken from the Aerosol Robotic Network (AERONET) for the validation of satellite-retrieved AOD. The observation of particulate matter (PM) data is acquired from the Central Pollution Control Board (CPCB) station across India. Analysis has been performed on a monthly basis for the given time period. The result shows that AOD products of MODIS exhibit good correlation with AERONET AOD whereas INSAT-3D AOD is not well correlated with AERONET AOD. However, after applying an error envelope and threshold-based filtering technique, we have found that INSAT-3D shows significant correlation with ground-level AOD with approximate correlation of 0.66 for Jaipur and 0.57 for Kanpur exhibiting almost similar performance as MODIS-derived AOD. Satellite AOD data together with ground PM concentration data is used to train the machine learning model (random forest) for the estimation of the PM distribution across India for the year 2020. An encouraging correlation of <i>R</i>-squared (<i>R</i><sup>2</sup>) value 0.78 has been observed between the estimated and observed PM<sub>10</sub> concentrations. The model demonstrates effective training, mitigating huge overestimation and underestimation. However, despite closely tracking the trends of estimated PM<sub>10</sub> with observed PM<sub>10</sub>, few instances of overestimation persist. This suggests the need for an expanded training dataset to further refine and enhance the model’s accuracy. Finally, the machine learning model used for PM<sub>10</sub> estimation is found to be optimal for a calibrated satellite AOD product.</p></div>","PeriodicalId":545,"journal":{"name":"Environmental Science and Pollution Research","volume":"31 58","pages":"66372 - 66387"},"PeriodicalIF":5.8000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Particulate matter estimation using satellite datasets: a machine learning approach\",\"authors\":\"Sunita Verma,&nbsp;Ajay Sharma,&nbsp;Swagata Payra,&nbsp;Neelam Chaudhary,&nbsp;Manoj Mishra\",\"doi\":\"10.1007/s11356-024-35564-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In the present work, it is the first time an interpretable machine learning model has been developed for the estimation of Particulate Matter 10 µm (PM<sub>10</sub>) concentrations over India using Aerosol Optical Depth (AOD) from two different satellites, i.e. INSAT-3D and Moderate Resolution Imaging Spectroradiometer (MODIS) for the period of 7 years (2014 to 2020). Ground datasets of AOD are taken from the Aerosol Robotic Network (AERONET) for the validation of satellite-retrieved AOD. The observation of particulate matter (PM) data is acquired from the Central Pollution Control Board (CPCB) station across India. Analysis has been performed on a monthly basis for the given time period. The result shows that AOD products of MODIS exhibit good correlation with AERONET AOD whereas INSAT-3D AOD is not well correlated with AERONET AOD. However, after applying an error envelope and threshold-based filtering technique, we have found that INSAT-3D shows significant correlation with ground-level AOD with approximate correlation of 0.66 for Jaipur and 0.57 for Kanpur exhibiting almost similar performance as MODIS-derived AOD. Satellite AOD data together with ground PM concentration data is used to train the machine learning model (random forest) for the estimation of the PM distribution across India for the year 2020. An encouraging correlation of <i>R</i>-squared (<i>R</i><sup>2</sup>) value 0.78 has been observed between the estimated and observed PM<sub>10</sub> concentrations. The model demonstrates effective training, mitigating huge overestimation and underestimation. However, despite closely tracking the trends of estimated PM<sub>10</sub> with observed PM<sub>10</sub>, few instances of overestimation persist. This suggests the need for an expanded training dataset to further refine and enhance the model’s accuracy. Finally, the machine learning model used for PM<sub>10</sub> estimation is found to be optimal for a calibrated satellite AOD product.</p></div>\",\"PeriodicalId\":545,\"journal\":{\"name\":\"Environmental Science and Pollution Research\",\"volume\":\"31 58\",\"pages\":\"66372 - 66387\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2024-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Science and Pollution Research\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s11356-024-35564-0\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science and Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s11356-024-35564-0","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

在目前的工作中,这是第一次开发了一个可解释的机器学习模型,用于使用来自两颗不同卫星(即INSAT-3D和中分辨率成像光谱仪(MODIS))的气溶胶光学深度(AOD)估计印度10µm (PM10)浓度,为期7年(2014年至2020年)。地面AOD数据集取自气溶胶机器人网络(AERONET),用于卫星反演AOD的验证。颗粒物质(PM)的观测数据来自印度中央污染控制委员会(CPCB)站。分析是在给定时间段内按月进行的。结果表明,MODIS AOD产品与AERONET AOD具有良好的相关性,而INSAT-3D AOD产品与AERONET AOD相关性不佳。然而,在应用误差包络和基于阈值的滤波技术后,我们发现INSAT-3D与地面AOD具有显著的相关性,斋浦尔的相关性约为0.66,坎普尔的相关性约为0.57,表现出与modis衍生的AOD几乎相似的性能。卫星AOD数据与地面PM浓度数据一起用于训练机器学习模型(随机森林),以估计2020年印度各地的PM分布。在估计的PM10浓度和观测到的PM10浓度之间观察到令人鼓舞的r平方(R2)值0.78的相关性。该模型经过了有效的训练,减轻了严重的高估和低估。然而,尽管密切跟踪估计的PM10与观测到的PM10的趋势,但很少有高估的情况持续存在。这表明需要一个扩展的训练数据集来进一步完善和提高模型的准确性。最后,发现用于PM10估计的机器学习模型对于校准的卫星AOD产品是最优的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Particulate matter estimation using satellite datasets: a machine learning approach

In the present work, it is the first time an interpretable machine learning model has been developed for the estimation of Particulate Matter 10 µm (PM10) concentrations over India using Aerosol Optical Depth (AOD) from two different satellites, i.e. INSAT-3D and Moderate Resolution Imaging Spectroradiometer (MODIS) for the period of 7 years (2014 to 2020). Ground datasets of AOD are taken from the Aerosol Robotic Network (AERONET) for the validation of satellite-retrieved AOD. The observation of particulate matter (PM) data is acquired from the Central Pollution Control Board (CPCB) station across India. Analysis has been performed on a monthly basis for the given time period. The result shows that AOD products of MODIS exhibit good correlation with AERONET AOD whereas INSAT-3D AOD is not well correlated with AERONET AOD. However, after applying an error envelope and threshold-based filtering technique, we have found that INSAT-3D shows significant correlation with ground-level AOD with approximate correlation of 0.66 for Jaipur and 0.57 for Kanpur exhibiting almost similar performance as MODIS-derived AOD. Satellite AOD data together with ground PM concentration data is used to train the machine learning model (random forest) for the estimation of the PM distribution across India for the year 2020. An encouraging correlation of R-squared (R2) value 0.78 has been observed between the estimated and observed PM10 concentrations. The model demonstrates effective training, mitigating huge overestimation and underestimation. However, despite closely tracking the trends of estimated PM10 with observed PM10, few instances of overestimation persist. This suggests the need for an expanded training dataset to further refine and enhance the model’s accuracy. Finally, the machine learning model used for PM10 estimation is found to be optimal for a calibrated satellite AOD product.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.70
自引率
17.20%
发文量
6549
审稿时长
3.8 months
期刊介绍: Environmental Science and Pollution Research (ESPR) serves the international community in all areas of Environmental Science and related subjects with emphasis on chemical compounds. This includes: - Terrestrial Biology and Ecology - Aquatic Biology and Ecology - Atmospheric Chemistry - Environmental Microbiology/Biobased Energy Sources - Phytoremediation and Ecosystem Restoration - Environmental Analyses and Monitoring - Assessment of Risks and Interactions of Pollutants in the Environment - Conservation Biology and Sustainable Agriculture - Impact of Chemicals/Pollutants on Human and Animal Health It reports from a broad interdisciplinary outlook.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信