Urgent issues regarding real-time air quality monitoring data in India: Unveiling solutions and implications for policy and health

IF 3.4 Q2 ENVIRONMENTAL SCIENCES
Karn Vohra , Madhumitha S. , Abhishek Chakraborty , Hitansh Shah , Bharrathi AS. , Jayaraju Pakki
{"title":"Urgent issues regarding real-time air quality monitoring data in India: Unveiling solutions and implications for policy and health","authors":"Karn Vohra ,&nbsp;Madhumitha S. ,&nbsp;Abhishek Chakraborty ,&nbsp;Hitansh Shah ,&nbsp;Bharrathi AS. ,&nbsp;Jayaraju Pakki","doi":"10.1016/j.aeaoa.2024.100308","DOIUrl":null,"url":null,"abstract":"<div><div>Deteriorating air quality in India has heightened the emphasis on air quality monitoring. This has resulted in a 16-fold increase in the number of Continuous Ambient Air Quality Monitoring Sites (CAAQMS) across the country over the last decade. The CAAQMS datasets are used globally, but concerns about data quality have also been raised. Missing is a comprehensive assessment quantifying the scale of these air quality data issues and the impact these have on policy- and health-relevant metrics. So, we develop the first open-source automated tool to identify and address data issues and apply it to six pollutants (PM<sub>2.5</sub>, PM<sub>10</sub>, NO, NO<sub>2</sub>, NO<sub>x</sub>, and O<sub>3</sub>) from 213 CAAQMS in 2019–2023. Typical issues in CAAQMS datasets include similar values that repeat continuously for durations exceeding 24 h and outliers that occur at almost the same time every day. We also reveal hidden issues for nitrogen oxides (NO<sub>x</sub> ≈ NO + NO<sub>2</sub>) that include (1) reporting of NO and NO<sub>2</sub> in units not compliant with the Central Pollution Control Board parameter reporting protocol and (2) inconsistency in data reporting when either NO or NO<sub>2</sub> is recorded as “Not Available” but valid NO<sub>x</sub> data is reported. The proportion of data influenced by consecutively similar observations and outliers has remained fairly consistent but sites affected by unit inconsistency issues have grown between 2019 and 2023. No significant difference in data quality issues was observed between CAAQMS maintained by central and state pollution control boards illustrating the country-wide extent of these issues. We find that removing consecutively similar observations and outliers changes annual mean pollutant concentrations by only &lt;5% but correcting for the yet unaddressed issue of unit inconsistency increases annual mean NO<sub>2</sub> concentrations by a dramatic &gt;80% for sites affected by it. We conducted a separate analysis to confirm that the unit inconsistency issue was not identified and addressed in multiple peer-reviewed studies examining the impact of the COVID-19 lockdown, and this is likely to have resulted in reporting of inaccurate absolute air quality improvements.</div><div>A substantial impact of data cleaning on air quality-derived metrics is observed for nitrogen oxides. The impact is marginal for other pollutants. We find that after data cleaning, 23 sites in 2019 became non-compliant with national ambient air quality standards for NO<sub>2</sub>. Worsening of NO<sub>2</sub> data quality over the years increased the number of non-compliant sites to 45 in 2023 after using our tool. For PM<sub>2.5</sub> and PM<sub>10</sub>, fewer than 5 sites changed compliance post-data cleaning. Given marginal changes in concentrations of PM<sub>2.5</sub> and O<sub>3</sub>, premature mortality attributable to exposure to these in Delhi, Mumbai, and Kolkata changed only by &lt;10% after data cleaning. The impact on premature mortality was substantial for exposure to NO<sub>2,</sub> with NO<sub>2</sub>-related premature deaths increasing by 8–67% in the three megacities. These findings have implications for the global research community and policy formulation and underscore the urgent need for ratified CAAQMS data.</div></div>","PeriodicalId":37150,"journal":{"name":"Atmospheric Environment: X","volume":"25 ","pages":"Article 100308"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Environment: X","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590162124000753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Deteriorating air quality in India has heightened the emphasis on air quality monitoring. This has resulted in a 16-fold increase in the number of Continuous Ambient Air Quality Monitoring Sites (CAAQMS) across the country over the last decade. The CAAQMS datasets are used globally, but concerns about data quality have also been raised. Missing is a comprehensive assessment quantifying the scale of these air quality data issues and the impact these have on policy- and health-relevant metrics. So, we develop the first open-source automated tool to identify and address data issues and apply it to six pollutants (PM2.5, PM10, NO, NO2, NOx, and O3) from 213 CAAQMS in 2019–2023. Typical issues in CAAQMS datasets include similar values that repeat continuously for durations exceeding 24 h and outliers that occur at almost the same time every day. We also reveal hidden issues for nitrogen oxides (NOx ≈ NO + NO2) that include (1) reporting of NO and NO2 in units not compliant with the Central Pollution Control Board parameter reporting protocol and (2) inconsistency in data reporting when either NO or NO2 is recorded as “Not Available” but valid NOx data is reported. The proportion of data influenced by consecutively similar observations and outliers has remained fairly consistent but sites affected by unit inconsistency issues have grown between 2019 and 2023. No significant difference in data quality issues was observed between CAAQMS maintained by central and state pollution control boards illustrating the country-wide extent of these issues. We find that removing consecutively similar observations and outliers changes annual mean pollutant concentrations by only <5% but correcting for the yet unaddressed issue of unit inconsistency increases annual mean NO2 concentrations by a dramatic >80% for sites affected by it. We conducted a separate analysis to confirm that the unit inconsistency issue was not identified and addressed in multiple peer-reviewed studies examining the impact of the COVID-19 lockdown, and this is likely to have resulted in reporting of inaccurate absolute air quality improvements.
A substantial impact of data cleaning on air quality-derived metrics is observed for nitrogen oxides. The impact is marginal for other pollutants. We find that after data cleaning, 23 sites in 2019 became non-compliant with national ambient air quality standards for NO2. Worsening of NO2 data quality over the years increased the number of non-compliant sites to 45 in 2023 after using our tool. For PM2.5 and PM10, fewer than 5 sites changed compliance post-data cleaning. Given marginal changes in concentrations of PM2.5 and O3, premature mortality attributable to exposure to these in Delhi, Mumbai, and Kolkata changed only by <10% after data cleaning. The impact on premature mortality was substantial for exposure to NO2, with NO2-related premature deaths increasing by 8–67% in the three megacities. These findings have implications for the global research community and policy formulation and underscore the urgent need for ratified CAAQMS data.

Abstract Image

关于印度实时空气质量监测数据的紧迫问题:揭示解决方案及其对政策和健康的影响
印度不断恶化的空气质量提高了对空气质量监测的重视程度。这导致在过去十年中,全国连续环境空气质量监测点(CAAQMS)的数量增加了16倍。CAAQMS数据集在全球范围内使用,但也提出了对数据质量的担忧。缺少对这些空气质量数据问题的规模及其对政策和健康相关指标的影响进行量化的全面评估。因此,我们开发了第一个开源自动化工具来识别和解决数据问题,并将其应用于2019-2023年213个CAAQMS中的六种污染物(PM2.5, PM10, NO, NO2, NOx和O3)。CAAQMS数据集中的典型问题包括连续重复超过24小时的相似值和每天几乎在同一时间出现的异常值。我们还揭示了氮氧化物(NOx≈NO + NO2)的隐藏问题,包括(1)在不符合中央污染控制委员会参数报告协议的装置中报告NO和NO2;(2)当NO或NO2被记录为“不可用”但报告了有效的NOx数据时,数据报告不一致。受连续相似观测值和异常值影响的数据比例保持相当一致,但受单位不一致问题影响的站点在2019年至2023年期间有所增加。在中央和州污染控制委员会维护的CAAQMS之间,没有观察到数据质量问题的显着差异,说明这些问题在全国范围内的范围。我们发现,去除连续相似的观测值和异常值只会使年平均污染物浓度变化5%,但对尚未解决的单位不一致性问题进行校正,会使受其影响的站点的年平均NO2浓度显著增加80%。我们进行了单独的分析,以确认在检查COVID-19封锁影响的多项同行评议研究中没有发现和解决单元不一致问题,这可能导致报告的绝对空气质量改善不准确。数据清洗对氮氧化物的空气质量衍生指标有重大影响。对其他污染物的影响微乎其微。我们发现,经过数据清理,2019年有23个站点的二氧化氮不符合国家环境空气质量标准。使用我们的工具后,由于多年来二氧化氮数据质量的恶化,2023年不合规站点的数量增加到45个。对于PM2.5和PM10,只有不到5个站点在数据清理后改变了合规。考虑到PM2.5和O3浓度的微小变化,在数据清理后,德里、孟买和加尔各答因暴露于这些物质而导致的过早死亡率仅变化了10%。二氧化氮暴露对过早死亡的影响是巨大的,在三个特大城市中,与二氧化氮相关的过早死亡增加了8-67%。这些发现对全球研究界和政策制定具有启示意义,并强调迫切需要获得批准的CAAQMS数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Atmospheric Environment: X
Atmospheric Environment: X Environmental Science-Environmental Science (all)
CiteScore
8.00
自引率
0.00%
发文量
47
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信