Leveraging low-cost sensors to predict nitrogen dioxide for epidemiologic exposure assessment.

IF 4.1 3区 医学 Q2 ENVIRONMENTAL SCIENCES
Christopher Zuidema, Jianzhao Bi, Dustin Burnham, Nancy Carmona, Amanda J Gassett, David L Slager, Cooper Schumacher, Elena Austin, Edmund Seto, Adam A Szpiro, Lianne Sheppard
{"title":"Leveraging low-cost sensors to predict nitrogen dioxide for epidemiologic exposure assessment.","authors":"Christopher Zuidema, Jianzhao Bi, Dustin Burnham, Nancy Carmona, Amanda J Gassett, David L Slager, Cooper Schumacher, Elena Austin, Edmund Seto, Adam A Szpiro, Lianne Sheppard","doi":"10.1038/s41370-024-00667-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Statistical models of air pollution enable intra-urban characterization of pollutant concentrations, benefiting exposure assessment for environmental epidemiology. The new generation of low-cost sensors facilitate the deployment of dense monitoring networks and can potentially be used to improve intra-urban models of air pollution.</p><p><strong>Objective: </strong>Develop and evaluate a spatiotemporal model for nitrogen dioxide (NO<sub>2</sub>) in the Puget Sound region of WA, USA for the Adult Changes in Thought Air Pollution (ACT-AP) study and assess the contribution of low-cost sensor data to the model's performance through cross-validation.</p><p><strong>Methods: </strong>We developed a spatiotemporal NO<sub>2</sub> model for the study region incorporating data from 11 agency locations, 364 supplementary monitoring locations, and 117 low-cost sensor (LCS) locations for the 1996-2020 time period. Model features included long-term time trends and dimension-reduced land use regression. We evaluated the contribution of LCS network data by comparing models fit with and without sensor data using cross-validated (CV) summary performance statistics.</p><p><strong>Results: </strong>The best performing model had one time trend and geographic covariates summarized into three partial least squares components. The model, fit with LCS data, performed as well as other recent studies (agency cross-validation: CV- root mean square error (RMSE) = 2.5 ppb NO<sub>2</sub>; CV- coefficient of determination ( <math> <msup><mrow><mi>R</mi></mrow> <mrow><mn>2</mn></mrow> </msup> </math> ) = 0.85). Predictions of NO<sub>2</sub> concentrations developed with LCS were higher at residential locations compared to a model without LCS, especially in recent years. While LCS did not provide a strong performance gain at agency sites (CV-RMSE = 2.8 ppb NO<sub>2</sub>; CV- <math> <msup><mrow><mi>R</mi></mrow> <mrow><mn>2</mn></mrow> </msup> </math>  = 0.82 without LCS), at residential locations, the improvement was substantial, with RMSE = 3.8 ppb NO<sub>2</sub> and <math> <msup><mrow><mi>R</mi></mrow> <mrow><mn>2</mn></mrow> </msup> </math>  = 0.08 (without LCS), compared to CV-RMSE = 2.8 ppb NO<sub>2</sub> and CV- <math> <msup><mrow><mi>R</mi></mrow> <mrow><mn>2</mn></mrow> </msup> </math>  = 0.51 (with LCS).</p><p><strong>Impact: </strong>We developed a spatiotemporal model for nitrogen dioxide (NO<sub>2</sub>) pollution in Washington's Puget Sound region for epidemiologic exposure assessment for the Adult Changes in Thought Air Pollution study. We examined the impact of including low-cost sensor data in the NO<sub>2</sub> model and found the additional spatial information the sensors provided predicted NO<sub>2</sub> concentrations that were higher than without low-cost sensors, particularly in recent years. We did not observe a clear, substantial improvement in cross-validation performance over a similar model fit without low-cost sensor data; however, the prediction improvement with low-cost sensors at residential locations was substantial. The performance gains from low-cost sensors may have been attenuated due to spatial information provided by other supplementary monitoring data.</p>","PeriodicalId":15684,"journal":{"name":"Journal of Exposure Science and Environmental Epidemiology","volume":" ","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Exposure Science and Environmental Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41370-024-00667-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Statistical models of air pollution enable intra-urban characterization of pollutant concentrations, benefiting exposure assessment for environmental epidemiology. The new generation of low-cost sensors facilitate the deployment of dense monitoring networks and can potentially be used to improve intra-urban models of air pollution.

Objective: Develop and evaluate a spatiotemporal model for nitrogen dioxide (NO2) in the Puget Sound region of WA, USA for the Adult Changes in Thought Air Pollution (ACT-AP) study and assess the contribution of low-cost sensor data to the model's performance through cross-validation.

Methods: We developed a spatiotemporal NO2 model for the study region incorporating data from 11 agency locations, 364 supplementary monitoring locations, and 117 low-cost sensor (LCS) locations for the 1996-2020 time period. Model features included long-term time trends and dimension-reduced land use regression. We evaluated the contribution of LCS network data by comparing models fit with and without sensor data using cross-validated (CV) summary performance statistics.

Results: The best performing model had one time trend and geographic covariates summarized into three partial least squares components. The model, fit with LCS data, performed as well as other recent studies (agency cross-validation: CV- root mean square error (RMSE) = 2.5 ppb NO2; CV- coefficient of determination ( R 2 ) = 0.85). Predictions of NO2 concentrations developed with LCS were higher at residential locations compared to a model without LCS, especially in recent years. While LCS did not provide a strong performance gain at agency sites (CV-RMSE = 2.8 ppb NO2; CV- R 2  = 0.82 without LCS), at residential locations, the improvement was substantial, with RMSE = 3.8 ppb NO2 and R 2  = 0.08 (without LCS), compared to CV-RMSE = 2.8 ppb NO2 and CV- R 2  = 0.51 (with LCS).

Impact: We developed a spatiotemporal model for nitrogen dioxide (NO2) pollution in Washington's Puget Sound region for epidemiologic exposure assessment for the Adult Changes in Thought Air Pollution study. We examined the impact of including low-cost sensor data in the NO2 model and found the additional spatial information the sensors provided predicted NO2 concentrations that were higher than without low-cost sensors, particularly in recent years. We did not observe a clear, substantial improvement in cross-validation performance over a similar model fit without low-cost sensor data; however, the prediction improvement with low-cost sensors at residential locations was substantial. The performance gains from low-cost sensors may have been attenuated due to spatial information provided by other supplementary monitoring data.

Abstract Image

利用低成本传感器预测二氧化氮,进行流行病学暴露评估。
背景:空气污染统计模型能够描述城市内部的污染物浓度,有利于环境流行病学的暴露评估。新一代低成本传感器有助于部署密集的监测网络,并有可能用于改进城市内的空气污染模型:为 "成人认为空气污染的变化"(ACT-AP)研究开发并评估美国华盛顿州普吉特海湾地区的二氧化氮(NO2)时空模型,并通过交叉验证评估低成本传感器数据对模型性能的贡献:方法:我们为研究区域开发了一个二氧化氮时空模型,该模型包含了 1996-2020 年期间来自 11 个机构地点、364 个补充监测地点和 117 个低成本传感器 (LCS) 地点的数据。模型特征包括长期时间趋势和降维土地利用回归。我们使用交叉验证(CV)性能统计摘要,通过比较有传感器数据和无传感器数据的拟合模型,评估了 LCS 网络数据的贡献:结果:表现最好的模型有一个时间趋势和地理协变量,归纳为三个偏最小二乘成分。使用 LCS 数据拟合的模型与近期的其他研究(机构交叉验证、CV-均方根误差(CV-root mean square error)、CV-均方根误差(CV-root mean square error)、CV-均方根误差(CV-root mean square error))一样性能良好:CV-均方根误差 (RMSE) = 2.5 ppb NO2;CV-判定系数 ( R 2 ) = 0.85)。与不使用 LCS 的模型相比,使用 LCS 预测的居民点二氧化氮浓度更高,尤其是在近几年。虽然 LCS 在机构地点的性能提升不大(CV-RMSE = 2.8 ppb NO2;无 LCS 时 CV- R 2 = 0.82),但在居民点的性能提升很大,RMSE = 3.8 ppb NO2,R 2 = 0.08(无 LCS),而 CV-RMSE = 2.8 ppb NO2,CV- R 2 = 0.51(有 LCS):我们为华盛顿州普吉特海湾地区的二氧化氮(NO2)污染开发了一个时空模型,用于 "成人思想空气污染变化 "研究的流行病学暴露评估。我们研究了在二氧化氮模型中加入低成本传感器数据的影响,发现传感器提供的额外空间信息所预测的二氧化氮浓度高于没有低成本传感器的情况,尤其是在最近几年。与没有低成本传感器数据的类似模型相比,我们没有观察到交叉验证性能有明显的大幅提高;但是,在居民点使用低成本传感器后,预测性能有了大幅提高。由于其他补充监测数据提供了空间信息,低成本传感器带来的性能提升可能有所减弱。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.90
自引率
6.70%
发文量
93
审稿时长
3 months
期刊介绍: Journal of Exposure Science and Environmental Epidemiology (JESEE) aims to be the premier and authoritative source of information on advances in exposure science for professionals in a wide range of environmental and public health disciplines. JESEE publishes original peer-reviewed research presenting significant advances in exposure science and exposure analysis, including development and application of the latest technologies for measuring exposures, and innovative computational approaches for translating novel data streams to characterize and predict exposures. The types of papers published in the research section of JESEE are original research articles, translation studies, and correspondence. Reported results should further understanding of the relationship between environmental exposure and human health, describe evaluated novel exposure science tools, or demonstrate potential of exposure science to enable decisions and actions that promote and protect human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信