Comparison of Long-Term Air Pollution Exposure from Mobile and Routine Monitoring, Low-Cost Sensors, and Dispersion Models.

Research report (Health Effects Institute) Pub Date : 2025-03-01

G Hoek, F Bouma, N Janssen, J Wesseling, S van Ratingen, J Kerckhoffs, U Gehring, W Hendricx, R Vermeulen, K de Hoogh

{"title":"Comparison of Long-Term Air Pollution Exposure from Mobile and Routine Monitoring, Low-Cost Sensors, and Dispersion Models.","authors":"G Hoek, F Bouma, N Janssen, J Wesseling, S van Ratingen, J Kerckhoffs, U Gehring, W Hendricx, R Vermeulen, K de Hoogh","doi":"","DOIUrl":null,"url":null,"abstract":"Introduction: Assessment of long-term exposure to outdoor air pollution remains a major challenge for epidemiological studies. One of these challenges is characterizing fine-scale spatial variation of the ambient concentrations of key traffic-related air pollutants - including ultrafine particles (UFPs), black carbon (BC), and nitrogen dioxide (NO2). Epidemiological studies have used widely different approaches to address these challenges, including empirical land use regression (LUR) models based on fixed-site routine or targeted monitoring, low-cost sensor networks, mobile monitoring, and deterministic dispersion models. Little information is available about the relative performance of these different approaches for assessing long-term exposure to traffic-related air pollution. Different methods may result in heterogeneity in health effect estimates from epidemiological studies applying different exposure-assessment approaches.The Specific Aims of the study.1. Develop long-term ambient air pollution exposure estimates for selected epidemiological studies based on low-cost sensors, mobile and fixed-site monitoring, and deterministic dispersion modeling.2. Compare different exposure assessment methods in terms of their ability to predict spatial variation of long-term average concentrations using external validation data.3. Compare different exposure assessment methods in terms of air pollution effect estimates in selected epidemiological studies.We assessed UFPs, NO2, BC, and particulate matter ≤2.5 μm in aerodynamic diameter (PM2.5).Methods: We evaluated annual average air pollution concentrations across the Netherlands using a suite of different exposure models, which differed in modeling approach (empirical LUR, deterministic dispersion models) and monitoring data used (low-cost sensors, mobile monitoring, nationwide and Europewide routine monitoring, and study-specific targeted monitoring). For empirical models, we tested three model development algorithms: supervised linear regression (SLR), Random Forest, and least absolute shrinkage and selection operator (LASSO). The predictions of the models were compared at 20,000 addresses across the Netherlands. The performance was also tested on external validation data, which were obtained from a new campaign (2021-2023) and existing data from different years, allowing assessment of how well recent models predict past air pollution exposure. Epidemiological analyses in three cohort studies were conducted to compare health effect estimates of the different exposure models. We assessed associations of air pollution in a national administrative cohort with natural-cause and cause-specific mortality, in a cohort study that had detailed lifestyle data with natural-cause mortality and incidence of stroke and coronary events, and in a mature birth cohort with lung function and asthma incidence.Results: Exposure predictions at residential sites from the dispersion model and the Europewide hybrid LUR models were available for multiple years in the period 2010-2019. For these models, exposure predictions of different years in the period 2010-2019 were highly correlated for BC, NO2, and PM2.5 (Correlation coefficient R > 0.9). Consistently, the year of the exposure model did not affect the presence of an association with mortality and morbidity. Small differences in hazard ratios (HR) were related to exposure contrast for different years. The HR for the association of NO2 with natural-cause mortality was 1.026 (95% confidence interval [CI]: 1.022-1.031) for the 2010 exposure estimate and 1.030 (1.024-1.035) for the 2019 exposure estimate of the Europewide LUR model, expressed per 10 µg/m3.The exposure models generally resulted in highly to moderately correlated exposure predictions at residential sites across the Netherlands (R > 0.7 for BC, NO2, and UFPs; R > 0.5 for PM2.5). The predicted level of exposure and exposure contrast could differ substantially between models and algorithms within models; for example, the interquartile range (IQR) for BC for each of the various models at the 20,000 residential locations ranged between 0.1 and 2.2 µg/m3. Mobile monitoring studies generally resulted in modestly higher BC concentrations and exposure contrasts compared to other exposure models. Small differences were found between the different models in explaining the spatial variation of air pollution concentrations at the new and existing validation sites. Models explained historical exposure patterns at external sites covering more than 10 years moderately well, especially for BC (R > 0.7) and NO2 (R > 0.7), and moderately so for UFPs (R > 0.5). Most models predicted the small concentration contrasts of PM2.5 relatively poorly.Consistent with the high correlation of the different exposure models, the application of these models generally resulted in similar conclusions on the presence of associations with natural-cause, respiratory, and lung cancer mortality in the large nationwide cohort, and with asthma incidence and lung function in the birth cohort. However, the effect estimates differed substantially; for example, the HR for natural-cause mortality in the nationwide administrative cohort for a 1 µg/m3 increase in BC ranged from 1.01 (95% CI: 0.99-1.02) to 1.09 (1.07-1.10). For the outcomes with small effect estimates and the smaller cohort studies, differences in conclusions related to the exposure assessment method were more distinct.Differences in exposure assessment may contribute substantially to the observed heterogeneity of effect estimates in systematic reviews of epidemiological studies. High heterogeneity was indicated by the commonly used heterogeneity measure I2, where the value was above 80% for a meta-analysis of the different effect estimates for natural-cause mortality in the nationwide cohort.Validation of long-term exposure models for the nonroutinely monitored pollutants BC and especially UFPs was challenging, despite generally successful monitoring. The new external validation monitoring campaign resulted in rather unstable estimates of the long-term average spatial contrast, both across sites and where affected by temporal variation, especially for BC and PM2.5.No consistent differences were found in the model performance of SLR, Random Forest, and LASSO, both in internal cross-validation of model building and on external validation sites not used in model building. Exposure predictions from the three algorithms were generally highly correlated and resulted in similar associations with health. However, for individual models, occasionally large differences were found in exposure contrast, validation statistics, and associations with mortality and morbidity outcomes.There was little benefit in using low-cost sensors for NO2 and PM2.5. The addition of low-cost sensor data did not improve NO2 estimates in models that combined dispersion model estimates and data from the national monitoring network data.Conclusions: The main conclusions of the project.• Exposure predictions of BC, NO2, and PM2.5 for different years between 2010-2019 were highly correlated, documenting stable spatial contrast patterns. Consistently, the year of the exposure model did not affect the presence of an association with mortality and morbidity outcomes.• Models explained historical exposure patterns at external sites covering more than 10 years moderately well, especially for BC.• Different exposure models generally resulted in highly to moderately correlated exposure predictions. The predicted level of exposure and exposure contrast could differ substantially between models. Small differences were found between the different models in explaining spatial variation at validation sites.• Application of different exposure models resulted in similar conclusions about the presence of associations with health outcomes, but effect estimates differed substantially in magnitude between individual exposure models. No consistent differences in effect estimates were found between groups of mobile, dispersion, and fixed-site LUR models.• Differences in exposure models may therefore contribute substantially to the observed heterogeneity of effect estimates in systematic reviews of epidemiological studies. Factors that explained some of the heterogeneity of effect estimates included the performance of the model at external validation sites and the predicted exposure contrast.• Exposure predictions from the three algorithms were generally highly correlated and resulted in similar associations with health. No consistent differences were found in their model performances.","PeriodicalId":74687,"journal":{"name":"Research report (Health Effects Institute)","volume":" 226","pages":"1-101"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099207/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research report (Health Effects Institute)","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: Assessment of long-term exposure to outdoor air pollution remains a major challenge for epidemiological studies. One of these challenges is characterizing fine-scale spatial variation of the ambient concentrations of key traffic-related air pollutants - including ultrafine particles (UFPs), black carbon (BC), and nitrogen dioxide (NO₂). Epidemiological studies have used widely different approaches to address these challenges, including empirical land use regression (LUR) models based on fixed-site routine or targeted monitoring, low-cost sensor networks, mobile monitoring, and deterministic dispersion models. Little information is available about the relative performance of these different approaches for assessing long-term exposure to traffic-related air pollution. Different methods may result in heterogeneity in health effect estimates from epidemiological studies applying different exposure-assessment approaches.

The Specific Aims of the study.

1. Develop long-term ambient air pollution exposure estimates for selected epidemiological studies based on low-cost sensors, mobile and fixed-site monitoring, and deterministic dispersion modeling.

2. Compare different exposure assessment methods in terms of their ability to predict spatial variation of long-term average concentrations using external validation data.

3. Compare different exposure assessment methods in terms of air pollution effect estimates in selected epidemiological studies.

We assessed UFPs, NO₂, BC, and particulate matter ≤2.5 μm in aerodynamic diameter (PM_2.5).

Methods: We evaluated annual average air pollution concentrations across the Netherlands using a suite of different exposure models, which differed in modeling approach (empirical LUR, deterministic dispersion models) and monitoring data used (low-cost sensors, mobile monitoring, nationwide and Europewide routine monitoring, and study-specific targeted monitoring). For empirical models, we tested three model development algorithms: supervised linear regression (SLR), Random Forest, and least absolute shrinkage and selection operator (LASSO). The predictions of the models were compared at 20,000 addresses across the Netherlands. The performance was also tested on external validation data, which were obtained from a new campaign (2021-2023) and existing data from different years, allowing assessment of how well recent models predict past air pollution exposure. Epidemiological analyses in three cohort studies were conducted to compare health effect estimates of the different exposure models. We assessed associations of air pollution in a national administrative cohort with natural-cause and cause-specific mortality, in a cohort study that had detailed lifestyle data with natural-cause mortality and incidence of stroke and coronary events, and in a mature birth cohort with lung function and asthma incidence.

Results: Exposure predictions at residential sites from the dispersion model and the Europewide hybrid LUR models were available for multiple years in the period 2010-2019. For these models, exposure predictions of different years in the period 2010-2019 were highly correlated for BC, NO₂, and PM_2.5 (Correlation coefficient R > 0.9). Consistently, the year of the exposure model did not affect the presence of an association with mortality and morbidity. Small differences in hazard ratios (HR) were related to exposure contrast for different years. The HR for the association of NO₂ with natural-cause mortality was 1.026 (95% confidence interval [CI]: 1.022-1.031) for the 2010 exposure estimate and 1.030 (1.024-1.035) for the 2019 exposure estimate of the Europewide LUR model, expressed per 10 µg/m³.

The exposure models generally resulted in highly to moderately correlated exposure predictions at residential sites across the Netherlands (R > 0.7 for BC, NO₂, and UFPs; R > 0.5 for PM_2.5). The predicted level of exposure and exposure contrast could differ substantially between models and algorithms within models; for example, the interquartile range (IQR) for BC for each of the various models at the 20,000 residential locations ranged between 0.1 and 2.2 µg/m³. Mobile monitoring studies generally resulted in modestly higher BC concentrations and exposure contrasts compared to other exposure models. Small differences were found between the different models in explaining the spatial variation of air pollution concentrations at the new and existing validation sites. Models explained historical exposure patterns at external sites covering more than 10 years moderately well, especially for BC (R > 0.7) and NO₂ (R > 0.7), and moderately so for UFPs (R > 0.5). Most models predicted the small concentration contrasts of PM_2.5 relatively poorly.

Consistent with the high correlation of the different exposure models, the application of these models generally resulted in similar conclusions on the presence of associations with natural-cause, respiratory, and lung cancer mortality in the large nationwide cohort, and with asthma incidence and lung function in the birth cohort. However, the effect estimates differed substantially; for example, the HR for natural-cause mortality in the nationwide administrative cohort for a 1 µg/m³ increase in BC ranged from 1.01 (95% CI: 0.99-1.02) to 1.09 (1.07-1.10). For the outcomes with small effect estimates and the smaller cohort studies, differences in conclusions related to the exposure assessment method were more distinct.

Differences in exposure assessment may contribute substantially to the observed heterogeneity of effect estimates in systematic reviews of epidemiological studies. High heterogeneity was indicated by the commonly used heterogeneity measure I², where the value was above 80% for a meta-analysis of the different effect estimates for natural-cause mortality in the nationwide cohort.

Validation of long-term exposure models for the nonroutinely monitored pollutants BC and especially UFPs was challenging, despite generally successful monitoring. The new external validation monitoring campaign resulted in rather unstable estimates of the long-term average spatial contrast, both across sites and where affected by temporal variation, especially for BC and PM_2.5.

No consistent differences were found in the model performance of SLR, Random Forest, and LASSO, both in internal cross-validation of model building and on external validation sites not used in model building. Exposure predictions from the three algorithms were generally highly correlated and resulted in similar associations with health. However, for individual models, occasionally large differences were found in exposure contrast, validation statistics, and associations with mortality and morbidity outcomes.

There was little benefit in using low-cost sensors for NO₂ and PM_2.5. The addition of low-cost sensor data did not improve NO₂ estimates in models that combined dispersion model estimates and data from the national monitoring network data.

Conclusions: The main conclusions of the project.

• Exposure predictions of BC, NO₂, and PM_2.5 for different years between 2010-2019 were highly correlated, documenting stable spatial contrast patterns. Consistently, the year of the exposure model did not affect the presence of an association with mortality and morbidity outcomes.

• Models explained historical exposure patterns at external sites covering more than 10 years moderately well, especially for BC.

• Different exposure models generally resulted in highly to moderately correlated exposure predictions. The predicted level of exposure and exposure contrast could differ substantially between models. Small differences were found between the different models in explaining spatial variation at validation sites.

• Application of different exposure models resulted in similar conclusions about the presence of associations with health outcomes, but effect estimates differed substantially in magnitude between individual exposure models. No consistent differences in effect estimates were found between groups of mobile, dispersion, and fixed-site LUR models.

• Differences in exposure models may therefore contribute substantially to the observed heterogeneity of effect estimates in systematic reviews of epidemiological studies. Factors that explained some of the heterogeneity of effect estimates included the performance of the model at external validation sites and the predicted exposure contrast.

• Exposure predictions from the three algorithms were generally highly correlated and resulted in similar associations with health. No consistent differences were found in their model performances.

本刊更多论文

移动监测和常规监测、低成本传感器和扩散模型对长期空气污染暴露的比较。

导言：评估长期暴露于室外空气污染仍然是流行病学研究的一个主要挑战。其中一个挑战是表征与交通相关的主要空气污染物（包括超细颗粒（ufp）、黑碳（BC）和二氧化氮（NO2））的环境浓度的精细尺度空间变化。流行病学研究使用了广泛不同的方法来应对这些挑战，包括基于固定站点常规或目标监测的经验土地利用回归（LUR）模型、低成本传感器网络、移动监测和确定性分散模型。关于这些评估长期暴露于交通相关空气污染的不同方法的相对性能的信息很少。不同的方法可能导致采用不同暴露评估方法的流行病学研究中健康影响估计的异质性。研究的具体目的。1 .根据低成本传感器、移动和固定地点监测以及确定性扩散模型，为选定的流行病学研究制定长期环境空气污染暴露估计。比较不同的暴露评估方法在使用外部验证数据预测长期平均浓度空间变化方面的能力。在选定的流行病学研究中比较不同的接触评估方法对空气污染影响的估计。我们评估了ufp、NO2、BC和空气动力学直径≤2.5 μm的颗粒物（PM2.5）。方法：我们使用一套不同的暴露模型评估了荷兰的年平均空气污染浓度，这些模型在建模方法（经验LUR，确定性弥散模型）和使用的监测数据（低成本传感器，移动监测，全国和欧洲范围内的常规监测以及研究特定目标监测）方面存在差异。对于经验模型，我们测试了三种模型开发算法：监督线性回归（SLR）、随机森林（Random Forest）和最小绝对收缩和选择算子（LASSO）。这些模型的预测结果在荷兰各地的2万个地址进行了比较。该模型的性能还在外部验证数据上进行了测试，这些数据来自一个新的运动（2021-2023）和不同年份的现有数据，从而可以评估最近的模型预测过去空气污染暴露的效果。在三个队列研究中进行了流行病学分析，以比较不同暴露模型的健康影响估计。我们在一项国家行政队列研究中评估了空气污染与自然原因死亡率和特定原因死亡率的关系，在一项队列研究中评估了详细的生活方式数据与自然原因死亡率、中风和冠状动脉事件发生率的关系，在一项成熟的出生队列研究中评估了肺功能和哮喘发病率的关系。结果：在2010-2019年期间，利用分散模型和全欧洲混合LUR模型对居民点的暴露进行了多年预测。对于这些模型，2010-2019年期间不同年份的暴露预测与BC、NO2和PM2.5高度相关（相关系数R > 0.9）。与此一致的是，暴露模型的年份并不影响与死亡率和发病率之间存在的关联。危险比（HR）的微小差异与不同年份的暴露对比有关。在欧洲范围内的LUR模型中，2010年暴露估计NO2与自然原因死亡率相关的HR为1.026(95%可信区间[CI]: 1.022-1.031)， 2019年暴露估计的HR为1.030(1.024-1.035)，以每10µg/m3表示。暴露模型通常在荷兰各地的居民点得出高度到中度相关的暴露预测(BC、NO2和ufp的R为0.7；r>.5为PM2.5)。模型之间和模型内算法之间的预测暴露水平和暴露对比度可能存在很大差异；例如，在20,000个住宅地点的各种模型中，BC的四分位数范围（IQR）在0.1至2.2 μ g/m3之间。与其他暴露模型相比，移动监测研究通常导致BC浓度和暴露对比略高。不同的模型在解释新的和现有的验证点的空气污染浓度的空间变化方面存在微小的差异。模型较好地解释了超过10年的外部站点的历史暴露模式，特别是对BC （R >.7）和NO2 (R >.7)，对ufp （R >.5）也有中等程度的解释。大多数模型对PM2.5浓度差异的预测相对较差。与不同暴露模型的高度相关性一致，这些模型的应用通常得出类似的结论，即在全国大型队列中存在与自然原因、呼吸和肺癌死亡率的关联，以及与出生队列中哮喘发病率和肺功能的关联。然而，对效果的估计差异很大；例如，在全国行政队列中，BC每增加1微克/立方米，自然原因死亡率的HR范围从1.01 （95% CI: 0.99-1.02）到1.09（1.07-1.10）。对于效应估计较小的结果和较小的队列研究，与暴露评估方法相关的结论差异更为明显。在流行病学研究的系统回顾中，暴露评估的差异可能在很大程度上导致观察到的效应估计的异质性。通常使用的异质性测量I2显示了高异质性，在对全国队列中自然原因死亡率的不同影响估计的荟萃分析中，其值高于80%。尽管监测总体上是成功的，但对非常规监测的污染物，特别是ufp的长期暴露模型的验证是具有挑战性的。新的外部验证监测活动导致长期平均空间对比的估计相当不稳定，无论是跨站点还是受时间变化影响的地方，特别是BC和PM2.5。无论是在模型构建的内部交叉验证中，还是在模型构建中未使用的外部验证站点上，SLR、Random Forest和LASSO的模型性能都没有发现一致的差异。来自这三种算法的暴露预测通常高度相关，并导致与健康相似的关联。然而，对于单个模型，偶尔会发现暴露对比、验证统计以及与死亡率和发病率结果的关联存在较大差异。使用低成本的二氧化氮和PM2.5传感器几乎没有什么好处。在将分散模型估计值与国家监测网络数据相结合的模型中，添加低成本传感器数据并没有改善NO2估计值。结论：本研究的主要结论。•2010-2019年不同年份的BC、NO2和PM2.5暴露预测高度相关，记录了稳定的空间对比模式。一致地，暴露模型的年份不影响与死亡率和发病率结果的关联。•模型较好地解释了10年以上外部站点的历史暴露模式，特别是BC省。•不同的暴露模型通常导致高度或中度相关的暴露预测。预测的暴露水平和暴露对比度在不同的模型之间可能有很大的不同。不同的模型在解释验证点的空间变化方面存在较小的差异。•应用不同的暴露模型得出了与健康结果存在关联的类似结论，但不同暴露模型之间的影响估计在量级上存在很大差异。在移动、分散和固定地点LUR模型组之间的效果估计没有一致的差异。•因此，暴露模型的差异可能在很大程度上导致流行病学研究系统评价中观察到的效应估计的异质性。解释效果估计的一些异质性的因素包括模型在外部验证点的表现和预测的暴露对比。•来自三种算法的暴露预测通常高度相关，并导致与健康相似的关联。他们的模型性能没有一致的差异。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Research report (Health Effects Institute)

自引率

0.00%

发文量