Evaluating statistical methods to predict indoor black carbon in an urban birth cohort

Sherry WeMott , Grace Kuiper , Sheena E. Martenies , Matthew D. Koslovsky , William B. Allshouse , John L. Adgate , Anne P. Starling , Dana Dabelea , Sheryl Magzamen
{"title":"Evaluating statistical methods to predict indoor black carbon in an urban birth cohort","authors":"Sherry WeMott ,&nbsp;Grace Kuiper ,&nbsp;Sheena E. Martenies ,&nbsp;Matthew D. Koslovsky ,&nbsp;William B. Allshouse ,&nbsp;John L. Adgate ,&nbsp;Anne P. Starling ,&nbsp;Dana Dabelea ,&nbsp;Sheryl Magzamen","doi":"10.1016/j.indenv.2025.100084","DOIUrl":null,"url":null,"abstract":"<div><div>Most air pollution epidemiology studies rely on outdoor exposure data from various sources, such as reference monitors, low-cost monitors, models, or Earth observations. However, people spend 90 % of their time indoors, with 70 % of that time spent at home, which may result in misclassification of air pollution exposure when using data reflecting ambient concentrations. In this study, we evaluated methods to predict residential indoor black carbon (BC) from outdoor BC, PM2.5, and housing characteristics to support future efforts in estimating personal air pollution exposure. Households from the Healthy Start cohort in Denver, CO hosted paired indoor/outdoor low-cost air samplers for one-week periods during spring 2018, summer 2018, and winter 2019. Participants completed questionnaires about housing characteristics like building type, flooring, and heating and cooling methods. Filters were analyzed for BC using transmissometry. Ridge, LASSO and ordinary least squares regression (OLS) techniques were used to build predictive models of indoor BC given the available set of covariates. Leave-one-out cross-validation was used to assess the predictive accuracy of each model. We hypothesized that Ridge and LASSO will obtain improved predictive performance over the OLS model due to regularization. A total of 27 households participated, with 39 paired measurements available after data cleaning. All winter data were excluded due to high variability and incomplete sampling times for outdoor measurements. Performance issues suggested insufficient weatherproofing of monitors for low temperatures. The Ridge regression showed the best predictive performance. The final inference model included outdoor PM<sub>2.5</sub>, hard floors, and the presence of pets in the home, accounting for approximately 28 % of the variability in indoor BC concentrations measured in participant homes. In the absence of indoor monitoring, household characteristics like flooring and the presence of pets can help predict indoor levels of BC.</div></div>","PeriodicalId":100665,"journal":{"name":"Indoor Environments","volume":"2 2","pages":"Article 100084"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indoor Environments","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S295036202500013X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Most air pollution epidemiology studies rely on outdoor exposure data from various sources, such as reference monitors, low-cost monitors, models, or Earth observations. However, people spend 90 % of their time indoors, with 70 % of that time spent at home, which may result in misclassification of air pollution exposure when using data reflecting ambient concentrations. In this study, we evaluated methods to predict residential indoor black carbon (BC) from outdoor BC, PM2.5, and housing characteristics to support future efforts in estimating personal air pollution exposure. Households from the Healthy Start cohort in Denver, CO hosted paired indoor/outdoor low-cost air samplers for one-week periods during spring 2018, summer 2018, and winter 2019. Participants completed questionnaires about housing characteristics like building type, flooring, and heating and cooling methods. Filters were analyzed for BC using transmissometry. Ridge, LASSO and ordinary least squares regression (OLS) techniques were used to build predictive models of indoor BC given the available set of covariates. Leave-one-out cross-validation was used to assess the predictive accuracy of each model. We hypothesized that Ridge and LASSO will obtain improved predictive performance over the OLS model due to regularization. A total of 27 households participated, with 39 paired measurements available after data cleaning. All winter data were excluded due to high variability and incomplete sampling times for outdoor measurements. Performance issues suggested insufficient weatherproofing of monitors for low temperatures. The Ridge regression showed the best predictive performance. The final inference model included outdoor PM2.5, hard floors, and the presence of pets in the home, accounting for approximately 28 % of the variability in indoor BC concentrations measured in participant homes. In the absence of indoor monitoring, household characteristics like flooring and the presence of pets can help predict indoor levels of BC.
评价预测城市出生队列室内黑碳的统计方法
大多数空气污染流行病学研究依赖于各种来源的室外暴露数据,如参考监测仪、低成本监测仪、模型或地球观测。然而,人们90% %的时间在室内度过,70% %的时间在家中度过,这可能导致在使用反映环境浓度的数据时对空气污染暴露进行错误分类。在这项研究中,我们评估了从室外黑碳、PM2.5和住房特征预测住宅室内黑碳(BC)的方法,以支持未来估算个人空气污染暴露的努力。来自科罗拉多州丹佛市“健康起步”队列的家庭在2018年春季、2018年夏季和2019年冬季举办了为期一周的成对室内/室外低成本空气采样器。参与者完成了关于房屋特征的问卷调查,如建筑类型、地板、供暖和制冷方式。用透射法对滤光片进行了分析。使用Ridge、LASSO和普通最小二乘回归(OLS)技术建立室内BC的预测模型,给定一组可用的协变量。使用留一交叉验证来评估每个模型的预测准确性。我们假设由于正则化,Ridge和LASSO将比OLS模型获得更好的预测性能。共有27个家庭参与,数据清洗后可获得39对测量结果。由于室外测量的高变异性和不完整的采样时间,所有冬季数据都被排除在外。性能问题表明显示器对低温的防风雨性不足。Ridge回归的预测效果最好。最终的推断模型包括室外PM2.5、硬地板和家中宠物的存在,约占参与家庭室内BC浓度变化的28% %。在没有室内监测的情况下,地板和宠物等家庭特征可以帮助预测室内BC水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信