Sherry WeMott , Grace Kuiper , Sheena E. Martenies , Matthew D. Koslovsky , William B. Allshouse , John L. Adgate , Anne P. Starling , Dana Dabelea , Sheryl Magzamen
{"title":"评价预测城市出生队列室内黑碳的统计方法","authors":"Sherry WeMott , Grace Kuiper , Sheena E. Martenies , Matthew D. Koslovsky , William B. Allshouse , John L. Adgate , Anne P. Starling , Dana Dabelea , Sheryl Magzamen","doi":"10.1016/j.indenv.2025.100084","DOIUrl":null,"url":null,"abstract":"<div><div>Most air pollution epidemiology studies rely on outdoor exposure data from various sources, such as reference monitors, low-cost monitors, models, or Earth observations. However, people spend 90 % of their time indoors, with 70 % of that time spent at home, which may result in misclassification of air pollution exposure when using data reflecting ambient concentrations. In this study, we evaluated methods to predict residential indoor black carbon (BC) from outdoor BC, PM2.5, and housing characteristics to support future efforts in estimating personal air pollution exposure. Households from the Healthy Start cohort in Denver, CO hosted paired indoor/outdoor low-cost air samplers for one-week periods during spring 2018, summer 2018, and winter 2019. Participants completed questionnaires about housing characteristics like building type, flooring, and heating and cooling methods. Filters were analyzed for BC using transmissometry. Ridge, LASSO and ordinary least squares regression (OLS) techniques were used to build predictive models of indoor BC given the available set of covariates. Leave-one-out cross-validation was used to assess the predictive accuracy of each model. We hypothesized that Ridge and LASSO will obtain improved predictive performance over the OLS model due to regularization. A total of 27 households participated, with 39 paired measurements available after data cleaning. All winter data were excluded due to high variability and incomplete sampling times for outdoor measurements. Performance issues suggested insufficient weatherproofing of monitors for low temperatures. The Ridge regression showed the best predictive performance. The final inference model included outdoor PM<sub>2.5</sub>, hard floors, and the presence of pets in the home, accounting for approximately 28 % of the variability in indoor BC concentrations measured in participant homes. In the absence of indoor monitoring, household characteristics like flooring and the presence of pets can help predict indoor levels of BC.</div></div>","PeriodicalId":100665,"journal":{"name":"Indoor Environments","volume":"2 2","pages":"Article 100084"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating statistical methods to predict indoor black carbon in an urban birth cohort\",\"authors\":\"Sherry WeMott , Grace Kuiper , Sheena E. Martenies , Matthew D. Koslovsky , William B. Allshouse , John L. Adgate , Anne P. Starling , Dana Dabelea , Sheryl Magzamen\",\"doi\":\"10.1016/j.indenv.2025.100084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Most air pollution epidemiology studies rely on outdoor exposure data from various sources, such as reference monitors, low-cost monitors, models, or Earth observations. However, people spend 90 % of their time indoors, with 70 % of that time spent at home, which may result in misclassification of air pollution exposure when using data reflecting ambient concentrations. In this study, we evaluated methods to predict residential indoor black carbon (BC) from outdoor BC, PM2.5, and housing characteristics to support future efforts in estimating personal air pollution exposure. Households from the Healthy Start cohort in Denver, CO hosted paired indoor/outdoor low-cost air samplers for one-week periods during spring 2018, summer 2018, and winter 2019. Participants completed questionnaires about housing characteristics like building type, flooring, and heating and cooling methods. Filters were analyzed for BC using transmissometry. Ridge, LASSO and ordinary least squares regression (OLS) techniques were used to build predictive models of indoor BC given the available set of covariates. Leave-one-out cross-validation was used to assess the predictive accuracy of each model. We hypothesized that Ridge and LASSO will obtain improved predictive performance over the OLS model due to regularization. A total of 27 households participated, with 39 paired measurements available after data cleaning. All winter data were excluded due to high variability and incomplete sampling times for outdoor measurements. Performance issues suggested insufficient weatherproofing of monitors for low temperatures. The Ridge regression showed the best predictive performance. The final inference model included outdoor PM<sub>2.5</sub>, hard floors, and the presence of pets in the home, accounting for approximately 28 % of the variability in indoor BC concentrations measured in participant homes. In the absence of indoor monitoring, household characteristics like flooring and the presence of pets can help predict indoor levels of BC.</div></div>\",\"PeriodicalId\":100665,\"journal\":{\"name\":\"Indoor Environments\",\"volume\":\"2 2\",\"pages\":\"Article 100084\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Indoor Environments\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S295036202500013X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indoor Environments","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S295036202500013X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Evaluating statistical methods to predict indoor black carbon in an urban birth cohort
Most air pollution epidemiology studies rely on outdoor exposure data from various sources, such as reference monitors, low-cost monitors, models, or Earth observations. However, people spend 90 % of their time indoors, with 70 % of that time spent at home, which may result in misclassification of air pollution exposure when using data reflecting ambient concentrations. In this study, we evaluated methods to predict residential indoor black carbon (BC) from outdoor BC, PM2.5, and housing characteristics to support future efforts in estimating personal air pollution exposure. Households from the Healthy Start cohort in Denver, CO hosted paired indoor/outdoor low-cost air samplers for one-week periods during spring 2018, summer 2018, and winter 2019. Participants completed questionnaires about housing characteristics like building type, flooring, and heating and cooling methods. Filters were analyzed for BC using transmissometry. Ridge, LASSO and ordinary least squares regression (OLS) techniques were used to build predictive models of indoor BC given the available set of covariates. Leave-one-out cross-validation was used to assess the predictive accuracy of each model. We hypothesized that Ridge and LASSO will obtain improved predictive performance over the OLS model due to regularization. A total of 27 households participated, with 39 paired measurements available after data cleaning. All winter data were excluded due to high variability and incomplete sampling times for outdoor measurements. Performance issues suggested insufficient weatherproofing of monitors for low temperatures. The Ridge regression showed the best predictive performance. The final inference model included outdoor PM2.5, hard floors, and the presence of pets in the home, accounting for approximately 28 % of the variability in indoor BC concentrations measured in participant homes. In the absence of indoor monitoring, household characteristics like flooring and the presence of pets can help predict indoor levels of BC.