Identifying Predictors of Spatiotemporal Variations in Residential Radon Concentrations across North Carolina Using Machine Learning Analytics

IF 7.6 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Zhenchun Yang, Lauren Prox, Clare Meernik, Yadurshini Raveendran, David Press, Phillip Gibson, Amie Koch, Olufemi Ajumobi, Jeffrey Clarke, Ruoxue Chen, Junfeng (Jim) Zhang, Tomi Akinyemiju
{"title":"Identifying Predictors of Spatiotemporal Variations in Residential Radon Concentrations across North Carolina Using Machine Learning Analytics","authors":"Zhenchun Yang, Lauren Prox, Clare Meernik, Yadurshini Raveendran, David Press, Phillip Gibson, Amie Koch, Olufemi Ajumobi, Jeffrey Clarke, Ruoxue Chen, Junfeng (Jim) Zhang, Tomi Akinyemiju","doi":"10.1016/j.envpol.2024.125592","DOIUrl":null,"url":null,"abstract":"Radon is a naturally occurring radioactive gas derived from the decay of uranium in the Earth’s crust. Radon exposure is the leading cause of lung cancer among non-smokers in the US. Radon infiltrates homes through soil and building foundations. This study advances methodologies for assessing residential radon exposure by leveraging a comprehensive dataset of 126,382 short-term (2-7 days) radon test results collected across North Carolina from 2010 to 2020. Employing a combination of linear regression and advanced machine learning techniques, including random forest models.Analysis through linear regression, linear mixed-effects models (LME), and generalized additive models (GAM) using the first-time tested radon levels reveals that elevation, proximity to geological faults, and soil moisture are pivotal in determining radon concentration. Specifically, elevation consistently shows a positive relationship with radon levels across models (linear regression: β=0.12, p<0.001; LME: β=0.17, p<0.001; GAM: β=0.11, p<0.001). Conversely, the distance to geological faults negatively correlates with radon concentration (linear regression: β=-0.11, p<0.001; LME: β=-0.06, p<0.001; GAM: β=-0.07, p<0.001), indicating lower radon levels further from faults.Using the random forest model, our study identifies the most influential environmental predictors of first-time tested radon levels. Elevation is the most influential variable, followed by median instantaneous surface pressure and soil moisture in the upper 10 cm layer, illustrating the significant role of geological and immediate surface conditions. Additional important factors include precipitation, mean temperature, and deeper soil moisture levels (40-200 cm), which underscores the influence of climate on radon variability. Root zone soil moisture and the Normalized Difference Vegetation Index (NDVI) also contribute to predicting radon levels, reflecting the importance of soil and vegetation dynamics in radon emanation. By integrating multiple statistical models, this research provides a nuanced understanding of the predictors of radon concentration, enhancing predictive accuracy and reliability.","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"116 1","pages":""},"PeriodicalIF":7.6000,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.envpol.2024.125592","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Radon is a naturally occurring radioactive gas derived from the decay of uranium in the Earth’s crust. Radon exposure is the leading cause of lung cancer among non-smokers in the US. Radon infiltrates homes through soil and building foundations. This study advances methodologies for assessing residential radon exposure by leveraging a comprehensive dataset of 126,382 short-term (2-7 days) radon test results collected across North Carolina from 2010 to 2020. Employing a combination of linear regression and advanced machine learning techniques, including random forest models.Analysis through linear regression, linear mixed-effects models (LME), and generalized additive models (GAM) using the first-time tested radon levels reveals that elevation, proximity to geological faults, and soil moisture are pivotal in determining radon concentration. Specifically, elevation consistently shows a positive relationship with radon levels across models (linear regression: β=0.12, p<0.001; LME: β=0.17, p<0.001; GAM: β=0.11, p<0.001). Conversely, the distance to geological faults negatively correlates with radon concentration (linear regression: β=-0.11, p<0.001; LME: β=-0.06, p<0.001; GAM: β=-0.07, p<0.001), indicating lower radon levels further from faults.Using the random forest model, our study identifies the most influential environmental predictors of first-time tested radon levels. Elevation is the most influential variable, followed by median instantaneous surface pressure and soil moisture in the upper 10 cm layer, illustrating the significant role of geological and immediate surface conditions. Additional important factors include precipitation, mean temperature, and deeper soil moisture levels (40-200 cm), which underscores the influence of climate on radon variability. Root zone soil moisture and the Normalized Difference Vegetation Index (NDVI) also contribute to predicting radon levels, reflecting the importance of soil and vegetation dynamics in radon emanation. By integrating multiple statistical models, this research provides a nuanced understanding of the predictors of radon concentration, enhancing predictive accuracy and reliability.

Abstract Image

使用机器学习分析识别北卡罗来纳州住宅氡浓度时空变化的预测因子
氡是一种自然产生的放射性气体,由地壳中的铀衰变产生。在美国,氡暴露是导致非吸烟者患肺癌的主要原因。氡通过土壤和建筑地基渗入房屋。本研究通过利用2010年至2020年在北卡罗来纳州收集的126,382个短期(2-7天)氡测试结果的综合数据集,推进了评估住宅氡暴露的方法。结合线性回归和先进的机器学习技术,包括随机森林模型。利用首次测试的氡水平,通过线性回归、线性混合效应模型(LME)和广义加性模型(GAM)进行分析表明,海拔高度、与地质断层的接近程度和土壤湿度是决定氡浓度的关键因素。具体而言,海拔高度与氡水平在各模型中一致显示出正相关关系(线性回归:β=0.12, p<0.001;LME: β=0.17, p < 0.01;GAM: β=0.11, p<0.001)。相反,与地质断层的距离与氡浓度呈负相关(线性回归:β=-0.11, p<0.001;LME: β=-0.06, p < 0.01;GAM: β=-0.07, p<0.001),表明离断层越远,氡水平越低。使用随机森林模型,我们的研究确定了首次测试氡水平的最具影响力的环境预测因子。高程是影响最大的变量,其次是中间瞬时地表压力和上层10cm层的土壤湿度,说明地质和直接地表条件的重要作用。其他重要因素包括降水、平均温度和更深的土壤湿度水平(40-200厘米),这强调了气候对氡变异的影响。根区土壤湿度和归一化植被指数(NDVI)也有助于预测氡水平,反映了土壤和植被动态对氡辐射的重要性。通过整合多个统计模型,本研究提供了对氡浓度预测因子的细致理解,提高了预测的准确性和可靠性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Pollution
Environmental Pollution 环境科学-环境科学
CiteScore
16.00
自引率
6.70%
发文量
2082
审稿时长
2.9 months
期刊介绍: Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health. Subject areas include, but are not limited to: • Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies; • Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change; • Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects; • Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects; • Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest; • New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信