The use and misuse of surrogate variables in environmental epidemiology

Frederick W. Lipfert
{"title":"The use and misuse of surrogate variables in environmental epidemiology","authors":"Frederick W. Lipfert","doi":"10.1002/jem.40","DOIUrl":null,"url":null,"abstract":"<p>This paper discusses some common statistical problems that are often encountered in the specification and interpretation of regression models used in environmental epidemiology; such models have been used to establish new or modified ambient standards intended to protect public health. These statistical problems include: collinearity (identifying the ‘correct’ pollutant), confounding (omission of other variables that may be correlated with both response and putative dose), the ‘ecological fallacy’ (aggregating individual doses and responses over space or time), measurement error (uncertainties in data, applicability and measurement <i>per se</i>) and linearity (identifying curvature or thresholds in dose-response function). These problems occur in both time-series and cross-sectional studies. Although none of these potential problem areas is new, they have rarely been considered together or comprehensively. This paper considers them as specific instances of the general problem of surrogate variables, for which an analytical framework is presented together with some examples of their practical consequences and some guidelines for interpreting environmental epidemiology studies. Findings of the analysis include: single-pollutant regression models are likely to overstate effects; although aggregation results in loss of information, it biases the estimates only when confounding is present; the traditional approaches to correcting for measurement errors implied by the difference between personal exposures and ambient air quality do not apply, but estimates may be based on consideration of the ‘error’ term as an additional source of exposure; it may not be possible to deduce the correct shape of a dose-response function in the presence of measurement error and correlated covariates. These findings are intended to be descriptive rather than definitive; the main purpose is to stimulate the detailed research required to develop practical remedies that would allow epidemiology to be used appropriately in setting environmental standards.Copyright © 1999 John Wiley &amp; Sons, Ltd.</p>","PeriodicalId":100780,"journal":{"name":"Journal of Environmental Medicine","volume":"1 4","pages":"267-278"},"PeriodicalIF":0.0000,"publicationDate":"2001-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/jem.40","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Medicine","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jem.40","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

This paper discusses some common statistical problems that are often encountered in the specification and interpretation of regression models used in environmental epidemiology; such models have been used to establish new or modified ambient standards intended to protect public health. These statistical problems include: collinearity (identifying the ‘correct’ pollutant), confounding (omission of other variables that may be correlated with both response and putative dose), the ‘ecological fallacy’ (aggregating individual doses and responses over space or time), measurement error (uncertainties in data, applicability and measurement per se) and linearity (identifying curvature or thresholds in dose-response function). These problems occur in both time-series and cross-sectional studies. Although none of these potential problem areas is new, they have rarely been considered together or comprehensively. This paper considers them as specific instances of the general problem of surrogate variables, for which an analytical framework is presented together with some examples of their practical consequences and some guidelines for interpreting environmental epidemiology studies. Findings of the analysis include: single-pollutant regression models are likely to overstate effects; although aggregation results in loss of information, it biases the estimates only when confounding is present; the traditional approaches to correcting for measurement errors implied by the difference between personal exposures and ambient air quality do not apply, but estimates may be based on consideration of the ‘error’ term as an additional source of exposure; it may not be possible to deduce the correct shape of a dose-response function in the presence of measurement error and correlated covariates. These findings are intended to be descriptive rather than definitive; the main purpose is to stimulate the detailed research required to develop practical remedies that would allow epidemiology to be used appropriately in setting environmental standards.Copyright © 1999 John Wiley & Sons, Ltd.

环境流行病学中替代变量的使用和误用
本文讨论了环境流行病学中回归模型的规范和解释中经常遇到的一些常见统计问题;此类模型已被用于建立旨在保护公众健康的新的或修改的环境标准。这些统计问题包括:共线性(确定“正确”的污染物)、混淆(遗漏了可能与反应和推定剂量相关的其他变量)、“生态谬误”(在空间或时间上聚合单个剂量和反应)、,测量误差(数据的不确定性、适用性和测量本身)和线性(确定剂量反应函数中的曲率或阈值)。这些问题出现在时间序列和横断面研究中。尽管这些潜在的问题领域都不是新的,但它们很少被一起或全面地考虑。本文将其视为替代变量一般问题的具体实例,并为其提供了一个分析框架,以及其实际后果的一些例子和解释环境流行病学研究的一些指南。分析结果包括:单一污染物回归模型可能夸大了影响;尽管聚合会导致信息丢失,但只有当存在混淆时,它才会使估计产生偏差;校正个人暴露和环境空气质量之间的差异所隐含的测量误差的传统方法不适用,但估计可能基于将“误差”术语视为额外暴露源的考虑;在存在测量误差和相关协变量的情况下,可能不可能推导出剂量反应函数的正确形状。这些发现是描述性的,而不是决定性的;其主要目的是促进所需的详细研究,以开发实用的补救措施,使流行病学能够在制定环境标准时得到适当的应用。版权所有©1999 John Wiley&;有限公司。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信