Unbiased estimates using temporally aggregated outcome data in time series analysis: generalization to different outcomes, exposures and types of aggregation.

IF 4.4 2区医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH

Epidemiology Pub Date : 2025-10-02 DOI:10.1097/EDE.0000000000001923

Xavier Basagaña, Joan Ballester

{"title":"Unbiased estimates using temporally aggregated outcome data in time series analysis: generalization to different outcomes, exposures and types of aggregation.","authors":"Xavier Basagaña, Joan Ballester","doi":"10.1097/EDE.0000000000001923","DOIUrl":null,"url":null,"abstract":"Background: A new method for time series analysis was recently formulated and implemented that uses temporally aggregated outcome data to generate unbiased estimates of the underlying association between temporally disaggregated outcome and covariate data. However, the performance of the method was only tested in the context of the delayed nonlinear relation between temperature and mortality, and only in the case of the aggregation of sets of consecutive days.Methods: We conduct a simulation analysis to test the performance of the method using (i) mortality and hospital admissions as health outcomes, (ii) temperature and nitrogen dioxide as exposures, and (iii) the three aggregation schemes most widely used in open access health data, including aggregations of sets of non-consecutive days.Results: With sufficient data for analysis, the method can recover the underlying association for all combinations of outcomes, exposures, and aggregation schemes. The bias and variability of the estimates increase with the degree of aggregation of the outcome data, and they decrease with increasing sample size (length of dataset, number of cases). Remarkably, estimates are also unbiased even in extreme cases with weekly outcome data in an association confounded by the day of the week, such as those of air pollution models.Conclusions: With sufficient data, the method is able to flexibly generate unbiased estimates, generalizing previous results to other outcomes, exposures and types and degrees of aggregation. Such results can boost the use of available temporally aggregated health data for research, translation, and policymaking, especially in low-resource and rural areas.","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/EDE.0000000000001923","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 0

Abstract

Background: A new method for time series analysis was recently formulated and implemented that uses temporally aggregated outcome data to generate unbiased estimates of the underlying association between temporally disaggregated outcome and covariate data. However, the performance of the method was only tested in the context of the delayed nonlinear relation between temperature and mortality, and only in the case of the aggregation of sets of consecutive days.

Methods: We conduct a simulation analysis to test the performance of the method using (i) mortality and hospital admissions as health outcomes, (ii) temperature and nitrogen dioxide as exposures, and (iii) the three aggregation schemes most widely used in open access health data, including aggregations of sets of non-consecutive days.

Results: With sufficient data for analysis, the method can recover the underlying association for all combinations of outcomes, exposures, and aggregation schemes. The bias and variability of the estimates increase with the degree of aggregation of the outcome data, and they decrease with increasing sample size (length of dataset, number of cases). Remarkably, estimates are also unbiased even in extreme cases with weekly outcome data in an association confounded by the day of the week, such as those of air pollution models.

Conclusions: With sufficient data, the method is able to flexibly generate unbiased estimates, generalizing previous results to other outcomes, exposures and types and degrees of aggregation. Such results can boost the use of available temporally aggregated health data for research, translation, and policymaking, especially in low-resource and rural areas.

查看原文本刊更多论文

在时间序列分析中使用临时汇总结果数据的无偏估计：对不同结果、暴露和汇总类型的概化。

背景：最近制定并实施了一种新的时间序列分析方法，该方法使用时间汇总结果数据来生成时间分解结果和协变量数据之间潜在关联的无偏估计。然而，该方法的性能仅在温度与死亡率之间的延迟非线性关系的背景下进行了测试，并且仅在连续天数集合的情况下进行了测试。方法：我们使用(i)死亡率和住院率作为健康结果，（ii）温度和二氧化氮作为暴露，以及（iii）开放获取健康数据中最广泛使用的三种聚合方案（包括非连续天数的集合）进行模拟分析，以测试该方法的性能。结果：有了足够的分析数据，该方法可以恢复所有结果、暴露和汇总方案组合的潜在关联。估计的偏差和变异性随结果数据的聚集程度而增加，随样本量（数据集长度、病例数）的增加而减少。值得注意的是，即使在极端情况下，每周的结果数据与一周中的某一天混淆在一起，比如空气污染模型，估计也是无偏的。结论：在数据充足的情况下，该方法能够灵活地产生无偏估计，将以前的结果推广到其他结果、暴露和聚集类型和程度。这些结果可以促进利用现有的临时汇总卫生数据进行研究、翻译和决策，特别是在资源匮乏和农村地区。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Epidemiology 医学-公共卫生、环境卫生与职业卫生

CiteScore

6.70

自引率

3.70%

发文量

177

审稿时长

6-12 weeks

期刊介绍： Epidemiology publishes original research from all fields of epidemiology. The journal also welcomes review articles and meta-analyses, novel hypotheses, descriptions and applications of new methods, and discussions of research theory or public health policy. We give special consideration to papers from developing countries.