Bias in mobility datasets drives divergence in modeled outbreak dynamics.

IF 5.4 Q1 MEDICINE, RESEARCH & EXPERIMENTAL
Taylor Chin, Michael A Johansson, Anir Chowdhury, Shayan Chowdhury, Kawsar Hosan, Md Tanvir Quader, Caroline O Buckee, Ayesha S Mahmud
{"title":"Bias in mobility datasets drives divergence in modeled outbreak dynamics.","authors":"Taylor Chin, Michael A Johansson, Anir Chowdhury, Shayan Chowdhury, Kawsar Hosan, Md Tanvir Quader, Caroline O Buckee, Ayesha S Mahmud","doi":"10.1038/s43856-024-00714-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Digital data sources such as mobile phone call detail records (CDRs) are increasingly being used to estimate population mobility fluxes and to predict the spatiotemporal dynamics of infectious disease outbreaks. Differences in mobile phone operators' geographic coverage, however, may result in biased mobility estimates.</p><p><strong>Methods: </strong>We leverage a unique dataset consisting of CDRs from three mobile phone operators in Bangladesh and digital trace data from Meta's Data for Good program to compare mobility patterns across these sources. We use a metapopulation model to compare the sources' effects on simulated outbreak trajectories, and compare results with a benchmark model with data from all three operators, representing around 100 million subscribers across the country.</p><p><strong>Results: </strong>We show that mobility sources can vary significantly in their coverage of travel routes and geographic mobility patterns. Differences in projected outbreak dynamics are more pronounced at finer spatial scales, especially if the outbreak is seeded in smaller and/or geographically isolated regions. In some instances, a simple diffusion (gravity) model was better able to capture the timing and spatial spread of the outbreak compared to the sparser mobility sources.</p><p><strong>Conclusions: </strong>Our results highlight the potential biases in predicted outbreak dynamics from a metapopulation model parameterized with non-population representative data, and the limits to the generalizability of models built on these types of novel human behavioral data.</p>","PeriodicalId":72646,"journal":{"name":"Communications medicine","volume":"5 1","pages":"8"},"PeriodicalIF":5.4000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11706981/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s43856-024-00714-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Digital data sources such as mobile phone call detail records (CDRs) are increasingly being used to estimate population mobility fluxes and to predict the spatiotemporal dynamics of infectious disease outbreaks. Differences in mobile phone operators' geographic coverage, however, may result in biased mobility estimates.

Methods: We leverage a unique dataset consisting of CDRs from three mobile phone operators in Bangladesh and digital trace data from Meta's Data for Good program to compare mobility patterns across these sources. We use a metapopulation model to compare the sources' effects on simulated outbreak trajectories, and compare results with a benchmark model with data from all three operators, representing around 100 million subscribers across the country.

Results: We show that mobility sources can vary significantly in their coverage of travel routes and geographic mobility patterns. Differences in projected outbreak dynamics are more pronounced at finer spatial scales, especially if the outbreak is seeded in smaller and/or geographically isolated regions. In some instances, a simple diffusion (gravity) model was better able to capture the timing and spatial spread of the outbreak compared to the sparser mobility sources.

Conclusions: Our results highlight the potential biases in predicted outbreak dynamics from a metapopulation model parameterized with non-population representative data, and the limits to the generalizability of models built on these types of novel human behavioral data.

流动性数据集的偏差导致暴发动力学模型出现分歧。
背景:移动电话通话详细记录(cdr)等数字数据源正越来越多地用于估计人口流动通量和预测传染病暴发的时空动态。然而,移动电话运营商地理覆盖范围的差异可能会导致移动估计的偏差。方法:我们利用一个独特的数据集,该数据集由孟加拉国三家移动电话运营商的话单组成,并利用Meta的“数据为好”项目的数字跟踪数据来比较这些来源的移动模式。我们使用一个元人口模型来比较传染源对模拟爆发轨迹的影响,并将结果与包含所有三家运营商数据的基准模型进行比较,这些运营商代表了全国约1亿用户。结果:我们发现流动来源在其旅行路线和地理流动模式的覆盖范围上存在显著差异。预测的疫情动态差异在更小的空间尺度上更为明显,特别是在较小和/或地理上孤立的地区爆发疫情时。在某些情况下,与更稀疏的流动源相比,简单的扩散(重力)模型能够更好地捕捉爆发的时间和空间传播。结论:我们的研究结果强调了用非种群代表性数据参数化的元种群模型预测疫情动态的潜在偏差,以及基于这些新型人类行为数据建立的模型的推广局限性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信