Ebola data from the Internet: An Opportunity for Syndromic Surveillance or a News Event?

E. Yom-Tov
{"title":"Ebola data from the Internet: An Opportunity for Syndromic Surveillance or a News Event?","authors":"E. Yom-Tov","doi":"10.1145/2750511.2750512","DOIUrl":null,"url":null,"abstract":"Syndromic surveillance refers to the analysis of medical information for the purpose of detecting outbreaks of disease earlier than would have been possible otherwise and to estimate the prevalence of the disease in a population. Internet data, especially search engine queries and social media postings, have shown promise in contributing to syndromic surveillance for influenza and dengue fever. Here we focus on the recent outbreak of Ebola Virus Disease and ask whether three major sources of Internet data could have been used for early detection of the outbreak and for its ongoing monitoring. We analyze queries submitted to the Bing search engine, postings made by people using Twitter, and news articles in mainstream media, all collected from both the main infected countries in Africa and from across the world between November 2013 and October 2014. Our results indicate that it is unlikely any of the three sources would have provided an alert more than a week before the official announcement of the World Health Organization. Furthermore, over time, the number of Twitter messages and Bing queries related to Ebola are better correlated with the number of news articles than with the number of cases of the disease, even in the most affected countries. Information sought by users was predominantly from news sites and Wikipedia, and exhibited temporal patterns similar to those typical of news events. Thus, it is likely that the majority of Internet data about Ebola stems from news-like interest, not from information needs of people with Ebola. We discuss the differences between the current Ebola outbreak and seasonal influenza with respect to syndromic surveillance, and suggest further research is needed to understand where Internet data can assist in surveillance, and where it cannot.","PeriodicalId":91246,"journal":{"name":"DH'15: proceedings of the 5th International Conference on Digital Health 2015 : May 18-20, 2015, Florence, Italy. International Conference on Digital Health (5th : 2015 : Florence, Italy)","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2015-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DH'15: proceedings of the 5th International Conference on Digital Health 2015 : May 18-20, 2015, Florence, Italy. International Conference on Digital Health (5th : 2015 : Florence, Italy)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2750511.2750512","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

Abstract

Syndromic surveillance refers to the analysis of medical information for the purpose of detecting outbreaks of disease earlier than would have been possible otherwise and to estimate the prevalence of the disease in a population. Internet data, especially search engine queries and social media postings, have shown promise in contributing to syndromic surveillance for influenza and dengue fever. Here we focus on the recent outbreak of Ebola Virus Disease and ask whether three major sources of Internet data could have been used for early detection of the outbreak and for its ongoing monitoring. We analyze queries submitted to the Bing search engine, postings made by people using Twitter, and news articles in mainstream media, all collected from both the main infected countries in Africa and from across the world between November 2013 and October 2014. Our results indicate that it is unlikely any of the three sources would have provided an alert more than a week before the official announcement of the World Health Organization. Furthermore, over time, the number of Twitter messages and Bing queries related to Ebola are better correlated with the number of news articles than with the number of cases of the disease, even in the most affected countries. Information sought by users was predominantly from news sites and Wikipedia, and exhibited temporal patterns similar to those typical of news events. Thus, it is likely that the majority of Internet data about Ebola stems from news-like interest, not from information needs of people with Ebola. We discuss the differences between the current Ebola outbreak and seasonal influenza with respect to syndromic surveillance, and suggest further research is needed to understand where Internet data can assist in surveillance, and where it cannot.
来自互联网的埃博拉数据:综合征监测的机会还是新闻事件?
综合症监测是指分析医疗信息,以便及早发现疾病的爆发,并估计疾病在人群中的流行情况。互联网数据,特别是搜索引擎查询和社交媒体帖子,在促进流感和登革热综合征监测方面显示出了希望。在此,我们重点关注最近爆发的埃博拉病毒病,并询问是否可以使用三个主要的互联网数据来源来早期发现疫情并对其进行持续监测。我们分析了2013年11月至2014年10月期间提交给必应搜索引擎的查询、人们使用Twitter发布的帖子以及主流媒体的新闻文章,所有这些都来自非洲和世界各地的主要感染国家。我们的结果表明,这三个来源中的任何一个都不太可能在世界卫生组织正式宣布之前一周多的时间里发出警报。此外,随着时间的推移,与埃博拉相关的Twitter消息和必应查询的数量与新闻文章的数量之间的相关性要大于与埃博拉病例的数量之间的相关性,即使在疫情最严重的国家也是如此。用户搜索的信息主要来自新闻网站和维基百科,并且呈现出与典型新闻事件相似的时间模式。因此,大多数关于埃博拉病毒的互联网数据很可能来自类似新闻的兴趣,而不是来自埃博拉病毒感染者的信息需求。我们讨论了当前埃博拉疫情与季节性流感在综合征监测方面的差异,并建议需要进一步研究,以了解互联网数据在哪些方面有助于监测,哪些方面不能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信