{"title":"Forecasting Influenza Levels Using Real-Time Social Media Streams","authors":"Kathy Lee, Ankit Agrawal, A. Choudhary","doi":"10.1109/ICHI.2017.68","DOIUrl":null,"url":null,"abstract":"Seasonal influenza is a contagious respiratory illness that can cause various complications, worsen chronic illnesses, and sometimes lead to deaths. During 2009 H1N1 flu pandemic, up to 203,000 deaths occurred worldwide. Early detection and prediction of disease outbreak is critical because it can provide more time to prepare a response and significantly reduce the impact caused by a pandemic. The traditional influenza surveillance system by Centers for Disease Control and Prevention (CDC) collects U.S. Influenza-Like Illness related physicians visits data from sentinel practices and provides a retrospective analysis delayed by two weeks. Google Flu Trends proposed a method that uses online search queries data to estimate current (real-time) influenza activity. Here we present a system that (1) predicts future influenza activities, (2) provides more accurate real-time assessment than before, and (3) combines real-time big social media data streams and CDC historical datasets for predictive models to accomplish accurate predictions. Although retrospective analysis and observations are important, prediction of future flu levels can represent a big leap because such predictions provide actionable insights for public health that can be used for planning, resource allocation, treatments and prevention. Thus, compared to previous work, our work represents an advancement in accuracy of assessments, prediction of future flu activity accurately and an ability to combine big social data and observed CDC data to build predictive models.","PeriodicalId":263611,"journal":{"name":"2017 IEEE International Conference on Healthcare Informatics (ICHI)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"47","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Healthcare Informatics (ICHI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICHI.2017.68","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 47
Abstract
Seasonal influenza is a contagious respiratory illness that can cause various complications, worsen chronic illnesses, and sometimes lead to deaths. During 2009 H1N1 flu pandemic, up to 203,000 deaths occurred worldwide. Early detection and prediction of disease outbreak is critical because it can provide more time to prepare a response and significantly reduce the impact caused by a pandemic. The traditional influenza surveillance system by Centers for Disease Control and Prevention (CDC) collects U.S. Influenza-Like Illness related physicians visits data from sentinel practices and provides a retrospective analysis delayed by two weeks. Google Flu Trends proposed a method that uses online search queries data to estimate current (real-time) influenza activity. Here we present a system that (1) predicts future influenza activities, (2) provides more accurate real-time assessment than before, and (3) combines real-time big social media data streams and CDC historical datasets for predictive models to accomplish accurate predictions. Although retrospective analysis and observations are important, prediction of future flu levels can represent a big leap because such predictions provide actionable insights for public health that can be used for planning, resource allocation, treatments and prevention. Thus, compared to previous work, our work represents an advancement in accuracy of assessments, prediction of future flu activity accurately and an ability to combine big social data and observed CDC data to build predictive models.