Amir Habibdoust, Maryam Seifaddini, Moosa Tatar, Ozgur M Araz, Fernando A Wilson
{"title":"利用谷歌趋势数据和机器学习方法预测加利福尼亚州的 COVID-19 新病例。","authors":"Amir Habibdoust, Maryam Seifaddini, Moosa Tatar, Ozgur M Araz, Fernando A Wilson","doi":"10.1080/17538157.2024.2315246","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Google Trends data can be a valuable source of information for health-related issues such as predicting infectious disease trends.</p><p><strong>Objectives: </strong>To evaluate the accuracy of predicting COVID-19 new cases in California using Google Trends data, we develop and use a GMDH-type neural network model and compare its performance with a LTSM model.</p><p><strong>Methods: </strong>We predicted COVID-19 new cases using Google query data over three periods. Our first period covered March 1, 2020, to July 31, 2020, including the first peak of infection. We also estimated a model from October 1, 2020, to January 7, 2021, including the second wave of COVID-19 and avoiding possible biases from public interest in searching about the new pandemic. In addition, we extended our forecasting period from May 20, 2020, to January 31, 2021, to cover an extended period of time.</p><p><strong>Results: </strong>Our findings show that Google relative search volume (RSV) can be used to accurately predict new COVID-19 cases. We find that among our Google relative search volume terms, \"Fever,\" \"COVID Testing,\" \"Signs of COVID,\" \"COVID Treatment,\" and \"Shortness of Breath\" increase model predictive accuracy.</p><p><strong>Conclusions: </strong>Our findings highlight the value of using data sources providing <i>near</i> real-time data, e.g., Google Trends, to detect trends in COVID-19 cases, in order to supplement and extend existing epidemiological models.</p>","PeriodicalId":101409,"journal":{"name":"Informatics for health & social care","volume":" ","pages":"56-72"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting COVID-19 new cases in California with Google Trends data and a machine learning approach.\",\"authors\":\"Amir Habibdoust, Maryam Seifaddini, Moosa Tatar, Ozgur M Araz, Fernando A Wilson\",\"doi\":\"10.1080/17538157.2024.2315246\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Google Trends data can be a valuable source of information for health-related issues such as predicting infectious disease trends.</p><p><strong>Objectives: </strong>To evaluate the accuracy of predicting COVID-19 new cases in California using Google Trends data, we develop and use a GMDH-type neural network model and compare its performance with a LTSM model.</p><p><strong>Methods: </strong>We predicted COVID-19 new cases using Google query data over three periods. Our first period covered March 1, 2020, to July 31, 2020, including the first peak of infection. We also estimated a model from October 1, 2020, to January 7, 2021, including the second wave of COVID-19 and avoiding possible biases from public interest in searching about the new pandemic. In addition, we extended our forecasting period from May 20, 2020, to January 31, 2021, to cover an extended period of time.</p><p><strong>Results: </strong>Our findings show that Google relative search volume (RSV) can be used to accurately predict new COVID-19 cases. We find that among our Google relative search volume terms, \\\"Fever,\\\" \\\"COVID Testing,\\\" \\\"Signs of COVID,\\\" \\\"COVID Treatment,\\\" and \\\"Shortness of Breath\\\" increase model predictive accuracy.</p><p><strong>Conclusions: </strong>Our findings highlight the value of using data sources providing <i>near</i> real-time data, e.g., Google Trends, to detect trends in COVID-19 cases, in order to supplement and extend existing epidemiological models.</p>\",\"PeriodicalId\":101409,\"journal\":{\"name\":\"Informatics for health & social care\",\"volume\":\" \",\"pages\":\"56-72\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Informatics for health & social care\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/17538157.2024.2315246\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/2/14 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics for health & social care","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17538157.2024.2315246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/14 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
Predicting COVID-19 new cases in California with Google Trends data and a machine learning approach.
Background: Google Trends data can be a valuable source of information for health-related issues such as predicting infectious disease trends.
Objectives: To evaluate the accuracy of predicting COVID-19 new cases in California using Google Trends data, we develop and use a GMDH-type neural network model and compare its performance with a LTSM model.
Methods: We predicted COVID-19 new cases using Google query data over three periods. Our first period covered March 1, 2020, to July 31, 2020, including the first peak of infection. We also estimated a model from October 1, 2020, to January 7, 2021, including the second wave of COVID-19 and avoiding possible biases from public interest in searching about the new pandemic. In addition, we extended our forecasting period from May 20, 2020, to January 31, 2021, to cover an extended period of time.
Results: Our findings show that Google relative search volume (RSV) can be used to accurately predict new COVID-19 cases. We find that among our Google relative search volume terms, "Fever," "COVID Testing," "Signs of COVID," "COVID Treatment," and "Shortness of Breath" increase model predictive accuracy.
Conclusions: Our findings highlight the value of using data sources providing near real-time data, e.g., Google Trends, to detect trends in COVID-19 cases, in order to supplement and extend existing epidemiological models.