{"title":"统计数据驱动的建模与预测:在COVID-19大流行中的应用","authors":"Shalabh, Subhra Sankar Dhar, Sabara Parshad Rajeshbhai","doi":"10.1007/s40745-024-00583-8","DOIUrl":null,"url":null,"abstract":"<div><p>One of the key objectives of statistics is to provide a model compatible with the data generated by an unknown random process. Often, it happens that the unknown process is intractable, and no prior data or information associated with the unknown process is available. Under such circumstances, well-known techniques like regression modelling techniques may not work. As a result, an alternative approach may be to observe the general features of the process from the available data. Afterward, a suitable statistical distribution, like a mixture of certain distributions, can be fitted to the existing available data, and future observations can be predicted using this fitting. For example, one may consider the prediction related to the COVID-19 pandemic. As it occurred for the first time, no prior data was available to apprehend the behaviour and progression of the COVID-19 pandemic. For such cases, a data-based statistical modelling procedure can be adopted to predict future occurrences based on a small data set. This article presents such an application-oriented, data-based statistical modelling procedure with an implementation on the COVID-19 data. The proposed procedure can be used for a wide range of modelling and forecasting of future events.</p></div>","PeriodicalId":36280,"journal":{"name":"Annals of Data Science","volume":"12 5","pages":"1747 - 1770"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Statistical Data-Driven Modelling and Forecasting: An Application to COVID-19 Pandemic\",\"authors\":\"Shalabh, Subhra Sankar Dhar, Sabara Parshad Rajeshbhai\",\"doi\":\"10.1007/s40745-024-00583-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>One of the key objectives of statistics is to provide a model compatible with the data generated by an unknown random process. Often, it happens that the unknown process is intractable, and no prior data or information associated with the unknown process is available. Under such circumstances, well-known techniques like regression modelling techniques may not work. As a result, an alternative approach may be to observe the general features of the process from the available data. Afterward, a suitable statistical distribution, like a mixture of certain distributions, can be fitted to the existing available data, and future observations can be predicted using this fitting. For example, one may consider the prediction related to the COVID-19 pandemic. As it occurred for the first time, no prior data was available to apprehend the behaviour and progression of the COVID-19 pandemic. For such cases, a data-based statistical modelling procedure can be adopted to predict future occurrences based on a small data set. This article presents such an application-oriented, data-based statistical modelling procedure with an implementation on the COVID-19 data. The proposed procedure can be used for a wide range of modelling and forecasting of future events.</p></div>\",\"PeriodicalId\":36280,\"journal\":{\"name\":\"Annals of Data Science\",\"volume\":\"12 5\",\"pages\":\"1747 - 1770\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s40745-024-00583-8\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Decision Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Data Science","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s40745-024-00583-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Decision Sciences","Score":null,"Total":0}
Statistical Data-Driven Modelling and Forecasting: An Application to COVID-19 Pandemic
One of the key objectives of statistics is to provide a model compatible with the data generated by an unknown random process. Often, it happens that the unknown process is intractable, and no prior data or information associated with the unknown process is available. Under such circumstances, well-known techniques like regression modelling techniques may not work. As a result, an alternative approach may be to observe the general features of the process from the available data. Afterward, a suitable statistical distribution, like a mixture of certain distributions, can be fitted to the existing available data, and future observations can be predicted using this fitting. For example, one may consider the prediction related to the COVID-19 pandemic. As it occurred for the first time, no prior data was available to apprehend the behaviour and progression of the COVID-19 pandemic. For such cases, a data-based statistical modelling procedure can be adopted to predict future occurrences based on a small data set. This article presents such an application-oriented, data-based statistical modelling procedure with an implementation on the COVID-19 data. The proposed procedure can be used for a wide range of modelling and forecasting of future events.
期刊介绍:
Annals of Data Science (ADS) publishes cutting-edge research findings, experimental results and case studies of data science. Although Data Science is regarded as an interdisciplinary field of using mathematics, statistics, databases, data mining, high-performance computing, knowledge management and virtualization to discover knowledge from Big Data, it should have its own scientific contents, such as axioms, laws and rules, which are fundamentally important for experts in different fields to explore their own interests from Big Data. ADS encourages contributors to address such challenging problems at this exchange platform. At present, how to discover knowledge from heterogeneous data under Big Data environment needs to be addressed. ADS is a series of volumes edited by either the editorial office or guest editors. Guest editors will be responsible for call-for-papers and the review process for high-quality contributions in their volumes.