N. Adeboye, Ilaro Federal Polytechnic, Oyedunsi Olayiwola
{"title":"Big Data Affluence in Statistics Application: A Comparison of Real Life and Simulated Open Data","authors":"N. Adeboye, Ilaro Federal Polytechnic, Oyedunsi Olayiwola","doi":"10.52041/IASE.20102","DOIUrl":null,"url":null,"abstract":"Large data repositories or database management still remain a mirage and tough challenge to accomplish by most developing countries and establishments around the globe. This necessitates the need to improvise on the gathering of suitable data with a good spread to serve as a complement, in the absence of sufficient real-life data. Statisticians are increasingly posed with thought-provoking and even paradoxical questions, challenging our qualifications for entering the statistical paradises created by Big Data. Through classroom activities that involved both sourced real-life and simulated big data in R-environment, models were built and estimates obtained from the adopted techniques revealed the robustness of simulated datasets in a unified observation with improved significant values as reflected in the results. Students appreciated the use of such big data as it enhances their machine learning ability and the availability of sufficient data without delay.","PeriodicalId":448781,"journal":{"name":"New Skills in the Changing World of Statistics Education","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Skills in the Changing World of Statistics Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52041/IASE.20102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Large data repositories or database management still remain a mirage and tough challenge to accomplish by most developing countries and establishments around the globe. This necessitates the need to improvise on the gathering of suitable data with a good spread to serve as a complement, in the absence of sufficient real-life data. Statisticians are increasingly posed with thought-provoking and even paradoxical questions, challenging our qualifications for entering the statistical paradises created by Big Data. Through classroom activities that involved both sourced real-life and simulated big data in R-environment, models were built and estimates obtained from the adopted techniques revealed the robustness of simulated datasets in a unified observation with improved significant values as reflected in the results. Students appreciated the use of such big data as it enhances their machine learning ability and the availability of sufficient data without delay.