Big Data Affluence in Statistics Application: A Comparison of Real Life and Simulated Open Data

N. Adeboye, Ilaro Federal Polytechnic, Oyedunsi Olayiwola
{"title":"Big Data Affluence in Statistics Application: A Comparison of Real Life and Simulated Open Data","authors":"N. Adeboye, Ilaro Federal Polytechnic, Oyedunsi Olayiwola","doi":"10.52041/IASE.20102","DOIUrl":null,"url":null,"abstract":"Large data repositories or database management still remain a mirage and tough challenge to accomplish by most developing countries and establishments around the globe. This necessitates the need to improvise on the gathering of suitable data with a good spread to serve as a complement, in the absence of sufficient real-life data. Statisticians are increasingly posed with thought-provoking and even paradoxical questions, challenging our qualifications for entering the statistical paradises created by Big Data. Through classroom activities that involved both sourced real-life and simulated big data in R-environment, models were built and estimates obtained from the adopted techniques revealed the robustness of simulated datasets in a unified observation with improved significant values as reflected in the results. Students appreciated the use of such big data as it enhances their machine learning ability and the availability of sufficient data without delay.","PeriodicalId":448781,"journal":{"name":"New Skills in the Changing World of Statistics Education","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Skills in the Changing World of Statistics Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52041/IASE.20102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Large data repositories or database management still remain a mirage and tough challenge to accomplish by most developing countries and establishments around the globe. This necessitates the need to improvise on the gathering of suitable data with a good spread to serve as a complement, in the absence of sufficient real-life data. Statisticians are increasingly posed with thought-provoking and even paradoxical questions, challenging our qualifications for entering the statistical paradises created by Big Data. Through classroom activities that involved both sourced real-life and simulated big data in R-environment, models were built and estimates obtained from the adopted techniques revealed the robustness of simulated datasets in a unified observation with improved significant values as reflected in the results. Students appreciated the use of such big data as it enhances their machine learning ability and the availability of sufficient data without delay.
统计应用中的大数据影响:现实生活和模拟开放数据的比较
大型数据存储库或数据库管理仍然是世界上大多数发展中国家和机构难以实现的海市蜃楼和艰巨挑战。这就需要在缺乏足够的实际数据的情况下,临时收集具有良好传播范围的适当数据作为补充。统计学家面临着越来越多发人深省甚至矛盾的问题,挑战着我们进入大数据创造的统计天堂的资格。通过课堂活动,在R-environment中建立了真实的大数据和模拟的大数据,通过所采用的技术建立了模型,并获得了估计,结果表明模拟数据集在统一观测中的鲁棒性得到了改善,显著值得到了提高。学生们很欣赏这种大数据的使用,因为它增强了他们的机器学习能力,并且可以及时获得足够的数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信