{"title":"A Progressive Sampling based Approach to Reduce Sampling Time","authors":"Nandita Bangera, K. N.","doi":"10.1109/RTEICT46194.2019.9016768","DOIUrl":null,"url":null,"abstract":"Analytics plays vital role in Data Science. It involves finding trends and patterns from the huge repository of data. Scanning huge amount of data consumes lot of time, which can be reduced by sampling. In this paper we have demonstrated effectiveness of Progressive sampling wherein the sample size is gradually increased till it reaches a desired accuracy. By applying an algorithm based on Rademacher average to mine frequent datasets using Progressive sampling, we have shown that the runtime and the sampling time is considerably reduced as compared with static sampling.","PeriodicalId":269385,"journal":{"name":"2019 4th International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RTEICT46194.2019.9016768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Analytics plays vital role in Data Science. It involves finding trends and patterns from the huge repository of data. Scanning huge amount of data consumes lot of time, which can be reduced by sampling. In this paper we have demonstrated effectiveness of Progressive sampling wherein the sample size is gradually increased till it reaches a desired accuracy. By applying an algorithm based on Rademacher average to mine frequent datasets using Progressive sampling, we have shown that the runtime and the sampling time is considerably reduced as compared with static sampling.