{"title":"Building Effective and Efficient Procedure for Preprocessing Marketplace Data","authors":"Usman Bustaman, Dhiar Niken Larasati, Zulfa Hidayah Satria Putri, Siti Mariyah, Takdir, S. Pramana","doi":"10.1109/ICITEE49829.2020.9271717","DOIUrl":null,"url":null,"abstract":"Rapid development of digitalization have enforced National Statistics Offices to utilize big data as one of new sources for producing official statistics. An alternative source is marketplace data which now growing rapidly. Many challenges exist for transforming these massive datasets into statistics for public policy. This paper aims to explain the challenges of analyzing marketplace data and building effective and efficient preprocessing procedure to analyses big data which can be used for public policy. An optimal pipeline for preprocessing including validating, cleaning and aggregating marketplace data have been developed.","PeriodicalId":245013,"journal":{"name":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEE49829.2020.9271717","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Rapid development of digitalization have enforced National Statistics Offices to utilize big data as one of new sources for producing official statistics. An alternative source is marketplace data which now growing rapidly. Many challenges exist for transforming these massive datasets into statistics for public policy. This paper aims to explain the challenges of analyzing marketplace data and building effective and efficient preprocessing procedure to analyses big data which can be used for public policy. An optimal pipeline for preprocessing including validating, cleaning and aggregating marketplace data have been developed.