Nasy’an Taufiq Al Ghifari, Ary Setijadi Prihatmanto, Rifki Wijaya, Rahadian Yusuf
{"title":"Data Quality Measures and Data Cleaning for Pattern Analysis Angkot Transportation in Bandung City","authors":"Nasy’an Taufiq Al Ghifari, Ary Setijadi Prihatmanto, Rifki Wijaya, Rahadian Yusuf","doi":"10.1109/ICoSTA48221.2020.1570613756","DOIUrl":null,"url":null,"abstract":"Detecting and repairing ‘dirty’ data is one of the perennial challenges in data analytics, and failure to do so can result in inaccurate analytics and unreliable decisions. To detect errors at an early stage and handle them efficiently, it is necessary to determine steps for cleaning and improving data quality. The data used in this study are data collected from previous studies. Data is collected through two sources, namely the Angkot mobile application and the GPS tracker microcontroller module. Some data cleaning tasks here are performed for geospatial data types. This paper provides an overview of data cleaning problems, data quality, cleaning approaches and requirements for public transportation pattern analysis.","PeriodicalId":375166,"journal":{"name":"2020 International Conference on Smart Technology and Applications (ICoSTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Smart Technology and Applications (ICoSTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoSTA48221.2020.1570613756","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Detecting and repairing ‘dirty’ data is one of the perennial challenges in data analytics, and failure to do so can result in inaccurate analytics and unreliable decisions. To detect errors at an early stage and handle them efficiently, it is necessary to determine steps for cleaning and improving data quality. The data used in this study are data collected from previous studies. Data is collected through two sources, namely the Angkot mobile application and the GPS tracker microcontroller module. Some data cleaning tasks here are performed for geospatial data types. This paper provides an overview of data cleaning problems, data quality, cleaning approaches and requirements for public transportation pattern analysis.