{"title":"Performance Evaluation for Class Center-Based Missing Data Imputation Algorithm","authors":"Heru Nugroho, N. P. Utama, K. Surendro","doi":"10.1145/3384544.3384575","DOIUrl":null,"url":null,"abstract":"The imputation method should be able to reproduce the actual values in the data or Predictive Accuracy (PAC) and maintaining the distribution of these values or Distributional Accuracy (DAC). However, in most studies, evaluation of imputation performance was measured based on classification accuracy. On classification issues, class center-based methods for missing data imputation are developed and outperform other methods for numeric and mixed data types. This paper will be evaluated the accuracy of class center-based methods for missing data imputation, which has been modified by considering the correlation between attributes. A class center-based method for missing data imputation produces an average value of r is 0.96, with the lowest average value for MSE and DKS is 0.04 and 0.03. This result shows that the imputation method is more efficient and can maintain the actual data value distribution.","PeriodicalId":200246,"journal":{"name":"Proceedings of the 2020 9th International Conference on Software and Computer Applications","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 9th International Conference on Software and Computer Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3384544.3384575","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The imputation method should be able to reproduce the actual values in the data or Predictive Accuracy (PAC) and maintaining the distribution of these values or Distributional Accuracy (DAC). However, in most studies, evaluation of imputation performance was measured based on classification accuracy. On classification issues, class center-based methods for missing data imputation are developed and outperform other methods for numeric and mixed data types. This paper will be evaluated the accuracy of class center-based methods for missing data imputation, which has been modified by considering the correlation between attributes. A class center-based method for missing data imputation produces an average value of r is 0.96, with the lowest average value for MSE and DKS is 0.04 and 0.03. This result shows that the imputation method is more efficient and can maintain the actual data value distribution.