{"title":"Missing values imputation hypothesis: An experimental evaluation","authors":"Huaxiong Li, Xianzhong Zhou, Yiyu Yao","doi":"10.1109/COGINF.2009.5250727","DOIUrl":null,"url":null,"abstract":"Missing values imputation is a basic strategy to deal with incomplete data. Many developed methods treat filled-in values as if they are original data. The correctness of such hypothesis has not been widely studied. In this paper, a philosophical and experimental study on the hypothesis of missing values imputation is discussed. In the experiments, classification accuracy of three learning algorithms with regard to six incomplete data sets are compared, which indicates that missing values imputation may not always help to improve the learning performance. Learning directly from incomplete data without imputation may reach a satisfying performance. The study not only provides an experimental analysis on missing values imputation, but also presents a new view on rule induction from incomplete data, which is much different from previous standpoint.","PeriodicalId":420853,"journal":{"name":"2009 8th IEEE International Conference on Cognitive Informatics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 8th IEEE International Conference on Cognitive Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COGINF.2009.5250727","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Missing values imputation is a basic strategy to deal with incomplete data. Many developed methods treat filled-in values as if they are original data. The correctness of such hypothesis has not been widely studied. In this paper, a philosophical and experimental study on the hypothesis of missing values imputation is discussed. In the experiments, classification accuracy of three learning algorithms with regard to six incomplete data sets are compared, which indicates that missing values imputation may not always help to improve the learning performance. Learning directly from incomplete data without imputation may reach a satisfying performance. The study not only provides an experimental analysis on missing values imputation, but also presents a new view on rule induction from incomplete data, which is much different from previous standpoint.