{"title":"缺失值归算假说:一个实验评估","authors":"Huaxiong Li, Xianzhong Zhou, Yiyu Yao","doi":"10.1109/COGINF.2009.5250727","DOIUrl":null,"url":null,"abstract":"Missing values imputation is a basic strategy to deal with incomplete data. Many developed methods treat filled-in values as if they are original data. The correctness of such hypothesis has not been widely studied. In this paper, a philosophical and experimental study on the hypothesis of missing values imputation is discussed. In the experiments, classification accuracy of three learning algorithms with regard to six incomplete data sets are compared, which indicates that missing values imputation may not always help to improve the learning performance. Learning directly from incomplete data without imputation may reach a satisfying performance. The study not only provides an experimental analysis on missing values imputation, but also presents a new view on rule induction from incomplete data, which is much different from previous standpoint.","PeriodicalId":420853,"journal":{"name":"2009 8th IEEE International Conference on Cognitive Informatics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Missing values imputation hypothesis: An experimental evaluation\",\"authors\":\"Huaxiong Li, Xianzhong Zhou, Yiyu Yao\",\"doi\":\"10.1109/COGINF.2009.5250727\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Missing values imputation is a basic strategy to deal with incomplete data. Many developed methods treat filled-in values as if they are original data. The correctness of such hypothesis has not been widely studied. In this paper, a philosophical and experimental study on the hypothesis of missing values imputation is discussed. In the experiments, classification accuracy of three learning algorithms with regard to six incomplete data sets are compared, which indicates that missing values imputation may not always help to improve the learning performance. Learning directly from incomplete data without imputation may reach a satisfying performance. The study not only provides an experimental analysis on missing values imputation, but also presents a new view on rule induction from incomplete data, which is much different from previous standpoint.\",\"PeriodicalId\":420853,\"journal\":{\"name\":\"2009 8th IEEE International Conference on Cognitive Informatics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 8th IEEE International Conference on Cognitive Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COGINF.2009.5250727\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 8th IEEE International Conference on Cognitive Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COGINF.2009.5250727","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Missing values imputation hypothesis: An experimental evaluation
Missing values imputation is a basic strategy to deal with incomplete data. Many developed methods treat filled-in values as if they are original data. The correctness of such hypothesis has not been widely studied. In this paper, a philosophical and experimental study on the hypothesis of missing values imputation is discussed. In the experiments, classification accuracy of three learning algorithms with regard to six incomplete data sets are compared, which indicates that missing values imputation may not always help to improve the learning performance. Learning directly from incomplete data without imputation may reach a satisfying performance. The study not only provides an experimental analysis on missing values imputation, but also presents a new view on rule induction from incomplete data, which is much different from previous standpoint.