Cao Truong Tran, Mengjie Zhang, Peter M. Andreae, Bing Xue, L. Bui
{"title":"不完整数据的基于规则的分类器集合","authors":"Cao Truong Tran, Mengjie Zhang, Peter M. Andreae, Bing Xue, L. Bui","doi":"10.1109/IESYS.2017.8233553","DOIUrl":null,"url":null,"abstract":"Many real-world datasets suffer from the problem of missing values. Imputation which replaces missing values with plausible values is a major method for classification with data containing missing values. However, powerful imputation methods including multiple imputation are usually computationally intensive for estimating missing values in unseen incomplete instances. Rule-based classification algorithms have been widely used in data mining, but the majority of them are not able to directly work with data containing missing values. This paper proposes an approach to effectively combining multiple imputation, feature selection and rule-based classification to construct a set of classifiers, which can be used to classify any incomplete instance without requiring imputation. Empirical results show that the method not only can be more accurate than other common methods, but can also be faster to classify new instances than the other methods.","PeriodicalId":429982,"journal":{"name":"2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES)","volume":"154 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"An ensemble of rule-based classifiers for incomplete data\",\"authors\":\"Cao Truong Tran, Mengjie Zhang, Peter M. Andreae, Bing Xue, L. Bui\",\"doi\":\"10.1109/IESYS.2017.8233553\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many real-world datasets suffer from the problem of missing values. Imputation which replaces missing values with plausible values is a major method for classification with data containing missing values. However, powerful imputation methods including multiple imputation are usually computationally intensive for estimating missing values in unseen incomplete instances. Rule-based classification algorithms have been widely used in data mining, but the majority of them are not able to directly work with data containing missing values. This paper proposes an approach to effectively combining multiple imputation, feature selection and rule-based classification to construct a set of classifiers, which can be used to classify any incomplete instance without requiring imputation. Empirical results show that the method not only can be more accurate than other common methods, but can also be faster to classify new instances than the other methods.\",\"PeriodicalId\":429982,\"journal\":{\"name\":\"2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES)\",\"volume\":\"154 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IESYS.2017.8233553\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IESYS.2017.8233553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An ensemble of rule-based classifiers for incomplete data
Many real-world datasets suffer from the problem of missing values. Imputation which replaces missing values with plausible values is a major method for classification with data containing missing values. However, powerful imputation methods including multiple imputation are usually computationally intensive for estimating missing values in unseen incomplete instances. Rule-based classification algorithms have been widely used in data mining, but the majority of them are not able to directly work with data containing missing values. This paper proposes an approach to effectively combining multiple imputation, feature selection and rule-based classification to construct a set of classifiers, which can be used to classify any incomplete instance without requiring imputation. Empirical results show that the method not only can be more accurate than other common methods, but can also be faster to classify new instances than the other methods.