Xiaochen Lai, Xin Liu, Liyong Zhang, Chi Lin, M. Obaidat, K. Hsiao
{"title":"Missing Value Imputations by Rule-Based Incomplete Data Fuzzy Modeling","authors":"Xiaochen Lai, Xin Liu, Liyong Zhang, Chi Lin, M. Obaidat, K. Hsiao","doi":"10.1109/ICC.2019.8761052","DOIUrl":null,"url":null,"abstract":"Missing values are a common phenomenon in real-world datasets, which decreases the quality and reliability of data mining. Traditional regression-based imputation method estimates missing values through the relationship between attributes inferred by complete records. In order to describe the relationship more appropriately and make better use of present values, a rule-based incomplete data modeling method is proposed to impute missing values in this paper. The method utilizes incomplete records together with complete records for establishing Takagi-Sugeno (TS) models. In this process, the incomplete dataset is divided into several subsets and the linear functions containing only significant variables are built to describe the relationships between attributes in each subset. Experimental results demonstrate that the proposed method can effectively improve the performance of missing value imputation.","PeriodicalId":402732,"journal":{"name":"ICC 2019 - 2019 IEEE International Conference on Communications (ICC)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICC 2019 - 2019 IEEE International Conference on Communications (ICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICC.2019.8761052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Missing values are a common phenomenon in real-world datasets, which decreases the quality and reliability of data mining. Traditional regression-based imputation method estimates missing values through the relationship between attributes inferred by complete records. In order to describe the relationship more appropriately and make better use of present values, a rule-based incomplete data modeling method is proposed to impute missing values in this paper. The method utilizes incomplete records together with complete records for establishing Takagi-Sugeno (TS) models. In this process, the incomplete dataset is divided into several subsets and the linear functions containing only significant variables are built to describe the relationships between attributes in each subset. Experimental results demonstrate that the proposed method can effectively improve the performance of missing value imputation.