Approach to Missing Data Recovery
E. Xu, Shaocheng Tong, Y. Wang, Shang Xu, Peng Li
2008 International Symposium on Electronic Commerce and Security
Published: 2008-08-03
DOI: 10.1109/ISECS.2008.79 (https://doi.org/10.1109/ISECS.2008.79)
Citations: 4
Abstract
To recover missing data in an information system, this paper proposes a rough-set-based approach that reduces redundant attributes, discretizes continuous attributes, and fills in the missing values. First, discernible vectors are defined from the indiscernibility relation, and a discernible vector addition rule is used to reduce the attributes. Next, the continuous attributes are discretized using the concept of super-club data together with the entropy of the information table. Finally, interval values and an interval value addition rule are defined from the correspondence between condition attributes and decision attributes, and are used to fill in the incomplete data. An illustrative example and experimental results indicate that the approach is effective and efficient.
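The abstract does not give the paper's definitions of discernible vectors, super-club data, or interval values, so the sketch below illustrates only the standard rough-set notions those steps build on: indiscernibility classes, a simple consistency check for spotting redundant condition attributes, and the Shannon entropy of the decision attribute. The toy table, the attribute names `a`, `b`, `c`, `d`, and all function names are assumptions made for illustration; this is not the authors' algorithm.

```python
from collections import defaultdict
from math import log2

# Hypothetical toy decision table: a, b, c are condition attributes, d is the decision.
TABLE = [
    {"a": 0, "b": 1, "c": 0, "d": "yes"},
    {"a": 0, "b": 1, "c": 1, "d": "yes"},
    {"a": 1, "b": 0, "c": 0, "d": "no"},
    {"a": 1, "b": 1, "c": 1, "d": "no"},
]


def indiscernibility_classes(rows, attrs):
    """Group row indices whose values agree on every attribute in attrs."""
    classes = defaultdict(list)
    for i, row in enumerate(rows):
        classes[tuple(row[a] for a in attrs)].append(i)
    return sorted(classes.values())


def is_consistent(rows, cond_attrs, decision):
    """True if rows that are indiscernible on cond_attrs share one decision value."""
    return all(
        len({rows[i][decision] for i in block}) == 1
        for block in indiscernibility_classes(rows, cond_attrs)
    )


def redundant_attributes(rows, cond_attrs, decision):
    """Condition attributes whose individual removal keeps the table consistent."""
    return [
        a
        for a in cond_attrs
        if is_consistent(rows, [x for x in cond_attrs if x != a], decision)
    ]


def decision_entropy(rows, decision):
    """Shannon entropy (in bits) of the decision attribute over all rows."""
    counts = defaultdict(int)
    for row in rows:
        counts[row[decision]] += 1
    n = len(rows)
    return -sum((c / n) * log2(c / n) for c in counts.values())


classes_on_a = indiscernibility_classes(TABLE, ["a"])
redundant = redundant_attributes(TABLE, ["a", "b", "c"], "d")
entropy_d = decision_entropy(TABLE, "d")
```

In this toy table, attribute `a` alone already separates the two decision values, so removing `b` or `c` individually leaves the table consistent, while removing `a` does not. Testing one attribute at a time is a deliberate simplification: a full reduct computation would also have to consider removing attributes in combination.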