A New Supervised Clustering Algorithm for Data Set with Mixed Attributes
Shijin Li, Yuelong Zhu, Jing Liu, Xiaohu Zhang
Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), 2007-07-30
DOI: 10.1109/SNPD.2007.360 (https://doi.org/10.1109/SNPD.2007.360)
Citations: 7
Abstract
Data sets with mixed attributes are complex, so few traditional clustering algorithms are suited to them, and those that are often cluster poorly. K-prototype clustering is one of the most commonly used data mining methods for this kind of data. Borrowing ideas from multiple-classifier combination techniques, this paper uses k-prototype as the base clustering algorithm to design a multi-level clustering ensemble algorithm that adaptively selects attributes for re-clustering. Comparison experiments on the Adult data set from the UCI machine learning repository show very competitive results, and the proposed method is suitable for data editing.
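
For readers unfamiliar with the base algorithm, the following is a minimal sketch of the dissimilarity measure that k-prototype clustering is generally built on: squared Euclidean distance over the numeric attributes plus a weighted count of mismatches over the categorical attributes. It illustrates only the base clusterer's assignment step, not the multi-level ensemble or the attribute selection proposed in the paper. The function names, the `gamma` default, and the sample records (loosely shaped like Adult rows, with made-up scaled values) are assumptions for illustration.

```python
from typing import List, Sequence, Tuple

# A record is (numeric values, categorical values). This layout is an
# assumption made for the sketch, not a format taken from the paper.
Record = Tuple[Sequence[float], Sequence[str]]


def mixed_dissimilarity(x: Record, prototype: Record, gamma: float = 1.0) -> float:
    """Squared Euclidean distance on the numeric attributes plus gamma
    times the number of mismatched categorical attributes."""
    x_num, x_cat = x
    p_num, p_cat = prototype
    numeric_part = sum((a - b) ** 2 for a, b in zip(x_num, p_num))
    categorical_part = sum(1 for a, b in zip(x_cat, p_cat) if a != b)
    return numeric_part + gamma * categorical_part


def assign_to_nearest(records: Sequence[Record],
                      prototypes: Sequence[Record],
                      gamma: float = 1.0) -> List[int]:
    """Return, for each record, the index of its closest prototype."""
    return [
        min(range(len(prototypes)),
            key=lambda k: mixed_dissimilarity(r, prototypes[k], gamma))
        for r in records
    ]


# Hypothetical records: (scaled age, scaled hours per week), (workclass, sex).
records = [
    ((0.2, 0.4), ("Private", "Male")),
    ((0.9, 0.8), ("Self-emp", "Female")),
]
prototypes = [
    ((0.25, 0.5), ("Private", "Male")),
    ((0.8, 0.7), ("Self-emp", "Female")),
]
labels = assign_to_nearest(records, prototypes, gamma=0.5)
```

The weight `gamma` balances the two attribute types: a larger value lets categorical mismatches dominate, a smaller one lets numeric distance dominate, so it is usually chosen with the scale of the numeric attributes in mind.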