{"title":"Semi-Supervised Similarity Preserving Co-Selection","authors":"Raywat Makkhongkaew, K. Benabdeslem","doi":"10.1109/ICDMW.2016.0111","DOIUrl":null,"url":null,"abstract":"Semi-supervised learning is the required paradigm when data are partially labeled. It is more adapted for large domain applications when labels are hardly and costly to obtain. In addition, when data are large, feature selection and instance selection are two important dual operations for removing irrelevant information. To address theses challenges together, we propose a unified framework, called sCOs, for semi-supervised co-selection of features and instances, simultaneously. In particular, we propose a novel cost function based on l2, 1-norm regularization and similarity preserving selection of both features and instances. Experimental results on some known benchmark datasets are provided for validating sCOs and comparing it with some representative methods in the state-of-the art.","PeriodicalId":373866,"journal":{"name":"2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2016.0111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Semi-supervised learning is the required paradigm when data are partially labeled. It is more adapted for large domain applications when labels are hardly and costly to obtain. In addition, when data are large, feature selection and instance selection are two important dual operations for removing irrelevant information. To address theses challenges together, we propose a unified framework, called sCOs, for semi-supervised co-selection of features and instances, simultaneously. In particular, we propose a novel cost function based on l2, 1-norm regularization and similarity preserving selection of both features and instances. Experimental results on some known benchmark datasets are provided for validating sCOs and comparing it with some representative methods in the state-of-the art.