{"title":"基于高斯混合的半监督增强非平衡数据分类","authors":"Mahit Kumar Paul, B. Pal","doi":"10.1109/ICECTE.2016.7879620","DOIUrl":null,"url":null,"abstract":"Semi supervised approaches are practical in problem domain where pattern clustering is supposed to provide good classification. Gaussian Mixture Model (GMM) can approximate arbitrary probability distribution, thus is considered as a dominant tool for classification in such domains. This paper appraises the functioning for GMM as it is applied to imbalanced datasets which consists of uneven distribution of samples from all the classes. Later, an ensemble approach is presented to boost the GMMs in a semi supervised manner via Adaptive Boosting technique. Experiment on benchmark imbalanced datasets with different imbalance ratio has been carried out. Empirical result demonstrates the efficacy of the proposed Boosted GMM classifier compared to baseline approaches like K-means and GMM.","PeriodicalId":6578,"journal":{"name":"2016 2nd International Conference on Electrical, Computer & Telecommunication Engineering (ICECTE)","volume":"39 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Gaussian mixture based semi supervised boosting for imbalanced data classification\",\"authors\":\"Mahit Kumar Paul, B. Pal\",\"doi\":\"10.1109/ICECTE.2016.7879620\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semi supervised approaches are practical in problem domain where pattern clustering is supposed to provide good classification. Gaussian Mixture Model (GMM) can approximate arbitrary probability distribution, thus is considered as a dominant tool for classification in such domains. This paper appraises the functioning for GMM as it is applied to imbalanced datasets which consists of uneven distribution of samples from all the classes. Later, an ensemble approach is presented to boost the GMMs in a semi supervised manner via Adaptive Boosting technique. Experiment on benchmark imbalanced datasets with different imbalance ratio has been carried out. Empirical result demonstrates the efficacy of the proposed Boosted GMM classifier compared to baseline approaches like K-means and GMM.\",\"PeriodicalId\":6578,\"journal\":{\"name\":\"2016 2nd International Conference on Electrical, Computer & Telecommunication Engineering (ICECTE)\",\"volume\":\"39 1\",\"pages\":\"1-4\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 2nd International Conference on Electrical, Computer & Telecommunication Engineering (ICECTE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECTE.2016.7879620\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Electrical, Computer & Telecommunication Engineering (ICECTE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECTE.2016.7879620","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Gaussian mixture based semi supervised boosting for imbalanced data classification
Semi supervised approaches are practical in problem domain where pattern clustering is supposed to provide good classification. Gaussian Mixture Model (GMM) can approximate arbitrary probability distribution, thus is considered as a dominant tool for classification in such domains. This paper appraises the functioning for GMM as it is applied to imbalanced datasets which consists of uneven distribution of samples from all the classes. Later, an ensemble approach is presented to boost the GMMs in a semi supervised manner via Adaptive Boosting technique. Experiment on benchmark imbalanced datasets with different imbalance ratio has been carried out. Empirical result demonstrates the efficacy of the proposed Boosted GMM classifier compared to baseline approaches like K-means and GMM.