{"title":"利用集成技术改进了多类分类和离群点检测方法","authors":"Dalton Ndirangu, W. Mwangi, L. Nderu","doi":"10.1145/3290420.3290450","DOIUrl":null,"url":null,"abstract":"Class imbalance problems in multiclass have attracted much research focus due to classification difficulty caused by imbalance class distribution, presence of outliers, and irrelevant features that degrades performance of classifiers. Most of the commonly used classification algorithms are developed for binary problem. Rare classes in the multiclass datasets may be treated as outlier classes. The study proposed development of an ensemble multiclass classification and outlier method using adaboost, random subspace algorithms with random forest as base classifiers and voting combination technique. Experimental results shows that the proposed method outperformed most of the commonly used classification algorithms such as KNN, bagging, Naive Bayes, and random forest. Using rare classes as the outlier classes, the proposed method performed better than outlier detection methods built using Naive Bayes, KNN, decision trees, and random forest algorithms. The study concludes that ensemble techniques leads to an improved multiclass classification and outlier detection method.","PeriodicalId":259201,"journal":{"name":"International Conference on Critical Infrastructure Protection","volume":"110 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Improving multiclass classification and outlier detection method through ensemble technique\",\"authors\":\"Dalton Ndirangu, W. Mwangi, L. Nderu\",\"doi\":\"10.1145/3290420.3290450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Class imbalance problems in multiclass have attracted much research focus due to classification difficulty caused by imbalance class distribution, presence of outliers, and irrelevant features that degrades performance of classifiers. Most of the commonly used classification algorithms are developed for binary problem. Rare classes in the multiclass datasets may be treated as outlier classes. The study proposed development of an ensemble multiclass classification and outlier method using adaboost, random subspace algorithms with random forest as base classifiers and voting combination technique. Experimental results shows that the proposed method outperformed most of the commonly used classification algorithms such as KNN, bagging, Naive Bayes, and random forest. Using rare classes as the outlier classes, the proposed method performed better than outlier detection methods built using Naive Bayes, KNN, decision trees, and random forest algorithms. The study concludes that ensemble techniques leads to an improved multiclass classification and outlier detection method.\",\"PeriodicalId\":259201,\"journal\":{\"name\":\"International Conference on Critical Infrastructure Protection\",\"volume\":\"110 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Critical Infrastructure Protection\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3290420.3290450\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Critical Infrastructure Protection","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3290420.3290450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving multiclass classification and outlier detection method through ensemble technique
Class imbalance problems in multiclass have attracted much research focus due to classification difficulty caused by imbalance class distribution, presence of outliers, and irrelevant features that degrades performance of classifiers. Most of the commonly used classification algorithms are developed for binary problem. Rare classes in the multiclass datasets may be treated as outlier classes. The study proposed development of an ensemble multiclass classification and outlier method using adaboost, random subspace algorithms with random forest as base classifiers and voting combination technique. Experimental results shows that the proposed method outperformed most of the commonly used classification algorithms such as KNN, bagging, Naive Bayes, and random forest. Using rare classes as the outlier classes, the proposed method performed better than outlier detection methods built using Naive Bayes, KNN, decision trees, and random forest algorithms. The study concludes that ensemble techniques leads to an improved multiclass classification and outlier detection method.