{"title":"A decomposition approach to imbalanced classification","authors":"A. Shrivastava, Junjie Cao","doi":"10.1109/ISI.2011.5984093","DOIUrl":null,"url":null,"abstract":"An important characteristic of many modern systems is the availability of large amounts of event data, collected through various sensors. Certain events occur very rarely among these, but may be critical to a successfully functioning system. Examples of these include faulty products, credit card frauds, among others. In this paper, we propose a framework for solving this problem, of detecting rare events, when modeled as a supervised learning task. Specifically, we consider an imbalanced 2-class classification problem. We overcome the challenge of class imbalance by decomposing the original learning task into many simpler learning tasks. A useful feature of the proposed algorithm is that the decision rule is simple enough to infer the importance of individual covariates in rare event detection. We present performance results on some public datasets to demonstrate the effectiveness of the proposed algorithm.","PeriodicalId":220165,"journal":{"name":"Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISI.2011.5984093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
An important characteristic of many modern systems is the availability of large amounts of event data, collected through various sensors. Certain events occur very rarely among these, but may be critical to a successfully functioning system. Examples of these include faulty products, credit card frauds, among others. In this paper, we propose a framework for solving this problem, of detecting rare events, when modeled as a supervised learning task. Specifically, we consider an imbalanced 2-class classification problem. We overcome the challenge of class imbalance by decomposing the original learning task into many simpler learning tasks. A useful feature of the proposed algorithm is that the decision rule is simple enough to infer the importance of individual covariates in rare event detection. We present performance results on some public datasets to demonstrate the effectiveness of the proposed algorithm.