The Application of SMOTE Algorithm for Unbalanced Data
Dong Lv, Zhicheng Ma, Shibo Yang, Xianbo Li, Zhixin Ma, Fan Jiang
International Conference on Artificial Intelligence and Virtual Reality, 2018-11-23
DOI: 10.1145/3293663.3293686 (https://doi.org/10.1145/3293663.3293686)
Citations: 7
Abstract
Current power user data is imbalanced when it is used to analyze the behavior of leakage users: the normal user data and the leakage user data differ in scale. When an automatic identification model for leakage users is built on such data, the behavioral features of leakage users are not captured clearly, which reduces the model's classification efficiency. In this paper, we use Python to process the leakage user data with the SMOTE algorithm, increasing the basic information available on these users and extracting more accurate leakage user behavior characteristics.
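The abstract does not give implementation details, so the following is only a minimal sketch of the standard SMOTE interpolation step, not the authors' code. It assumes that leakage users are the minority class and that each user is a list of numeric features. The function name `smote`, its parameters, and the toy feature values are all hypothetical.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points by interpolating between minority neighbors."""
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # a sample has at most len - 1 neighbors
    synthetic = []
    for _ in range(n_synthetic):
        # Pick a random minority sample as the base point.
        i = rng.randrange(len(minority))
        base = minority[i]
        # Find its k nearest minority neighbors (Euclidean), excluding itself.
        others = [p for j, p in enumerate(minority) if j != i]
        neighbors = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbor = rng.choice(neighbors)
        # Place the new point on the segment between base and neighbor.
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbor)])
    return synthetic


# Hypothetical toy data: two made-up features per leakage user.
leakage_users = [[0.9, 0.2], [0.8, 0.3], [1.0, 0.1], [0.7, 0.25]]
n_normal = 20  # assumed size of the majority (normal user) class

new_rows = smote(leakage_users, n_synthetic=n_normal - len(leakage_users), k=3)
balanced_leakage = leakage_users + new_rows
print(len(balanced_leakage))  # 20
```

Each synthetic row is a convex combination of two real leakage users, so it stays within the range of the observed minority data rather than duplicating existing rows. In practice a maintained implementation such as the `SMOTE` class in the imbalanced-learn package may be preferable to a hand-written version.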