{"title":"基于不平衡数据集的机器学习电信欺诈检测","authors":"Ivan Krasić, S. Celar","doi":"10.23919/softcom55329.2022.9911518","DOIUrl":null,"url":null,"abstract":"This paper compares the performance of different ML algorithms to a fraud dataset. Telecom operators have long history and experience against fraud activities. During the time and changing the role of telecom operators, from service and infrastructure carriers to communication service providers handling data, voice, and content transfer. There is an urgent need to develop efficient and adaptive algorithms for early fraud detection and prevention. In this paper, we used machine learning techniques as an effective method to detect fraudsters in mobile communications. Fraud datasets are derived from the real telco operator's environment. Different experiments with a few popular machine learning classification algorithms were performed to evaluate the performance of the model. The used dataset, characterized by highly imbalanced distribution, is oversampled with SMOTE method to generate synthetic minority class instances and provide experiments. Performance metrics are evaluated and future research directives are proposed.","PeriodicalId":261625,"journal":{"name":"2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Telecom Fraud Detection with Machine Learning on Imbalanced Dataset\",\"authors\":\"Ivan Krasić, S. Celar\",\"doi\":\"10.23919/softcom55329.2022.9911518\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper compares the performance of different ML algorithms to a fraud dataset. Telecom operators have long history and experience against fraud activities. During the time and changing the role of telecom operators, from service and infrastructure carriers to communication service providers handling data, voice, and content transfer. There is an urgent need to develop efficient and adaptive algorithms for early fraud detection and prevention. In this paper, we used machine learning techniques as an effective method to detect fraudsters in mobile communications. Fraud datasets are derived from the real telco operator's environment. Different experiments with a few popular machine learning classification algorithms were performed to evaluate the performance of the model. The used dataset, characterized by highly imbalanced distribution, is oversampled with SMOTE method to generate synthetic minority class instances and provide experiments. Performance metrics are evaluated and future research directives are proposed.\",\"PeriodicalId\":261625,\"journal\":{\"name\":\"2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/softcom55329.2022.9911518\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/softcom55329.2022.9911518","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Telecom Fraud Detection with Machine Learning on Imbalanced Dataset
This paper compares the performance of different ML algorithms to a fraud dataset. Telecom operators have long history and experience against fraud activities. During the time and changing the role of telecom operators, from service and infrastructure carriers to communication service providers handling data, voice, and content transfer. There is an urgent need to develop efficient and adaptive algorithms for early fraud detection and prevention. In this paper, we used machine learning techniques as an effective method to detect fraudsters in mobile communications. Fraud datasets are derived from the real telco operator's environment. Different experiments with a few popular machine learning classification algorithms were performed to evaluate the performance of the model. The used dataset, characterized by highly imbalanced distribution, is oversampled with SMOTE method to generate synthetic minority class instances and provide experiments. Performance metrics are evaluated and future research directives are proposed.