Yu Zhang, Huan Wu, Jianzhong Zhang, Jingjing Wang, Xueqiang Zou
{"title":"TW-FCM:一种改进的模糊c均值算法用于SPIT检测","authors":"Yu Zhang, Huan Wu, Jianzhong Zhang, Jingjing Wang, Xueqiang Zou","doi":"10.1109/ICCCN.2018.8487369","DOIUrl":null,"url":null,"abstract":"With the popularity of VoIP (Voice over Internet Protocol) systems, there has been an explosive growth in VoIP spam. In order to effectively prevent spam calls, various methods based on the analysis of call behavior features have been proposed. However, few of the existing methods consider that different features have different weights, resulting in a low detection precision of SPIT (Spam over Internet Telephony) users. Meanwhile, most methods are tested based on the experimental data generated by simulation, it is not sure whether these methods work well in the real world. In this paper, we propose a Weighted-Fuzzy-C-Means (W- FCM) algorithm, which can automatically adjust the weight of each call feature in the clustering process. Experiments based on the real world data show that our proposed algorithm could effectively improve the detection precision (about 6.7%) and recall (about 0.3%) of SPIT users. We also analyze the impact of different membership thresholds on the clustering results and propose a Threshold-W-FCM (TW-FCM) algorithm, through which we can select appropriate membership thresholds to alleviate the class-imbalance problem, and thereby improve the overall performance of SPIT detection compared with traditional FCM method.","PeriodicalId":399145,"journal":{"name":"2018 27th International Conference on Computer Communication and Networks (ICCCN)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"TW-FCM: An Improved Fuzzy-C-Means Algorithm for SPIT Detection\",\"authors\":\"Yu Zhang, Huan Wu, Jianzhong Zhang, Jingjing Wang, Xueqiang Zou\",\"doi\":\"10.1109/ICCCN.2018.8487369\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the popularity of VoIP (Voice over Internet Protocol) systems, there has been an explosive growth in VoIP spam. In order to effectively prevent spam calls, various methods based on the analysis of call behavior features have been proposed. However, few of the existing methods consider that different features have different weights, resulting in a low detection precision of SPIT (Spam over Internet Telephony) users. Meanwhile, most methods are tested based on the experimental data generated by simulation, it is not sure whether these methods work well in the real world. In this paper, we propose a Weighted-Fuzzy-C-Means (W- FCM) algorithm, which can automatically adjust the weight of each call feature in the clustering process. Experiments based on the real world data show that our proposed algorithm could effectively improve the detection precision (about 6.7%) and recall (about 0.3%) of SPIT users. We also analyze the impact of different membership thresholds on the clustering results and propose a Threshold-W-FCM (TW-FCM) algorithm, through which we can select appropriate membership thresholds to alleviate the class-imbalance problem, and thereby improve the overall performance of SPIT detection compared with traditional FCM method.\",\"PeriodicalId\":399145,\"journal\":{\"name\":\"2018 27th International Conference on Computer Communication and Networks (ICCCN)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 27th International Conference on Computer Communication and Networks (ICCCN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCN.2018.8487369\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 27th International Conference on Computer Communication and Networks (ICCCN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCN.2018.8487369","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
随着VoIP (Internet Protocol Voice)系统的普及,VoIP垃圾邮件呈爆炸式增长。为了有效地防止垃圾电话,人们在分析呼叫行为特征的基础上提出了各种方法。然而,现有的方法很少考虑到不同的特征具有不同的权值,导致对垃圾邮件用户的检测精度较低。同时,大多数方法都是基于仿真生成的实验数据进行测试,不确定这些方法在现实世界中是否有效。本文提出了一种加权模糊c均值(W- FCM)算法,该算法可以在聚类过程中自动调整每个呼叫特征的权重。基于真实世界数据的实验表明,本文提出的算法可以有效地提高SPIT用户的检测精度(约6.7%)和召回率(约0.3%)。我们还分析了不同隶属度阈值对聚类结果的影响,提出了一种阈值- w -FCM (Threshold-W-FCM)算法,通过该算法可以选择合适的隶属度阈值来缓解类不平衡问题,从而与传统的FCM方法相比,提高了SPIT检测的整体性能。
TW-FCM: An Improved Fuzzy-C-Means Algorithm for SPIT Detection
With the popularity of VoIP (Voice over Internet Protocol) systems, there has been an explosive growth in VoIP spam. In order to effectively prevent spam calls, various methods based on the analysis of call behavior features have been proposed. However, few of the existing methods consider that different features have different weights, resulting in a low detection precision of SPIT (Spam over Internet Telephony) users. Meanwhile, most methods are tested based on the experimental data generated by simulation, it is not sure whether these methods work well in the real world. In this paper, we propose a Weighted-Fuzzy-C-Means (W- FCM) algorithm, which can automatically adjust the weight of each call feature in the clustering process. Experiments based on the real world data show that our proposed algorithm could effectively improve the detection precision (about 6.7%) and recall (about 0.3%) of SPIT users. We also analyze the impact of different membership thresholds on the clustering results and propose a Threshold-W-FCM (TW-FCM) algorithm, through which we can select appropriate membership thresholds to alleviate the class-imbalance problem, and thereby improve the overall performance of SPIT detection compared with traditional FCM method.