{"title":"基于模糊多类支持向量机的微博垃圾信息检测","authors":"Guangxia Xu, G. Gao, Mengxiao Hu","doi":"10.1109/CYBERC.2018.00016","DOIUrl":null,"url":null,"abstract":"Micro-blog has become an important information dissemination and exchange platform in people's social lives. Massive micro-blog data contains a large number of valuable information, but the micro-blog platform appears to have a lot of spam behavior problems in recent years; behavior consistent with spammers and spam micro-blogs. The spam not only affects the impact of micro-blog's data mining and decision analysis, but also seriously affects the healthy development of micro-blog platform and user experience. In this paper, a new spammer detection method based on fuzzy multi-class support vector machines (FMCSVM) is proposed in micro-blog, it combines the SVM multi-class classifier with the fuzzy mathematics theory in spammer detection. Current researches on micro-blog spammers is to analyze the characteristics of the global spammers, so that the strength of these analyses is not enough, and these researches lack the feature analysis for a certain type spammer. As a result, this will enable the spammer to escape the spam detection system. In this paper, we divide spammers into three categories by analyzing the features of micro-blog spammers, and then construct one-versus-rest SVM multi-class classifier. The fuzzy clustering method is used to deal with the mixed samples generated by the multi class classifier, and the combination classifier is obtained, which improves the detection accuracy.","PeriodicalId":282903,"journal":{"name":"2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting Spammer on Micro-blogs Base on Fuzzy Multi-class SVM\",\"authors\":\"Guangxia Xu, G. Gao, Mengxiao Hu\",\"doi\":\"10.1109/CYBERC.2018.00016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Micro-blog has become an important information dissemination and exchange platform in people's social lives. Massive micro-blog data contains a large number of valuable information, but the micro-blog platform appears to have a lot of spam behavior problems in recent years; behavior consistent with spammers and spam micro-blogs. The spam not only affects the impact of micro-blog's data mining and decision analysis, but also seriously affects the healthy development of micro-blog platform and user experience. In this paper, a new spammer detection method based on fuzzy multi-class support vector machines (FMCSVM) is proposed in micro-blog, it combines the SVM multi-class classifier with the fuzzy mathematics theory in spammer detection. Current researches on micro-blog spammers is to analyze the characteristics of the global spammers, so that the strength of these analyses is not enough, and these researches lack the feature analysis for a certain type spammer. As a result, this will enable the spammer to escape the spam detection system. In this paper, we divide spammers into three categories by analyzing the features of micro-blog spammers, and then construct one-versus-rest SVM multi-class classifier. The fuzzy clustering method is used to deal with the mixed samples generated by the multi class classifier, and the combination classifier is obtained, which improves the detection accuracy.\",\"PeriodicalId\":282903,\"journal\":{\"name\":\"2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CYBERC.2018.00016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBERC.2018.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Detecting Spammer on Micro-blogs Base on Fuzzy Multi-class SVM
Micro-blog has become an important information dissemination and exchange platform in people's social lives. Massive micro-blog data contains a large number of valuable information, but the micro-blog platform appears to have a lot of spam behavior problems in recent years; behavior consistent with spammers and spam micro-blogs. The spam not only affects the impact of micro-blog's data mining and decision analysis, but also seriously affects the healthy development of micro-blog platform and user experience. In this paper, a new spammer detection method based on fuzzy multi-class support vector machines (FMCSVM) is proposed in micro-blog, it combines the SVM multi-class classifier with the fuzzy mathematics theory in spammer detection. Current researches on micro-blog spammers is to analyze the characteristics of the global spammers, so that the strength of these analyses is not enough, and these researches lack the feature analysis for a certain type spammer. As a result, this will enable the spammer to escape the spam detection system. In this paper, we divide spammers into three categories by analyzing the features of micro-blog spammers, and then construct one-versus-rest SVM multi-class classifier. The fuzzy clustering method is used to deal with the mixed samples generated by the multi class classifier, and the combination classifier is obtained, which improves the detection accuracy.