{"title":"使用基于字典和机器学习方法检测社交媒体标签劫持","authors":"Wei Ling Cheah, Hui Na Chua","doi":"10.1109/IICAIET55139.2022.9936788","DOIUrl":null,"url":null,"abstract":"Nowadays, hashtags are widely utilized on all social media platforms since they deliver numerous benefits, particularly for corporations aiming to reach a larger audience. However, hashtag exploitation has resulted in the problem of hashtag hijacking, which is a type of cyber content threat that anyone or any organization can carry out. As a result, this research presents a framework for detecting social media hashtag hijacking through machine learning algorithms. This paper aims to identify methods to classify relevant and irrelevant hashtags to their contents. This paper demonstrates the unsupervised machine learning method, namely the dictionary-based approach, to classify the relevance of hashtags with the content of tweets on an unlabeled dataset, and also the implementation of supervised machine learning methods, including the Support Vector Machine (SVM), Naive Bayes classifier, and Decision Tree algorithms, to classify the relevance of hashtags used with their contents and compare the machine's performances on labeled datasets. Our results showed that the Support Vector Machine (SVM) performs the best in classifying the relevance of hashtags with an accuracy of 93.36%, an F1 score of 96.19% and ROC-AVC score of 97.22 %. The findings of the study present an automated detection framework for hashtag hijacking that can overcome the limitations of previous studies and adapt to external threats with high performance over time.","PeriodicalId":142482,"journal":{"name":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Detection of Social Media Hashtag Hijacking Using Dictionary-based and Machine Learning Methods\",\"authors\":\"Wei Ling Cheah, Hui Na Chua\",\"doi\":\"10.1109/IICAIET55139.2022.9936788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays, hashtags are widely utilized on all social media platforms since they deliver numerous benefits, particularly for corporations aiming to reach a larger audience. However, hashtag exploitation has resulted in the problem of hashtag hijacking, which is a type of cyber content threat that anyone or any organization can carry out. As a result, this research presents a framework for detecting social media hashtag hijacking through machine learning algorithms. This paper aims to identify methods to classify relevant and irrelevant hashtags to their contents. This paper demonstrates the unsupervised machine learning method, namely the dictionary-based approach, to classify the relevance of hashtags with the content of tweets on an unlabeled dataset, and also the implementation of supervised machine learning methods, including the Support Vector Machine (SVM), Naive Bayes classifier, and Decision Tree algorithms, to classify the relevance of hashtags used with their contents and compare the machine's performances on labeled datasets. Our results showed that the Support Vector Machine (SVM) performs the best in classifying the relevance of hashtags with an accuracy of 93.36%, an F1 score of 96.19% and ROC-AVC score of 97.22 %. The findings of the study present an automated detection framework for hashtag hijacking that can overcome the limitations of previous studies and adapt to external threats with high performance over time.\",\"PeriodicalId\":142482,\"journal\":{\"name\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IICAIET55139.2022.9936788\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET55139.2022.9936788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Detection of Social Media Hashtag Hijacking Using Dictionary-based and Machine Learning Methods
Nowadays, hashtags are widely utilized on all social media platforms since they deliver numerous benefits, particularly for corporations aiming to reach a larger audience. However, hashtag exploitation has resulted in the problem of hashtag hijacking, which is a type of cyber content threat that anyone or any organization can carry out. As a result, this research presents a framework for detecting social media hashtag hijacking through machine learning algorithms. This paper aims to identify methods to classify relevant and irrelevant hashtags to their contents. This paper demonstrates the unsupervised machine learning method, namely the dictionary-based approach, to classify the relevance of hashtags with the content of tweets on an unlabeled dataset, and also the implementation of supervised machine learning methods, including the Support Vector Machine (SVM), Naive Bayes classifier, and Decision Tree algorithms, to classify the relevance of hashtags used with their contents and compare the machine's performances on labeled datasets. Our results showed that the Support Vector Machine (SVM) performs the best in classifying the relevance of hashtags with an accuracy of 93.36%, an F1 score of 96.19% and ROC-AVC score of 97.22 %. The findings of the study present an automated detection framework for hashtag hijacking that can overcome the limitations of previous studies and adapt to external threats with high performance over time.