{"title":"基于集成学习和SMOTE的恶意短信检测提高移动网络安全","authors":"Hongsheng Xu , Akeel Qadir , Saima Sadiq","doi":"10.1016/j.cose.2025.104443","DOIUrl":null,"url":null,"abstract":"<div><div>The widespread use of cell phones, along with their constant internet connection, makes them vulnerable to malicious SMS attacks, including smishing and spam. Smishing involves attempts to steal personal information, while spam focuses on unwanted advertisements. Both pose cybersecurity threats, often requiring effective filtering techniques. Researchers have devised multiple methods for detecting malicious SMS, yet a notable gap remains in creating algorithms to reduce false positives, where normal messages are wrongly classified as malicious. The method employs ensemble learning to automatically identify malicious or legitimate messages. It combines Support Vector Machine and Random Forest models, compared with individual machine learning approaches for smishing detection. Feature extraction methods like Term Frequency (TF) and Term Frequency–Inverse Document Frequency (TF–IDF) are employed to derive features from the data. The imbalanced issue of the dataset is addressed by applying the Synthetic Minority Oversampling Technique (SMOTE). The results showed that the ensemble model outperformed the individual models, with an accuracy score of 99.58% when trained using TF–IDF on the balanced dataset. The proposed approach offers proactive defense against malicious SMS attacks, enhancing cybersecurity in the mobile communications sector.</div></div>","PeriodicalId":51004,"journal":{"name":"Computers & Security","volume":"154 ","pages":"Article 104443"},"PeriodicalIF":4.8000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Malicious SMS detection using ensemble learning and SMOTE to improve mobile cybersecurity\",\"authors\":\"Hongsheng Xu , Akeel Qadir , Saima Sadiq\",\"doi\":\"10.1016/j.cose.2025.104443\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The widespread use of cell phones, along with their constant internet connection, makes them vulnerable to malicious SMS attacks, including smishing and spam. Smishing involves attempts to steal personal information, while spam focuses on unwanted advertisements. Both pose cybersecurity threats, often requiring effective filtering techniques. Researchers have devised multiple methods for detecting malicious SMS, yet a notable gap remains in creating algorithms to reduce false positives, where normal messages are wrongly classified as malicious. The method employs ensemble learning to automatically identify malicious or legitimate messages. It combines Support Vector Machine and Random Forest models, compared with individual machine learning approaches for smishing detection. Feature extraction methods like Term Frequency (TF) and Term Frequency–Inverse Document Frequency (TF–IDF) are employed to derive features from the data. The imbalanced issue of the dataset is addressed by applying the Synthetic Minority Oversampling Technique (SMOTE). The results showed that the ensemble model outperformed the individual models, with an accuracy score of 99.58% when trained using TF–IDF on the balanced dataset. The proposed approach offers proactive defense against malicious SMS attacks, enhancing cybersecurity in the mobile communications sector.</div></div>\",\"PeriodicalId\":51004,\"journal\":{\"name\":\"Computers & Security\",\"volume\":\"154 \",\"pages\":\"Article 104443\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Security\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167404825001324\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Security","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167404825001324","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Malicious SMS detection using ensemble learning and SMOTE to improve mobile cybersecurity
The widespread use of cell phones, along with their constant internet connection, makes them vulnerable to malicious SMS attacks, including smishing and spam. Smishing involves attempts to steal personal information, while spam focuses on unwanted advertisements. Both pose cybersecurity threats, often requiring effective filtering techniques. Researchers have devised multiple methods for detecting malicious SMS, yet a notable gap remains in creating algorithms to reduce false positives, where normal messages are wrongly classified as malicious. The method employs ensemble learning to automatically identify malicious or legitimate messages. It combines Support Vector Machine and Random Forest models, compared with individual machine learning approaches for smishing detection. Feature extraction methods like Term Frequency (TF) and Term Frequency–Inverse Document Frequency (TF–IDF) are employed to derive features from the data. The imbalanced issue of the dataset is addressed by applying the Synthetic Minority Oversampling Technique (SMOTE). The results showed that the ensemble model outperformed the individual models, with an accuracy score of 99.58% when trained using TF–IDF on the balanced dataset. The proposed approach offers proactive defense against malicious SMS attacks, enhancing cybersecurity in the mobile communications sector.
期刊介绍:
Computers & Security is the most respected technical journal in the IT security field. With its high-profile editorial board and informative regular features and columns, the journal is essential reading for IT security professionals around the world.
Computers & Security provides you with a unique blend of leading edge research and sound practical management advice. It is aimed at the professional involved with computer security, audit, control and data integrity in all sectors - industry, commerce and academia. Recognized worldwide as THE primary source of reference for applied research and technical expertise it is your first step to fully secure systems.