{"title":"基于递归神经网络和长短期记忆的垃圾短信过滤","authors":"A. Chandra, S. Khatri","doi":"10.1109/ISCON47742.2019.9036269","DOIUrl":null,"url":null,"abstract":"SMS is the abbreviation for Short Messaging Service which uses standard protocols for mobile devices to exchange information via short text messages. Today SMS's are an easy, inexpensive and widely accepted way to communicate rather than phone calls. Spam can be described as random unsolicited messages sent at large without any authorization from the receiver. People still deal with spammer's misusing SMS's to advertise false claims and can gain access to private information of users. Emails, social media sites, review, and even Twitter has seen spammers trying to intrude everywhere with the advent of Internet. Spam appears in many forms like comments, emails, search results and personal messages where spammers tend to gain revenues. Various Machine Learning algorithms like Neural Networks have tried to detect spam messages and normal messages or ham from SMS's. These techniques can learn high level features automatically using raw data unlike traditional ways where features are selected after analysis for classification. In this research paper, we propose a new method utilizing Recurrent Neural Network (RNN) and Long Short Term Memory (LSTM) using Keras models and Tensorflow backend to detect Spam and Ham from ‘SpamSMSCollection’ dataset available at UCI machine learning repository. Crucial preprocessing of dataset included tokenization, TF-IDF Vectorization and removal of stopwords. Overall accuracy of 98% is achieved and shows improvement from other machine learning algorithms for spam detection.","PeriodicalId":124412,"journal":{"name":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Spam SMS Filtering using Recurrent Neural Network and Long Short Term Memory\",\"authors\":\"A. Chandra, S. Khatri\",\"doi\":\"10.1109/ISCON47742.2019.9036269\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"SMS is the abbreviation for Short Messaging Service which uses standard protocols for mobile devices to exchange information via short text messages. Today SMS's are an easy, inexpensive and widely accepted way to communicate rather than phone calls. Spam can be described as random unsolicited messages sent at large without any authorization from the receiver. People still deal with spammer's misusing SMS's to advertise false claims and can gain access to private information of users. Emails, social media sites, review, and even Twitter has seen spammers trying to intrude everywhere with the advent of Internet. Spam appears in many forms like comments, emails, search results and personal messages where spammers tend to gain revenues. Various Machine Learning algorithms like Neural Networks have tried to detect spam messages and normal messages or ham from SMS's. These techniques can learn high level features automatically using raw data unlike traditional ways where features are selected after analysis for classification. In this research paper, we propose a new method utilizing Recurrent Neural Network (RNN) and Long Short Term Memory (LSTM) using Keras models and Tensorflow backend to detect Spam and Ham from ‘SpamSMSCollection’ dataset available at UCI machine learning repository. Crucial preprocessing of dataset included tokenization, TF-IDF Vectorization and removal of stopwords. Overall accuracy of 98% is achieved and shows improvement from other machine learning algorithms for spam detection.\",\"PeriodicalId\":124412,\"journal\":{\"name\":\"2019 4th International Conference on Information Systems and Computer Networks (ISCON)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 4th International Conference on Information Systems and Computer Networks (ISCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCON47742.2019.9036269\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCON47742.2019.9036269","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Spam SMS Filtering using Recurrent Neural Network and Long Short Term Memory
SMS is the abbreviation for Short Messaging Service which uses standard protocols for mobile devices to exchange information via short text messages. Today SMS's are an easy, inexpensive and widely accepted way to communicate rather than phone calls. Spam can be described as random unsolicited messages sent at large without any authorization from the receiver. People still deal with spammer's misusing SMS's to advertise false claims and can gain access to private information of users. Emails, social media sites, review, and even Twitter has seen spammers trying to intrude everywhere with the advent of Internet. Spam appears in many forms like comments, emails, search results and personal messages where spammers tend to gain revenues. Various Machine Learning algorithms like Neural Networks have tried to detect spam messages and normal messages or ham from SMS's. These techniques can learn high level features automatically using raw data unlike traditional ways where features are selected after analysis for classification. In this research paper, we propose a new method utilizing Recurrent Neural Network (RNN) and Long Short Term Memory (LSTM) using Keras models and Tensorflow backend to detect Spam and Ham from ‘SpamSMSCollection’ dataset available at UCI machine learning repository. Crucial preprocessing of dataset included tokenization, TF-IDF Vectorization and removal of stopwords. Overall accuracy of 98% is achieved and shows improvement from other machine learning algorithms for spam detection.