{"title":"Exploring Efficiency of Character-level Convolution Neuron Network and Long Short Term Memory on Malicious URL Detection","authors":"Thuy Pham, Van-Nam Hoang, Thanh Ngoc Ha","doi":"10.1145/3301326.3301336","DOIUrl":null,"url":null,"abstract":"Machine learning techniques, especially deep learning neuron networks have been increasingly applied to solve the problems relating to information security and cybersecurity. Malicious URL (Uniform Resource Locator) detection is one of these. It is considered as a binary classification in machine learning, in which a URL or website address is classed as malign or benign. In this work, we implement the experiments on two different datasets to explore the efficiency of three proposed character-level deep neuron networks: (1) CNN (Convolution Neuron Network) based on VGG-16 architecture (Visual Geometry Group), (2) LSTM (Long Short Term Memory), and a fusion of CNN and LSTM for malicious URL detection. The experimental results are promising, especially for the fusion scheme of LSTM and CNN, with above 96% for precision and 98% for recall.","PeriodicalId":294040,"journal":{"name":"Proceedings of the 2018 VII International Conference on Network, Communication and Computing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 VII International Conference on Network, Communication and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3301326.3301336","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
Machine learning techniques, especially deep learning neuron networks have been increasingly applied to solve the problems relating to information security and cybersecurity. Malicious URL (Uniform Resource Locator) detection is one of these. It is considered as a binary classification in machine learning, in which a URL or website address is classed as malign or benign. In this work, we implement the experiments on two different datasets to explore the efficiency of three proposed character-level deep neuron networks: (1) CNN (Convolution Neuron Network) based on VGG-16 architecture (Visual Geometry Group), (2) LSTM (Long Short Term Memory), and a fusion of CNN and LSTM for malicious URL detection. The experimental results are promising, especially for the fusion scheme of LSTM and CNN, with above 96% for precision and 98% for recall.