Exploring Efficiency of Character-level Convolution Neuron Network and Long Short Term Memory on Malicious URL Detection
Authors: Thuy Pham, Van-Nam Hoang, Thanh Ngoc Ha
Published in: Proceedings of the 2018 VII International Conference on Network, Communication and Computing, 2018-12-14
DOI: 10.1145/3301326.3301336 (https://doi.org/10.1145/3301326.3301336)
Machine learning techniques, especially deep neural networks, have been increasingly applied to problems in information security and cybersecurity. Malicious URL (Uniform Resource Locator) detection is one of these. It is treated as a binary classification task in which a URL or website address is classified as malicious or benign. In this work, we run experiments on two different datasets to explore the efficiency of three proposed character-level deep neural networks: (1) a CNN (Convolutional Neural Network) based on the VGG-16 (Visual Geometry Group) architecture, (2) an LSTM (Long Short-Term Memory) network, and (3) a fusion of CNN and LSTM for malicious URL detection. The experimental results are promising, especially for the fusion scheme of LSTM and CNN, with above 96% precision and 98% recall.
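To make the character-level setup and the reported metrics concrete, the following is a minimal sketch of how a URL might be turned into a fixed-length sequence of character indices for a character-level network, and how precision and recall are computed for the malicious class. The alphabet, the padding and unknown-character indices, the `max_len` default, and the function names `encode_url` and `precision_recall` are all assumptions made for illustration; the abstract does not describe the authors' actual preprocessing.

```python
import string

# Assumed alphabet: lowercase letters, digits, and ASCII punctuation.
# Index 0 is reserved for padding and index 1 for unknown characters
# (both conventions are illustrative, not taken from the paper).
ALPHABET = string.ascii_lowercase + string.digits + string.punctuation
PAD, UNK = 0, 1
CHAR_TO_INDEX = {ch: i + 2 for i, ch in enumerate(ALPHABET)}


def encode_url(url, max_len=200):
    """Map a URL to a fixed-length list of integer character indices."""
    # Lowercase, truncate to max_len, and look up each character.
    indices = [CHAR_TO_INDEX.get(ch, UNK) for ch in url.lower()[:max_len]]
    # Right-pad short URLs so every input has the same length.
    return indices + [PAD] * (max_len - len(indices))


def precision_recall(y_true, y_pred):
    """Precision and recall for the positive class (malicious = 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


if __name__ == "__main__":
    # Hypothetical URL and labels, used only to exercise the functions.
    print(encode_url("http://example.com/login", max_len=32))
    print(precision_recall([1, 1, 0, 0], [1, 0, 1, 0]))
```

The resulting integer sequences would typically be fed to an embedding layer ahead of the convolutional or recurrent layers. Precision here is the fraction of URLs flagged as malicious that really are malicious, and recall is the fraction of malicious URLs that were flagged.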