BotDetector: A robust and scalable approach toward detecting malware-infected devices

2017 IEEE International Conference on Communications (ICC) Pub Date : 2017-07-28 DOI:10.1109/ICC.2017.7997372

Sho Mizuno, Mitsuhiro Hatada, Tatsuya Mori, Shigeki Goto

{"title":"BotDetector: A robust and scalable approach toward detecting malware-infected devices","authors":"Sho Mizuno, Mitsuhiro Hatada, Tatsuya Mori, Shigeki Goto","doi":"10.1109/ICC.2017.7997372","DOIUrl":null,"url":null,"abstract":"Damage caused by malware is a serious problem that needs to be addressed. The recent rise in the spread of evasive malware has made it difficult to detect it at the pre-infection timing. Malware detection at post-infection timing is a promising approach that fulfills this gap. Given this background, this work aims to identify likely malware-infected devices from the measurement of Internet traffic. The advantage of the traffic-measurement-based approach is that it enables us to monitor a large number of clients. If we find a client as a source of malicious traffic, the client is likely a malware-infected device. Since the majority of malware today makes use of the web as a means to communicate with the C&C servers that reside on the external network, we leverage information recorded in the HTTP headers to discriminate between malicious and legitimate traffic. To make our approach scalable and robust, we develop the automatic template generation scheme that drastically reduces the amount of information to be kept while achieving the high accuracy of classification; since it does not make use of any domain knowledge, the approach should be robust against changes of malware. We apply several classifiers, which include machine learning algorithms, to the extracted templates and classify traffic into two categories: malicious and legitimate. Our extensive experiments demonstrate that our approach discriminates between malicious and legitimate traffic with up to 97.1% precision while maintaining the false positive below 1.0%.","PeriodicalId":6517,"journal":{"name":"2017 IEEE International Conference on Communications (ICC)","volume":"46 1","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Communications (ICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICC.2017.7997372","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

Damage caused by malware is a serious problem that needs to be addressed. The recent rise in the spread of evasive malware has made it difficult to detect it at the pre-infection timing. Malware detection at post-infection timing is a promising approach that fulfills this gap. Given this background, this work aims to identify likely malware-infected devices from the measurement of Internet traffic. The advantage of the traffic-measurement-based approach is that it enables us to monitor a large number of clients. If we find a client as a source of malicious traffic, the client is likely a malware-infected device. Since the majority of malware today makes use of the web as a means to communicate with the C&C servers that reside on the external network, we leverage information recorded in the HTTP headers to discriminate between malicious and legitimate traffic. To make our approach scalable and robust, we develop the automatic template generation scheme that drastically reduces the amount of information to be kept while achieving the high accuracy of classification; since it does not make use of any domain knowledge, the approach should be robust against changes of malware. We apply several classifiers, which include machine learning algorithms, to the extracted templates and classify traffic into two categories: malicious and legitimate. Our extensive experiments demonstrate that our approach discriminates between malicious and legitimate traffic with up to 97.1% precision while maintaining the false positive below 1.0%.

查看原文本刊更多论文

BotDetector:一种检测恶意软件感染设备的强大且可扩展的方法

恶意软件造成的损害是一个需要解决的严重问题。最近，规避性恶意软件的传播有所增加，这使得在感染前的时间检测它变得困难。在感染后进行恶意软件检测是一种很有前途的方法，可以填补这一空白。在此背景下，本工作旨在通过测量互联网流量来识别可能被恶意软件感染的设备。基于流量测量的方法的优点是，它使我们能够监控大量的客户端。如果我们发现客户端是恶意流量的来源，则该客户端很可能是受恶意软件感染的设备。由于今天大多数恶意软件都利用网络作为与驻留在外部网络上的C&C服务器通信的手段，我们利用记录在HTTP标头中的信息来区分恶意和合法流量。为了使我们的方法具有可扩展性和鲁棒性，我们开发了自动模板生成方案，该方案在实现高精度分类的同时大大减少了需要保留的信息量;由于它不使用任何领域知识，因此该方法对于恶意软件的更改应该是健壮的。我们将几种分类器(包括机器学习算法)应用于提取的模板，并将流量分为两类:恶意和合法。我们的大量实验表明，我们的方法区分恶意和合法流量的准确率高达97.1%，同时保持误报低于1.0%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE International Conference on Communications (ICC)

自引率

0.00%

发文量