Disclosure: detecting botnet command and control servers through large-scale NetFlow analysis

Asia-Pacific Computer Systems Architecture Conference Pub Date : 2012-12-03 DOI:10.1145/2420950.2420969

Leyla Bilge, D. Balzarotti, William K. Robertson, E. Kirda, Christopher Krügel

{"title":"Disclosure: detecting botnet command and control servers through large-scale NetFlow analysis","authors":"Leyla Bilge, D. Balzarotti, William K. Robertson, E. Kirda, Christopher Krügel","doi":"10.1145/2420950.2420969","DOIUrl":null,"url":null,"abstract":"Botnets continue to be a significant problem on the Internet. Accordingly, a great deal of research has focused on methods for detecting and mitigating the effects of botnets. Two of the primary factors preventing the development of effective large-scale, wide-area botnet detection systems are seemingly contradictory. On the one hand, technical and administrative restrictions result in a general unavailability of raw network data that would facilitate botnet detection on a large scale. On the other hand, were this data available, real-time processing at that scale would be a formidable challenge. In contrast to raw network data, NetFlow data is widely available. However, NetFlow data imposes several challenges for performing accurate botnet detection.\n In this paper, we present Disclosure, a large-scale, wide-area botnet detection system that incorporates a combination of novel techniques to overcome the challenges imposed by the use of NetFlow data. In particular, we identify several groups of features that allow Disclosure to reliably distinguish C&C channels from benign traffic using NetFlow records (i.e., flow sizes, client access patterns, and temporal behavior). To reduce Disclosure's false positive rate, we incorporate a number of external reputation scores into our system's detection procedure. Finally, we provide an extensive evaluation of Disclosure over two large, real-world networks. Our evaluation demonstrates that Disclosure is able to perform real-time detection of botnet C&C channels over datasets on the order of billions of flows per day.","PeriodicalId":397003,"journal":{"name":"Asia-Pacific Computer Systems Architecture Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"286","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asia-Pacific Computer Systems Architecture Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2420950.2420969","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 286

Abstract

Botnets continue to be a significant problem on the Internet. Accordingly, a great deal of research has focused on methods for detecting and mitigating the effects of botnets. Two of the primary factors preventing the development of effective large-scale, wide-area botnet detection systems are seemingly contradictory. On the one hand, technical and administrative restrictions result in a general unavailability of raw network data that would facilitate botnet detection on a large scale. On the other hand, were this data available, real-time processing at that scale would be a formidable challenge. In contrast to raw network data, NetFlow data is widely available. However, NetFlow data imposes several challenges for performing accurate botnet detection. In this paper, we present Disclosure, a large-scale, wide-area botnet detection system that incorporates a combination of novel techniques to overcome the challenges imposed by the use of NetFlow data. In particular, we identify several groups of features that allow Disclosure to reliably distinguish C&C channels from benign traffic using NetFlow records (i.e., flow sizes, client access patterns, and temporal behavior). To reduce Disclosure's false positive rate, we incorporate a number of external reputation scores into our system's detection procedure. Finally, we provide an extensive evaluation of Disclosure over two large, real-world networks. Our evaluation demonstrates that Disclosure is able to perform real-time detection of botnet C&C channels over datasets on the order of billions of flows per day.

查看原文本刊更多论文

披露:通过大规模NetFlow分析检测僵尸网络命令和控制服务器

僵尸网络仍然是互联网上的一个重大问题。因此，大量的研究集中在检测和减轻僵尸网络影响的方法上。阻止大规模、广域僵尸网络检测系统发展的两个主要因素似乎是相互矛盾的。一方面，技术和管理限制导致原始网络数据的普遍不可用，这将有助于大规模的僵尸网络检测。另一方面，如果这些数据可用，那么这种规模的实时处理将是一个艰巨的挑战。与原始网络数据相比，NetFlow数据是广泛可用的。然而，NetFlow数据对执行准确的僵尸网络检测提出了一些挑战。在本文中，我们介绍了Disclosure，这是一个大规模的广域僵尸网络检测系统，它结合了一系列新技术来克服使用NetFlow数据所带来的挑战。特别是，我们确定了几组特征，这些特征允许Disclosure使用NetFlow记录(即流量大小、客户端访问模式和时间行为)可靠地区分C&C通道和良性流量。为了减少“披露”的误报率，我们将一些外部声誉评分纳入我们系统的检测程序中。最后，我们在两个大型的真实世界网络上对Disclosure进行了广泛的评估。我们的评估表明，Disclosure能够在每天数十亿流量的数据集上对僵尸网络C&C通道进行实时检测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Asia-Pacific Computer Systems Architecture Conference

自引率

0.00%

发文量