A supervised machine learning approach to classify host roles on line using sFlow

HPPN '13 Pub Date : 2013-06-18 DOI:10.1145/2465839.2465847
Bingdong Li, M. H. Gunes, G. Bebis, Jeff Springer
{"title":"A supervised machine learning approach to classify host roles on line using sFlow","authors":"Bingdong Li, M. H. Gunes, G. Bebis, Jeff Springer","doi":"10.1145/2465839.2465847","DOIUrl":null,"url":null,"abstract":"Classifying host roles based on network traffic behavior is valuable for network security analysis and detecting security policy violation. Behavior-based network security analysis has advantages over traditional approaches such as code patterns or signatures. Modeling host roles based on network flow data is challenging because of the huge volume of network traffic and overlap among host roles. Many studies of network traffic classification have focused on classifying applications such as web, peer-to-peer, and DNS traffic. In general, machine learning approaches have been applied on classifying applications, security awareness, and anomaly detection. In this paper, we present a supervised machine learning approach that use On-Line Support Vector Machine and Decision Tree to classify host roles. We collect sFlow data from main gateways of a large campus network. We classify different roles, namely, clients versus servers, regular web non-email servers versus web email servers, clients at personal offices versus public places of laboratories and libraries, and personal office clients from two different colleges. We achieved very high classification accuracy, i.e., 99.2% accuracy in classifying clients versus servers, 100% accuracy in classifying regular web non-email servers versus web email servers, 93.3% accuracy in classifying clients at personnel offices versus public places, and 93.3% accuracy in classifying clients at personal offices from two different colleges.","PeriodicalId":212430,"journal":{"name":"HPPN '13","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"30","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"HPPN '13","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2465839.2465847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 30

Abstract

Classifying host roles based on network traffic behavior is valuable for network security analysis and detecting security policy violation. Behavior-based network security analysis has advantages over traditional approaches such as code patterns or signatures. Modeling host roles based on network flow data is challenging because of the huge volume of network traffic and overlap among host roles. Many studies of network traffic classification have focused on classifying applications such as web, peer-to-peer, and DNS traffic. In general, machine learning approaches have been applied on classifying applications, security awareness, and anomaly detection. In this paper, we present a supervised machine learning approach that use On-Line Support Vector Machine and Decision Tree to classify host roles. We collect sFlow data from main gateways of a large campus network. We classify different roles, namely, clients versus servers, regular web non-email servers versus web email servers, clients at personal offices versus public places of laboratories and libraries, and personal office clients from two different colleges. We achieved very high classification accuracy, i.e., 99.2% accuracy in classifying clients versus servers, 100% accuracy in classifying regular web non-email servers versus web email servers, 93.3% accuracy in classifying clients at personnel offices versus public places, and 93.3% accuracy in classifying clients at personal offices from two different colleges.
一种使用sFlow在线分类主机角色的监督机器学习方法
基于网络流量行为对主机角色进行分类,对网络安全分析和安全策略违规检测具有重要意义。基于行为的网络安全分析比传统方法(如代码模式或签名)具有优势。由于网络流量巨大且主机角色之间存在重叠,因此基于网络流数据对主机角色进行建模具有挑战性。许多网络流量分类的研究都集中在web、p2p和DNS等应用流量的分类上。一般来说,机器学习方法已经应用于应用程序分类、安全意识和异常检测。在本文中,我们提出了一种使用在线支持向量机和决策树对主机角色进行分类的监督机器学习方法。我们从一个大型校园网的主网关收集sFlow数据。我们对不同的角色进行分类,即客户端与服务器,常规网络非邮件服务器与网络邮件服务器,个人办公室客户端与实验室和图书馆公共场所客户端,以及来自两个不同学院的个人办公室客户端。我们取得了非常高的分类准确率,即客户端与服务器的分类准确率为99.2%,常规web非电子邮件服务器与web电子邮件服务器的分类准确率为100%,人事办公室与公共场所的客户端分类准确率为93.3%,个人办公室与两个不同学院的客户端分类准确率为93.3%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信