On the Use of SVMs to Detect Anomalies in a Stream of SIP Messages

2012 11th International Conference on Machine Learning and Applications Pub Date : 2012-12-12 DOI:10.1109/ICMLA.2012.109

Raihana Ferdous, R. Cigno, A. Zorat

{"title":"On the Use of SVMs to Detect Anomalies in a Stream of SIP Messages","authors":"Raihana Ferdous, R. Cigno, A. Zorat","doi":"10.1109/ICMLA.2012.109","DOIUrl":null,"url":null,"abstract":"Voice and multimedia communications are rapidly migrating from traditional networks to TCP/IP networks (Internet), where services are provisioned by SIP (Session Initiation Protocol). This paper proposes an on-line filter that examines the stream of incoming SIP messages and classifies them as good or bad. The classification is carried out in two stages: first a lexical analysis is performed to weed out those messages that do not belong to the language generated by the grammar defined by the SIP standard. After this first stage, a second filtering occurs which identifies messages that somehow differ - in structure or contents - from messages that were previously classified as good. While the first filter stage is straightforward, as the classification is crisp (either a messages belongs to the language or it does not), the second stage requires a more delicate handling, as it is not a sharp decision whether a message is semantically meaningful or not. The approach we followed for this step is based on using past experience on previously classified messages, i.e. a \"learn-by-example\" approach, which led to a classifier based on Support-Vector-Machines (SVM) to perform the required analysis of each incoming SIP message. The paper describes the overall architecture of the two-stage filter and then explores several points of the configuration-space for the SVM to determine a good configuration setting that will perform well when used to classify a large sample of SIP messages obtained from real traffic collected on a VoIP installation at our institution. Finally, the performance of the classification on additional messages collected from the same source is presented.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2012.109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 16

Abstract

Voice and multimedia communications are rapidly migrating from traditional networks to TCP/IP networks (Internet), where services are provisioned by SIP (Session Initiation Protocol). This paper proposes an on-line filter that examines the stream of incoming SIP messages and classifies them as good or bad. The classification is carried out in two stages: first a lexical analysis is performed to weed out those messages that do not belong to the language generated by the grammar defined by the SIP standard. After this first stage, a second filtering occurs which identifies messages that somehow differ - in structure or contents - from messages that were previously classified as good. While the first filter stage is straightforward, as the classification is crisp (either a messages belongs to the language or it does not), the second stage requires a more delicate handling, as it is not a sharp decision whether a message is semantically meaningful or not. The approach we followed for this step is based on using past experience on previously classified messages, i.e. a "learn-by-example" approach, which led to a classifier based on Support-Vector-Machines (SVM) to perform the required analysis of each incoming SIP message. The paper describes the overall architecture of the two-stage filter and then explores several points of the configuration-space for the SVM to determine a good configuration setting that will perform well when used to classify a large sample of SIP messages obtained from real traffic collected on a VoIP installation at our institution. Finally, the performance of the classification on additional messages collected from the same source is presented.

查看原文本刊更多论文

基于svm的SIP消息流异常检测

语音和多媒体通信正迅速从传统网络向TCP/IP网络(Internet)迁移，在TCP/IP网络中，业务由SIP(会话发起协议)提供。本文提出了一种在线过滤器，该过滤器检查传入的SIP消息流并将其分类为好或坏。分类分两个阶段进行:首先执行词法分析，以清除那些不属于由SIP标准定义的语法生成的语言的消息。在第一阶段之后，将进行第二次过滤，以识别在结构或内容上与先前分类为良好的消息有所不同的消息。虽然第一个筛选阶段很简单，因为分类很清晰(消息要么属于该语言，要么不属于该语言)，但第二阶段需要更精细的处理，因为它不能明确地决定消息是否在语义上有意义。我们在此步骤中采用的方法是基于使用过去对先前分类消息的经验，即“通过示例学习”方法，该方法导致基于支持向量机(SVM)的分类器对每个传入的SIP消息执行所需的分析。本文描述了两阶段过滤器的整体架构，然后探讨了支持向量机配置空间的几个点，以确定一个良好的配置设置，当用于分类从我们机构的VoIP安装上收集的真实流量中获得的大量SIP消息样本时，该配置设置将表现良好。最后，给出了对从同一来源收集的其他消息进行分类的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 11th International Conference on Machine Learning and Applications

自引率

0.00%

发文量