A “Fast Data” architecture: Dashboard for anomalous traffic analysis in data networks

2016 Eleventh International Conference on Digital Information Management (ICDIM) Pub Date : 2016-09-01 DOI:10.1109/ICDIM.2016.7829756

Miguel Angel López Peña, C. Rua, Sergio Segovia Lozoya

{"title":"A “Fast Data” architecture: Dashboard for anomalous traffic analysis in data networks","authors":"Miguel Angel López Peña, C. Rua, Sergio Segovia Lozoya","doi":"10.1109/ICDIM.2016.7829756","DOIUrl":null,"url":null,"abstract":"Fast Data is a new Big Data computing paradigm that ensures requirements such as Real-Time processing of continuous data stream, storage at high rates and low latency with no data losses. In this work we propose a “Fast Data” architecture for a specific kind of software application in which input data arrive very fast and the results for each processed data have to match such input rates. We applied this architecture to build a Dashboard for Anomalous Traffic Analysis in Data Networks. In order to fulfill the requirements of Real-Time processing and no data losses, we carry out a design that consists of a pattern of dynamic tree of process pipelines, where the number of branches increases proportionally to the input data rate. Two different approaches have been followed to implement this design pattern: one based in a well-known set of products from the Big Data ecosystem; and the other built with Kafka, Zookeeper and a set of components designed and implemented by us. These two implementations have been compared in terms of velocity and scalability performance. As a result, the implementation built with our own components is significantly faster and scalable than the traditional one. The good results obtained by using both the design pattern of dynamic tree of process pipelines and our implementation make them very suitable for its use in other scenarios and applications such as smart cities, environment monitoring, industry 4.0, distributed control systems, etc.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2016.7829756","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Fast Data is a new Big Data computing paradigm that ensures requirements such as Real-Time processing of continuous data stream, storage at high rates and low latency with no data losses. In this work we propose a “Fast Data” architecture for a specific kind of software application in which input data arrive very fast and the results for each processed data have to match such input rates. We applied this architecture to build a Dashboard for Anomalous Traffic Analysis in Data Networks. In order to fulfill the requirements of Real-Time processing and no data losses, we carry out a design that consists of a pattern of dynamic tree of process pipelines, where the number of branches increases proportionally to the input data rate. Two different approaches have been followed to implement this design pattern: one based in a well-known set of products from the Big Data ecosystem; and the other built with Kafka, Zookeeper and a set of components designed and implemented by us. These two implementations have been compared in terms of velocity and scalability performance. As a result, the implementation built with our own components is significantly faster and scalable than the traditional one. The good results obtained by using both the design pattern of dynamic tree of process pipelines and our implementation make them very suitable for its use in other scenarios and applications such as smart cities, environment monitoring, industry 4.0, distributed control systems, etc.

查看原文本刊更多论文

“快速数据”架构:数据网络中异常流量分析的仪表板

Fast Data是一种新的大数据计算范式，可确保对连续数据流的实时处理、高速率存储和低延迟、无数据丢失等要求。在这项工作中，我们为一种特定类型的软件应用程序提出了一种“快速数据”架构，其中输入数据到达非常快，并且每个处理数据的结果必须匹配这样的输入速率。我们应用这个架构来构建一个仪表板，用于数据网络中的异常流量分析。为了满足实时处理和无数据丢失的要求，我们进行了一种由流程管道动态树模式组成的设计，其中分支数量与输入数据率成比例增加。实现这种设计模式有两种不同的方法:一种是基于大数据生态系统中一组众所周知的产品;另一个是用Kafka、Zookeeper和我们设计和实现的一组组件构建的。这两种实现在速度和可伸缩性性能方面进行了比较。因此，使用我们自己的组件构建的实现比传统的实现要快得多，而且可扩展。通过使用过程管道动态树的设计模式和我们的实现所获得的良好结果使它们非常适合在其他场景和应用中使用，例如智慧城市，环境监测，工业4.0，分布式控制系统等。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Eleventh International Conference on Digital Information Management (ICDIM)

自引率

0.00%

发文量