Design and Development of A Cloud-Based IDS using Apache Kafka and Spark Streaming

2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE) Pub Date : 2022-06-22 DOI:10.1109/jcsse54890.2022.9836264

Len Wirz, Rinrada Tanthanathewin, Asipan Ketphet, S. Fugkeaw

{"title":"Design and Development of A Cloud-Based IDS using Apache Kafka and Spark Streaming","authors":"Len Wirz, Rinrada Tanthanathewin, Asipan Ketphet, S. Fugkeaw","doi":"10.1109/jcsse54890.2022.9836264","DOIUrl":null,"url":null,"abstract":"Owing to the efficient resource management, accessibility, and high service availability, cloud computing has been leveraged by several intensive-data processing applications such as big data analytics, social media applications. These applications are typically based on the development of web service and web application. Even though web-based technology offers effective communication and implementation, it has been susceptible to various kinds of attack. In this paper, we investigate possible attacks on REST which is a commonly used protocol for the web service implementation. In REST, HTTP requests are mapped to GET, POST, PUT, and DELETE that have been proven to be prone to common attacks including Automated Brute Forcing on web-based login, HTTP flood attacks, SQL injections (SQLi), and Cross-Site Scripting (XSS). To this end, we propose a design and implementation of the cloud-based IDS to detect such attacks by employing Apache Kafka and Spark streaming to classify and process the high volume of user inputs in REST HTTP communication. To detect the anomalous inputs, we apply the signature-based approach to construct an IDS engine based on a set of known attack patterns that will be leveraged by the Spark Streaming. Specifically, we introduce a new string comparison collection that improves the False Positive (FP) rate in SQL injection detection, which has been a major issue in most proposed IDS currently available. In our experiment, the system is able to determine malicious patterns with high performance as well as to generate SMS alerts and log the event in a Google Cloud Storage Bucket in an efficient manner.","PeriodicalId":284735,"journal":{"name":"2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/jcsse54890.2022.9836264","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Owing to the efficient resource management, accessibility, and high service availability, cloud computing has been leveraged by several intensive-data processing applications such as big data analytics, social media applications. These applications are typically based on the development of web service and web application. Even though web-based technology offers effective communication and implementation, it has been susceptible to various kinds of attack. In this paper, we investigate possible attacks on REST which is a commonly used protocol for the web service implementation. In REST, HTTP requests are mapped to GET, POST, PUT, and DELETE that have been proven to be prone to common attacks including Automated Brute Forcing on web-based login, HTTP flood attacks, SQL injections (SQLi), and Cross-Site Scripting (XSS). To this end, we propose a design and implementation of the cloud-based IDS to detect such attacks by employing Apache Kafka and Spark streaming to classify and process the high volume of user inputs in REST HTTP communication. To detect the anomalous inputs, we apply the signature-based approach to construct an IDS engine based on a set of known attack patterns that will be leveraged by the Spark Streaming. Specifically, we introduce a new string comparison collection that improves the False Positive (FP) rate in SQL injection detection, which has been a major issue in most proposed IDS currently available. In our experiment, the system is able to determine malicious patterns with high performance as well as to generate SMS alerts and log the event in a Google Cloud Storage Bucket in an efficient manner.

查看原文本刊更多论文

基于Apache Kafka和Spark Streaming的基于云的IDS的设计与开发

由于高效的资源管理、可访问性和高服务可用性，云计算已被一些密集数据处理应用程序(如大数据分析、社交媒体应用程序)所利用。这些应用程序通常是基于web服务和web应用程序的开发。尽管基于web的技术提供了有效的通信和实现，但它很容易受到各种攻击。在本文中，我们研究了针对REST的可能攻击，REST是web服务实现的常用协议。在REST中，HTTP请求被映射为GET、POST、PUT和DELETE，这些已被证明容易受到常见攻击，包括基于web登录的自动暴力破解、HTTP洪水攻击、SQL注入(SQLi)和跨站脚本(XSS)。为此，我们提出了一种基于云的IDS的设计和实现，通过使用Apache Kafka和Spark流对REST HTTP通信中的大量用户输入进行分类和处理来检测此类攻击。为了检测异常输入，我们应用基于签名的方法来构建一个基于一组已知攻击模式的IDS引擎，这些攻击模式将被Spark Streaming利用。具体来说，我们引入了一个新的字符串比较集合，它可以提高SQL注入检测中的误报率(FP)，这是目前可用的大多数IDS中的一个主要问题。在我们的实验中，系统能够以高性能确定恶意模式，并以高效的方式生成SMS警报并将事件记录在谷歌云存储桶中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE)

自引率

0.00%

发文量