{"title":"Towards comparison of real time stream processing engines","authors":"Devesh Kumar Lal, U. Suman","doi":"10.1109/CICT48419.2019.9066123","DOIUrl":null,"url":null,"abstract":"The real time stream processing engines are developed for different specific use cases, which incorporates various domains such as, IOT, finance, advertisement, telecommunications, healthcare etc. These stream processing engines are based on distributed processing models, where unbounded data streams are processed. Semantics of data stream is determined after complete scanning of whole data sets, which becomes inconvenient in real time stream processing to process entire data stream at once. Windowing mechanisms are used for processing data stream in a predefine topology with fixed number of operations such as, join, aggregate, filter etc. In this paper, a comparative study is performed with existing stream processing engines. This comparison provides a direction for choosing an appropriate stream processing engine. A modified master-slave model for stream processing is discussed for reducing latency, improving scalability and fault tolerance.","PeriodicalId":234540,"journal":{"name":"2019 IEEE Conference on Information and Communication Technology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Conference on Information and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICT48419.2019.9066123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The real time stream processing engines are developed for different specific use cases, which incorporates various domains such as, IOT, finance, advertisement, telecommunications, healthcare etc. These stream processing engines are based on distributed processing models, where unbounded data streams are processed. Semantics of data stream is determined after complete scanning of whole data sets, which becomes inconvenient in real time stream processing to process entire data stream at once. Windowing mechanisms are used for processing data stream in a predefine topology with fixed number of operations such as, join, aggregate, filter etc. In this paper, a comparative study is performed with existing stream processing engines. This comparison provides a direction for choosing an appropriate stream processing engine. A modified master-slave model for stream processing is discussed for reducing latency, improving scalability and fault tolerance.