Apostolos Papageorgiou, Ehsan Poormohammady, Bin Cheng
{"title":"Edge-Computing-Aware Deployment of Stream Processing Tasks Based on Topology-External Information: Model, Algorithms, and a Storm-Based Prototype","authors":"Apostolos Papageorgiou, Ehsan Poormohammady, Bin Cheng","doi":"10.1109/BigDataCongress.2016.40","DOIUrl":null,"url":null,"abstract":"Stream Processing Frameworks (SPF, e.g., Apache Storm) are solutions that facilitate and manage the execution of processing topologies that consist of multiple parallelizable steps (or tasks) and involve continuous data exchange among these tasks. Stemming from the world of Cloud-centric Big Data processing, SPFs often fail to address certain requirements of Internet-of-Things systems. For example, existing deployment solutions ignore the fact that topology tasks can also be involved in other interactions and data-intensive communication flows, which are not taking place between the tasks, but between a task and another Internet-of-things entity, such as an actuator, a database, or a user. This paper describes SPF extensions for taking these interactions into account. The extensions are described both generically and as extensions of Apache Storm. In a simple evaluation upon a topology which involves topology-external interactions, we demonstrate how our solution can eliminate latency requirements violations and reduce Cloud-to-edge bandwidth consumption to 1/3 compared to Apache Storm.","PeriodicalId":407471,"journal":{"name":"2016 IEEE International Congress on Big Data (BigData Congress)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Congress on Big Data (BigData Congress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2016.40","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23
Abstract
Stream Processing Frameworks (SPF, e.g., Apache Storm) are solutions that facilitate and manage the execution of processing topologies that consist of multiple parallelizable steps (or tasks) and involve continuous data exchange among these tasks. Stemming from the world of Cloud-centric Big Data processing, SPFs often fail to address certain requirements of Internet-of-Things systems. For example, existing deployment solutions ignore the fact that topology tasks can also be involved in other interactions and data-intensive communication flows, which are not taking place between the tasks, but between a task and another Internet-of-things entity, such as an actuator, a database, or a user. This paper describes SPF extensions for taking these interactions into account. The extensions are described both generically and as extensions of Apache Storm. In a simple evaluation upon a topology which involves topology-external interactions, we demonstrate how our solution can eliminate latency requirements violations and reduce Cloud-to-edge bandwidth consumption to 1/3 compared to Apache Storm.