V. Gil-Costa, Nicolás Hidalgo, Erika Rosas, Mauricio Marín
{"title":"S4并行流处理引擎的动态负载平衡算法","authors":"V. Gil-Costa, Nicolás Hidalgo, Erika Rosas, Mauricio Marín","doi":"10.1109/SBAC-PADW.2016.12","DOIUrl":null,"url":null,"abstract":"Large streams of data can be analyzed in realtimeby Parallel Stream Processing Engines (PSPEs) which arebased on a graph paradigm where vertices represent processingelements (PEs) and edges represent flows of data among PEs. Inthis work, we propose a new elastic strategy for the S4 PSPE toadjust the overall load of PEs in accordance with the utilizationlevels and data traffic at each PE. Our approach exploits aproducer/consumer model to achieve load balance where newworkers pull events from a buffer queue in order to release theamount of traffic in an overloaded PE. Results show that theproposed strategy prevents saturation of PEs and improves theoverall throughput of the system by up to 470%.","PeriodicalId":186179,"journal":{"name":"2016 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Dynamic Load Balance Algorithm for the S4 Parallel Stream Processing Engine\",\"authors\":\"V. Gil-Costa, Nicolás Hidalgo, Erika Rosas, Mauricio Marín\",\"doi\":\"10.1109/SBAC-PADW.2016.12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Large streams of data can be analyzed in realtimeby Parallel Stream Processing Engines (PSPEs) which arebased on a graph paradigm where vertices represent processingelements (PEs) and edges represent flows of data among PEs. Inthis work, we propose a new elastic strategy for the S4 PSPE toadjust the overall load of PEs in accordance with the utilizationlevels and data traffic at each PE. Our approach exploits aproducer/consumer model to achieve load balance where newworkers pull events from a buffer queue in order to release theamount of traffic in an overloaded PE. Results show that theproposed strategy prevents saturation of PEs and improves theoverall throughput of the system by up to 470%.\",\"PeriodicalId\":186179,\"journal\":{\"name\":\"2016 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SBAC-PADW.2016.12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SBAC-PADW.2016.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Dynamic Load Balance Algorithm for the S4 Parallel Stream Processing Engine
Large streams of data can be analyzed in realtimeby Parallel Stream Processing Engines (PSPEs) which arebased on a graph paradigm where vertices represent processingelements (PEs) and edges represent flows of data among PEs. Inthis work, we propose a new elastic strategy for the S4 PSPE toadjust the overall load of PEs in accordance with the utilizationlevels and data traffic at each PE. Our approach exploits aproducer/consumer model to achieve load balance where newworkers pull events from a buffer queue in order to release theamount of traffic in an overloaded PE. Results show that theproposed strategy prevents saturation of PEs and improves theoverall throughput of the system by up to 470%.