{"title":"SAGE: Geo-Distributed Streaming Data Analysis in Clouds","authors":"R. Tudoran, Gabriel Antoniu, L. Bougé","doi":"10.1109/IPDPSW.2013.95","DOIUrl":null,"url":null,"abstract":"The continuous growth of sensor networks, stock exchanges, climate monitoring or scientific applications produces new streaming data at increasing rates. Managing and processing such data, sometimes generated from multiple geographical locations, raises important challenges as it requires real-time processing or data aggregation. Conventional solutions like DBMS, MapReduce or dedicated solutions adopting single-located environments fail to meet the demands required for processing the Geo-distributed streaming data. Public clouds like Azure, with data centers spread around the globe, offer the infrastructure which can handle such a processing. Our approach, proposes a service-oriented cloud architecture for performing the stream analysis, by composing services which are distributed among multiple cloud data centers. Hence, the computation is moved towards the multiple data sources exploiting the geographical data locality. The initial results showed good scalability of the approach, reaching 1000 cores in the Azure cloud, and performance improvements compared to single location processing of a factor of 3.3.","PeriodicalId":234552,"journal":{"name":"2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW.2013.95","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
The continuous growth of sensor networks, stock exchanges, climate monitoring or scientific applications produces new streaming data at increasing rates. Managing and processing such data, sometimes generated from multiple geographical locations, raises important challenges as it requires real-time processing or data aggregation. Conventional solutions like DBMS, MapReduce or dedicated solutions adopting single-located environments fail to meet the demands required for processing the Geo-distributed streaming data. Public clouds like Azure, with data centers spread around the globe, offer the infrastructure which can handle such a processing. Our approach, proposes a service-oriented cloud architecture for performing the stream analysis, by composing services which are distributed among multiple cloud data centers. Hence, the computation is moved towards the multiple data sources exploiting the geographical data locality. The initial results showed good scalability of the approach, reaching 1000 cores in the Azure cloud, and performance improvements compared to single location processing of a factor of 3.3.