The impact of data locality on the performance of a SaaS cloud with real-time data-intensive applications

Georgios L. Stavrinides, H. Karatza
{"title":"The impact of data locality on the performance of a SaaS cloud with real-time data-intensive applications","authors":"Georgios L. Stavrinides, H. Karatza","doi":"10.1109/DISTRA.2017.8167683","DOIUrl":null,"url":null,"abstract":"As cloud computing continues to gain momentum, big data analytics are now offered as Software as a Service (SaaS). Besides the heterogeneity and multi-tenancy of the underlying virtualized environment, scheduling such real-time, data-intensive, embarrassingly parallel applications in a SaaS cloud involves another serious challenge: data locality. Consequently, data-aware scheduling policies should be employed, in order to effectively exploit data locality, while at the same time taking into account the other attributes of the workload and the characteristics of the resources. Towards this direction, we investigate via simulation the impact of data locality on the performance of a SaaS cloud, where real-time, data-intensive bags-of-tasks are scheduled dynamically, under various data availability conditions. A non-data-aware baseline scheduling policy is compared with two proposed data-aware heuristics, in an attempt to shed light on the effect of data locality awareness on the system performance.","PeriodicalId":109971,"journal":{"name":"2017 IEEE/ACM 21st International Symposium on Distributed Simulation and Real Time Applications (DS-RT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/ACM 21st International Symposium on Distributed Simulation and Real Time Applications (DS-RT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DISTRA.2017.8167683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

Abstract

As cloud computing continues to gain momentum, big data analytics are now offered as Software as a Service (SaaS). Besides the heterogeneity and multi-tenancy of the underlying virtualized environment, scheduling such real-time, data-intensive, embarrassingly parallel applications in a SaaS cloud involves another serious challenge: data locality. Consequently, data-aware scheduling policies should be employed, in order to effectively exploit data locality, while at the same time taking into account the other attributes of the workload and the characteristics of the resources. Towards this direction, we investigate via simulation the impact of data locality on the performance of a SaaS cloud, where real-time, data-intensive bags-of-tasks are scheduled dynamically, under various data availability conditions. A non-data-aware baseline scheduling policy is compared with two proposed data-aware heuristics, in an attempt to shed light on the effect of data locality awareness on the system performance.
数据局部性对具有实时数据密集型应用程序的SaaS云性能的影响
随着云计算的发展势头不断增强,大数据分析现在以软件即服务(SaaS)的形式提供。除了底层虚拟化环境的异构性和多租户性之外,在SaaS云中调度这种实时的、数据密集型的、令人尴尬的并行应用程序还涉及另一个严重的挑战:数据位置。因此,应该采用数据感知调度策略,以便有效地利用数据局部性,同时考虑工作负载的其他属性和资源的特征。朝着这个方向,我们通过模拟研究了数据局部性对SaaS云性能的影响,在SaaS云中,实时、数据密集型的任务包在各种数据可用性条件下是动态调度的。将非数据感知基线调度策略与两种提出的数据感知启发式策略进行比较,试图揭示数据位置感知对系统性能的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信