Chris X. Cai, Shayan Saeed, Indranil Gupta, R. Campbell, Franck Le
{"title":"Phurti: Application and Network-Aware Flow Scheduling for Multi-tenant MapReduce Clusters","authors":"Chris X. Cai, Shayan Saeed, Indranil Gupta, R. Campbell, Franck Le","doi":"10.1109/IC2E.2016.21","DOIUrl":"https://doi.org/10.1109/IC2E.2016.21","url":null,"abstract":"Traffic for a typical MapReduce job in a data center consists of multiple network flows. Traditionally, network resources have been allocated to optimize network-level metrics such as flow completion time or throughput. Some recent schemes propose using application-aware scheduling which can shorten the average job completion time. However, most of them treat the core network as a black box with sufficient capacity. Even if only one network link in the core network becomes a bottleneck, it can hurt application performance. We design and implement a centralized flow-scheduling framework called Phurti with the goal of improving the completion time for jobs in a cluster shared among multiple Hadoop jobs (multi-tenant). Phurti communicates both with the Hadoop framework to retrieve job-level network traffic information and the OpenFlow-based switches to learn about the network topology. Phurti implements a novel heuristic called Smallest Maximum Sequential-traffic First (SMSF) that uses collected application and network information to perform traffic scheduling for MapReduce jobs. Our evaluation with real Hadoop workloads shows that compared to application and network-agnostic scheduling strategies, Phurti improves job completion time for 95% of the jobs, decreases average job completion time by 20%, tail job completion time by 13% and scales well with the cluster size and number of jobs.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128673490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Attribute-Based Partial Geo-Replication System","authors":"Hobin Yoon, Ada Gavrilovska, K. Schwan","doi":"10.1109/IC2E.2016.29","DOIUrl":"https://doi.org/10.1109/IC2E.2016.29","url":null,"abstract":"Existing partial geo-replication systems do not always provide optimal cost or latency, because their replication decisions are based on statically established data access popularity metrics, regardless of the application types. We demonstrate that additional reduction in cost and latency can be achieved by (1) using the right object attributes for making replication decisions for each type of application, (2) using multi-attribute-based replications, and (3) combining the popularity-based but reactive approach with the more random but proactive approach to data replication. Toward this end, we propose Acorn, an Attribute-based COntinuous partial geo-ReplicatioN system, and its prototype implementation based on Apache Cassandra. Experiments with two types of global-scale, data-sharing applications demonstrate up to 54% and 90% cost overhead reduction over existing systems or 38% and 91% latency overhead reduction.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125241875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On IO Latency Prediction Accuracy and Automated Load Balancing in Consolidated VM Environments","authors":"Jun Nemoto, G. Ganger","doi":"10.1109/IC2E.2016.32","DOIUrl":"https://doi.org/10.1109/IC2E.2016.32","url":null,"abstract":"Manually managing IO workloads and performance in consolidated VM environments is often difficult and error prone. Thus, automated IO workload (re) placement using virtual disk migration is a key functionality of large scale VM infrastructure. A promising approach is to place IO workloads based on predicted IO latencies, but previous prediction models are often inaccurate due to dependence on there being only a linear relationship between workload parameters and IO latency. This paper presents a new accurate IO latency prediction model for use in automated load balancing. Our experiments show that our model improves relative error ratio of IO latency prediction by 67% for SSDs and 43% for HDDs on average. We also evaluate how the improvement of IO latency prediction affects actual load balancing and overall IO performance. Contrary to our expectation, we find that the significant improvement of IO latency prediction accuracy does not translate into significant overall performance improvement.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121100081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. A. Javadi, Sagar Mehra, Bharath Kumar Reddy Vangoor, Anshul Gandhi
{"title":"UIE: User-Centric Interference Estimation for Cloud Applications","authors":"S. A. Javadi, Sagar Mehra, Bharath Kumar Reddy Vangoor, Anshul Gandhi","doi":"10.1109/IC2E.2016.13","DOIUrl":"https://doi.org/10.1109/IC2E.2016.13","url":null,"abstract":"Interference is one of the key deterrents to cloud adoption, and is known to cause severe degradation in application performance, costing service providers in lost revenues. In this paper, we present UIE, a user-centric approach to detecting and, importantly, estimating the degree of interference experienced by user applications in the cloud. UIE employs queueing theory to model the impact of resource contention on application performance. By leveraging UIE, users can estimate the true amount of resources, including CPU, network, and I/O, allocated to their application at any given time, without any assistance from the cloud provider or hypervisor.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134144100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anshul Gandhi, Sidhartha Thota, Parijat Dube, Andrzej Kochut, Li Zhang
{"title":"Autoscaling for Hadoop Clusters","authors":"Anshul Gandhi, Sidhartha Thota, Parijat Dube, Andrzej Kochut, Li Zhang","doi":"10.1109/IC2E.2016.11","DOIUrl":"https://doi.org/10.1109/IC2E.2016.11","url":null,"abstract":"Unforeseen events such as node failures and resource contention can have a severe impact on the performance of data processing frameworks, such as Hadoop, especially in cloud environments where such incidents are common. SLA compliance in the presence of such events requires the ability to quickly and dynamically resize infrastructure resources. Unfortunately, the distributed and stateful nature of data processing frameworks makes it challenging to accurately scale the system at run-time. In this paper, we present the design and implementation of a model-driven autoscaling solution for Hadoop clusters. We first develop novel gray-box performance models for Hadoop workloads that specifically relate job execution times to resource allocation and workload parameters. We then employ these models to dynamically determine the resources required to successfully complete the Hadoop jobs as per the user-specified SLA under various scenarios including node failures and multi-job executions. Our experimental results on three different Hadoop cloud clusters and across different workloads demonstrate the efficacy of our models and highlight their autoscaling capabilities.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"167 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133937593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enabling Enterprise-Class Workloads in the Cloud","authors":"V. Salapura, R. Mahindru","doi":"10.1109/IC2E.2016.41","DOIUrl":"https://doi.org/10.1109/IC2E.2016.41","url":null,"abstract":"Enterprise-level workloads - such as SAP and Oracle workloads - require infrastructure with high availability, clustering, or physical server appliances, features which are often not a part of a cloud offering. As a result, businesses are forced to run enterprise workloads in their legacy environments, and cannot take advantage of the cloud's flexibility, elasticity, and low cost. IBM Cloud Managed Services (CMS) cloud implements shared storage, clustering support, and private networks. These features effectively enable a large number of SAP and Oracle workloads to run in both virtualized and non-virtualized cloud environments. In this paper, we discuss a diverse set of enterprise applications implemented in the IBM CMS cloud.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127512785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Genetic Algorithm for Dynamic Cloud Application Brokerage","authors":"Lino Chamorro, Fabio Lopez Pires, B. Barán","doi":"10.1109/IC2E.2016.25","DOIUrl":"https://doi.org/10.1109/IC2E.2016.25","url":null,"abstract":"Cloud Service Brokers (CSBs) may abstract complex resource allocation decisions for efficiently mapping demands of tenants into offers of providers. Nowadays, both demands and offers could be considered in dynamic environments, representing particular challenges in cloud computing markets. This work studies a broker-oriented Virtual Machine Placement (VMP) in dynamic environments such as: (1) variable resource offers and (2) pricing, from providers and (3) dynamic requirements of tenants. A genetic algorithm is proposed for an efficient and scalable resolution of the considered problem. Experimental results demonstrate good quality of solutions obtained by the proposed algorithm when compared to a state-of-the-art Integer Linear Programming (ILP) algorithm. Additionally, experimental results also demonstrate the good level of scalability of the proposed algorithm for large instances of the considered problem.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116075768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Discovery Cloud: Accelerating and Democratizing Research on a Global Scale","authors":"Ian T Foster, K. Chard, S. Tuecke","doi":"10.1109/IC2E.2016.46","DOIUrl":"https://doi.org/10.1109/IC2E.2016.46","url":null,"abstract":"Modern science and engineering require increasingly sophisticated information technology (IT) for data analysis, simulation, and related tasks. Yet the small to medium laboratories (SMLs) in which the majority of research advances occur increasingly lack the human and financial capital needed to acquire and operate such IT. New methods are needed to provide all researchers with access to state-of-the-art scientific capabilities, regardless of their location and budget. Industry has demonstrated the value of cloud-hosted software-and platform-as-a-service approaches, small businesses that outsource their IT to third-party providers slash costs and accelerate innovation. However, few business cloud services are transferable to science. We thus propose the Discovery Cloud, an ecosystem of new, community-produced services to which SMLs can outsource common activities, from data management and analysis to collaboration and experiment automation. We explain the need for a Discovery Platform to streamline the creation and operation of new and interoperable services, and a Discovery Exchange to facilitate the use and sustainability of Discovery Cloud services. We report on our experiences building early elements of the Discovery Platform in the form of Globus services, and on the experiences of those who have applied those services in innovative applications.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130825951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiple Virtual Machines Live Migration Performance Modelling -- VMware vMotion Based Study","authors":"M. E. Elsaid, C. Meinel","doi":"10.1109/IC2E.2016.9","DOIUrl":"https://doi.org/10.1109/IC2E.2016.9","url":null,"abstract":"Live migration is one of the powerful features in virtual datacenters environment. Servers load balance, power saving and dynamic resource management techniques are all dependent on live migration feature in virtual datacenters. So it is important to study virtual machine live migration processes and analyze its performance impact on datacentres resources. In this research, the performance analysis for single and multiple virtual machines migration has led to getting empirical models that can be used for live migration overhead estimation and providing resource management techniques that are migration overhead aware.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131355189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shuang Wu, Bei Wang, Ce Yang, Qinming He, Jianhai Chen
{"title":"A Hot-Page Aware Hybrid-Copy Migration Method","authors":"Shuang Wu, Bei Wang, Ce Yang, Qinming He, Jianhai Chen","doi":"10.1109/IC2E.2016.27","DOIUrl":"https://doi.org/10.1109/IC2E.2016.27","url":null,"abstract":"Hybrid-copy migration is a practical mean in cloud computing for memory intensive workload. But it does not perform perfectly in fetching remote pages. Aiming at solving this problem, we present a hot-page hybrid-copy migration method. We design a hot-page capturer to find out hot-pages, and a hot-page syringe to push hot-pages into transmission queue. We present an evaluation called page fault interval time, to evaluate the performance of hybrid-copy migration. The experimental results show that our method can extend the free part in page fault interval time about 19.79%, reduce the amount of remote page faults about 9.6%, and finally improve the performance of hybrid-copy migration.","PeriodicalId":430893,"journal":{"name":"2016 IEEE International Conference on Cloud Engineering (IC2E)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126042595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}