{"title":"LiPS: A Cost-Efficient Data and Task Co-Scheduler for MapReduce","authors":"M. Ehsan, R. Sion","doi":"10.1109/IPDPSW.2013.175","DOIUrl":null,"url":null,"abstract":"We introduce LiPS, a new cost-efficient data and task co-scheduler for MapReduce in a cloud environment. LiPS allows flexible control of job make spans, multi-resource management, and fairness. By using linear programming to simultaneously co-schedule data and tasks, LiPS helps to achieve minimized dollar cost globally. We evaluated LiPS both analytically and on Amazon EC2 in order to measure actual dollar charges. The results were significant; LiPS saved 58-79% of the dollar costs when compared with the Hadoop default scheduler, while also allowing users to fine-tune the cost-performance tradeoff.","PeriodicalId":234552,"journal":{"name":"2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW.2013.175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
We introduce LiPS, a new cost-efficient data and task co-scheduler for MapReduce in a cloud environment. LiPS allows flexible control of job make spans, multi-resource management, and fairness. By using linear programming to simultaneously co-schedule data and tasks, LiPS helps to achieve minimized dollar cost globally. We evaluated LiPS both analytically and on Amazon EC2 in order to measure actual dollar charges. The results were significant; LiPS saved 58-79% of the dollar costs when compared with the Hadoop default scheduler, while also allowing users to fine-tune the cost-performance tradeoff.