Hongbin Yang, Xianyang Liu, Shenbo Chen, Zhou Lei, Hongguang Du, C. Zhu
{"title":"在异构环境中使用MPTE改进Spark性能","authors":"Hongbin Yang, Xianyang Liu, Shenbo Chen, Zhou Lei, Hongguang Du, C. Zhu","doi":"10.1109/ICALIP.2016.7846627","DOIUrl":null,"url":null,"abstract":"Spark has become the first choice of distributed computing framework for big data processing. The biggest highlight is the use of in-memory computations on large clusters, which is suitable for iterative computing and interactive computing. However, the straggler machines can seriously affect their performance. The current approach of Spark is speculative execution which selects the slow tasks and resubmit them, but there are two deficiencies: Firstly, it directly uses the median time to judge whether the task is abnormal, this may be misleading in reality; Secondly, the backup tasks are directly added to the task queue without taking into account the presence of straggler machines. These deficiencies will further extend the execution time of a job. Therefore, we design a improved speculative strategy, Multiple Phases Time Estimation (MPTE), which greatly reduces the impact of straggler machines. In MPTE, we use the remaining time estimated based on multiple phases to select slow tasks, and we improve the task scheduler for backup tasks scheduling. Experiment results show that MPTE can improve the accuracy of determining if should run a speculative copy for a task by about 20% compared to Spark native scheduler.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Improving Spark performance with MPTE in heterogeneous environments\",\"authors\":\"Hongbin Yang, Xianyang Liu, Shenbo Chen, Zhou Lei, Hongguang Du, C. Zhu\",\"doi\":\"10.1109/ICALIP.2016.7846627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spark has become the first choice of distributed computing framework for big data processing. The biggest highlight is the use of in-memory computations on large clusters, which is suitable for iterative computing and interactive computing. However, the straggler machines can seriously affect their performance. The current approach of Spark is speculative execution which selects the slow tasks and resubmit them, but there are two deficiencies: Firstly, it directly uses the median time to judge whether the task is abnormal, this may be misleading in reality; Secondly, the backup tasks are directly added to the task queue without taking into account the presence of straggler machines. These deficiencies will further extend the execution time of a job. Therefore, we design a improved speculative strategy, Multiple Phases Time Estimation (MPTE), which greatly reduces the impact of straggler machines. In MPTE, we use the remaining time estimated based on multiple phases to select slow tasks, and we improve the task scheduler for backup tasks scheduling. Experiment results show that MPTE can improve the accuracy of determining if should run a speculative copy for a task by about 20% compared to Spark native scheduler.\",\"PeriodicalId\":184170,\"journal\":{\"name\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICALIP.2016.7846627\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICALIP.2016.7846627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Spark performance with MPTE in heterogeneous environments
Spark has become the first choice of distributed computing framework for big data processing. The biggest highlight is the use of in-memory computations on large clusters, which is suitable for iterative computing and interactive computing. However, the straggler machines can seriously affect their performance. The current approach of Spark is speculative execution which selects the slow tasks and resubmit them, but there are two deficiencies: Firstly, it directly uses the median time to judge whether the task is abnormal, this may be misleading in reality; Secondly, the backup tasks are directly added to the task queue without taking into account the presence of straggler machines. These deficiencies will further extend the execution time of a job. Therefore, we design a improved speculative strategy, Multiple Phases Time Estimation (MPTE), which greatly reduces the impact of straggler machines. In MPTE, we use the remaining time estimated based on multiple phases to select slow tasks, and we improve the task scheduler for backup tasks scheduling. Experiment results show that MPTE can improve the accuracy of determining if should run a speculative copy for a task by about 20% compared to Spark native scheduler.