Learning Latest Private-Cluster-State to Improve the Performance of Sample-Based Cluster Scheduling

Yawen Wang, Qing Wang
{"title":"Learning Latest Private-Cluster-State to Improve the Performance of Sample-Based Cluster Scheduling","authors":"Yawen Wang, Qing Wang","doi":"10.1109/DSA.2019.00014","DOIUrl":null,"url":null,"abstract":"Sample based cluster scheduling is considered promising for its high-scalability and low-latency. Its major limitation, on the other hand, is its very limited view of cluster resource state. The limitation confines both its decision precision and the support towards many important scheduling features. There have been several approaches to solve this limitation, yet these works are mostly high-cost solutions that use either extra communication or system component to collect more resource information, which damage the scalability and latency of sample based cluster scheduling. In this paper, we propose L-PCS, a novel learning-based approach based on latest private-cluster-state to generate a relatively accurate knowledge of global cluster state. L-PCS gathers and learns process data of schedulers and predicts a more precise approximation of real-time cluster state for each scheduler. It is a dynamic model updated through time for time-validity. The results predicted by trained model serve as references when schedulers make scheduling decisions. Experiment shows that comparing to sample based schedulers without such learning mechanism, L-PCS improves mean absolute error by 2 × to 3 × and gang scheduling results show a maximum increase of 10.1% to 25.09%.","PeriodicalId":342719,"journal":{"name":"2019 6th International Conference on Dependable Systems and Their Applications (DSA)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 6th International Conference on Dependable Systems and Their Applications (DSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSA.2019.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Sample based cluster scheduling is considered promising for its high-scalability and low-latency. Its major limitation, on the other hand, is its very limited view of cluster resource state. The limitation confines both its decision precision and the support towards many important scheduling features. There have been several approaches to solve this limitation, yet these works are mostly high-cost solutions that use either extra communication or system component to collect more resource information, which damage the scalability and latency of sample based cluster scheduling. In this paper, we propose L-PCS, a novel learning-based approach based on latest private-cluster-state to generate a relatively accurate knowledge of global cluster state. L-PCS gathers and learns process data of schedulers and predicts a more precise approximation of real-time cluster state for each scheduler. It is a dynamic model updated through time for time-validity. The results predicted by trained model serve as references when schedulers make scheduling decisions. Experiment shows that comparing to sample based schedulers without such learning mechanism, L-PCS improves mean absolute error by 2 × to 3 × and gang scheduling results show a maximum increase of 10.1% to 25.09%.
学习最新私有集群状态以提高基于样本的集群调度性能
基于样本的集群调度因其高可伸缩性和低延迟而被认为是有前途的。另一方面,它的主要限制是它对集群资源状态的视图非常有限。这种限制限制了它的决策精度和对许多重要调度特征的支持。有几种方法可以解决这一限制,但这些工作大多是高成本的解决方案,使用额外的通信或系统组件来收集更多的资源信息,这会损害基于样本的集群调度的可伸缩性和延迟。在本文中,我们提出了一种新的基于学习的方法L-PCS,它基于最新的私有集群状态来生成相对准确的全局集群状态知识。L-PCS收集和学习调度程序的进程数据,并为每个调度程序预测更精确的实时集群状态近似值。它是一个动态模型,随着时间的推移而更新,以保证时间有效性。训练后的模型预测结果可作为调度决策的参考。实验表明,与没有这种学习机制的基于样本的调度程序相比,L-PCS的平均绝对误差提高了2 ~ 3倍,组调度结果最大提高了10.1% ~ 25.09%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信