The Effect of System Utilization on Application Performance Variability

Boyang Li, Sudheer Chunduri, K. Harms, Yuping Fan, Z. Lan
{"title":"The Effect of System Utilization on Application Performance Variability","authors":"Boyang Li, Sudheer Chunduri, K. Harms, Yuping Fan, Z. Lan","doi":"10.1145/3322789.3328743","DOIUrl":null,"url":null,"abstract":"Application performance variability caused by network contention is a major issue on dragonfly based systems. This work-in-progress study makes two contributions. First, we analyze real workload logs and conduct application experiments on the production system Theta at Argonne to evaluate application performance variability. We find a strong correlation between system utilization and performance variability where a high system utilization (e.g., above 95%) can cause up to 21% degradation in application performance. Next, driven by this key finding, we investigate a scheduling policy to mitigate workload interference by leveraging the fact that production systems often exhibit diurnal utilization behavior and not all users are in a hurry for job completion. Preliminary results show that this scheduling design is capable of improving system productivity (measured by scheduling makespan) as well as improving user-level scheduling metrics such as user wait time and job slowdown.","PeriodicalId":365438,"journal":{"name":"Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3322789.3328743","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

Abstract

Application performance variability caused by network contention is a major issue on dragonfly based systems. This work-in-progress study makes two contributions. First, we analyze real workload logs and conduct application experiments on the production system Theta at Argonne to evaluate application performance variability. We find a strong correlation between system utilization and performance variability where a high system utilization (e.g., above 95%) can cause up to 21% degradation in application performance. Next, driven by this key finding, we investigate a scheduling policy to mitigate workload interference by leveraging the fact that production systems often exhibit diurnal utilization behavior and not all users are in a hurry for job completion. Preliminary results show that this scheduling design is capable of improving system productivity (measured by scheduling makespan) as well as improving user-level scheduling metrics such as user wait time and job slowdown.
系统利用率对应用程序性能可变性的影响
在基于蜻蜓的系统中,由网络争用引起的应用程序性能变化是一个主要问题。这项正在进行的研究有两个贡献。首先,我们分析真实的工作负载日志,并在Argonne的生产系统Theta上进行应用程序实验,以评估应用程序性能的可变性。我们发现系统利用率和性能可变性之间存在很强的相关性,其中高系统利用率(例如,高于95%)可能导致应用程序性能下降高达21%。接下来,在这一关键发现的推动下,我们研究了一种调度策略,通过利用生产系统经常表现出每日使用行为以及并非所有用户都急于完成任务的事实来减轻工作负载干扰。初步结果表明,这种调度设计能够提高系统生产率(通过调度makespan来衡量),并改善用户级调度指标,如用户等待时间和作业速度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信