Multi-class latency-bounded Web services

V. Kanodia, E. Knightly
{"title":"Multi-class latency-bounded Web services","authors":"V. Kanodia, E. Knightly","doi":"10.1109/IWQOS.2000.847959","DOIUrl":null,"url":null,"abstract":"Two recent advances have resulted in significant improvements in Web server quality of service. First, both centralized and distributed Web servers can provide isolation among service classes by fairly distributing system resources. Second, session admission control can protect classes from performance degradation due to overload. The goal of this work is to design a general \"front-end\" algorithm that uses these two building blocks to support a new Web service model, namely, multi-class services which control response latencies to within pre-specified targets. Our key technique is to devise a general service abstraction to adaptively control not only the latency of a particular class, but also to assess the inter-class relationships. In this way, we capture the extent to which classes are isolated or share system resources (as determined by the server architecture and system internals) and hence their effects on each other's QoS. For example, if the server provides class isolation (i.e., a minimum fraction of system resources independent of other classes), yet also allows a class to utilize unused resources from other classes, the algorithm infers and exploits this behavior without an explicit low level model of the server. Thus, as new functionalities are incorporated into Web servers, the approach naturally exploits their properties to efficiently satisfy the classes' performance targets. We validate the scheme with trace driven simulations.","PeriodicalId":416650,"journal":{"name":"2000 Eighth International Workshop on Quality of Service. IWQoS 2000 (Cat. No.00EX400)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"78","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2000 Eighth International Workshop on Quality of Service. IWQoS 2000 (Cat. No.00EX400)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWQOS.2000.847959","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 78

Abstract

Two recent advances have resulted in significant improvements in Web server quality of service. First, both centralized and distributed Web servers can provide isolation among service classes by fairly distributing system resources. Second, session admission control can protect classes from performance degradation due to overload. The goal of this work is to design a general "front-end" algorithm that uses these two building blocks to support a new Web service model, namely, multi-class services which control response latencies to within pre-specified targets. Our key technique is to devise a general service abstraction to adaptively control not only the latency of a particular class, but also to assess the inter-class relationships. In this way, we capture the extent to which classes are isolated or share system resources (as determined by the server architecture and system internals) and hence their effects on each other's QoS. For example, if the server provides class isolation (i.e., a minimum fraction of system resources independent of other classes), yet also allows a class to utilize unused resources from other classes, the algorithm infers and exploits this behavior without an explicit low level model of the server. Thus, as new functionalities are incorporated into Web servers, the approach naturally exploits their properties to efficiently satisfy the classes' performance targets. We validate the scheme with trace driven simulations.
多类延迟受限的Web服务
最近的两项进展显著提高了Web服务器的服务质量。首先,集中式和分布式Web服务器都可以通过公平分配系统资源来提供服务类之间的隔离。其次,会话允许控制可以保护类不因过载而导致性能下降。这项工作的目标是设计一个通用的“前端”算法,该算法使用这两个构建块来支持新的Web服务模型,即控制对预先指定目标的响应延迟的多类服务。我们的关键技术是设计一个通用的服务抽象,不仅可以自适应地控制特定类的延迟,还可以评估类间的关系。通过这种方式,我们捕获了类被隔离或共享系统资源的程度(由服务器架构和系统内部决定),以及它们对彼此QoS的影响。例如,如果服务器提供类隔离(即,独立于其他类的系统资源的最小部分),但也允许类利用来自其他类的未使用的资源,则算法推断并利用这种行为,而无需显式的服务器低级模型。因此,随着新功能被合并到Web服务器中,该方法自然会利用它们的属性来有效地满足类的性能目标。我们通过跟踪驱动仿真验证了该方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信