Prefetching in segmented disk cache for multi-disk systems

Workshop on I/O in Parallel and Distributed Systems Pub Date : 1996-05-27 DOI:10.1145/236017.236037

V. Soloviev

{"title":"Prefetching in segmented disk cache for multi-disk systems","authors":"V. Soloviev","doi":"10.1145/236017.236037","DOIUrl":null,"url":null,"abstract":"This paper investigates the performance of a multi-disk storage system equipped with a segmented disk cache processing a workload of multiple relational scans. Prefetching is a popular method of improving the performance of scans. Many modern disks have a multisegment cache which can be used for prefetching. We observe that, exploiting declustering as a data placement method, prefetching in a segmented cache causes a load imbalance among several disks. A single disk becomes a bottleneck, degrading performance of the entire system. A variation in disk queue length is a primary factor of the imbalance. Using a precise simulation model, we investigate several approaches to achieving better balancing. Our metrics are a scan response time for the closed-end system and an ability to sustain a workload without saturating for the open-end system. We arrive at two main conclusions: (1) Prefetching in main memory is inexpensive and effective for balancing and can supplement or substitute prefetching in disk cache. (2) Disk-level prefetching provides about the same performance as main memory prefetching if request queues are managed in the disk controllers rather than in the host. Checking the disk cache before queuing requests provides not only better request response time but also drastically improves balancing. A single cache performs better than a segmented cache for this method.","PeriodicalId":442608,"journal":{"name":"Workshop on I/O in Parallel and Distributed Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on I/O in Parallel and Distributed Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/236017.236037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

This paper investigates the performance of a multi-disk storage system equipped with a segmented disk cache processing a workload of multiple relational scans. Prefetching is a popular method of improving the performance of scans. Many modern disks have a multisegment cache which can be used for prefetching. We observe that, exploiting declustering as a data placement method, prefetching in a segmented cache causes a load imbalance among several disks. A single disk becomes a bottleneck, degrading performance of the entire system. A variation in disk queue length is a primary factor of the imbalance. Using a precise simulation model, we investigate several approaches to achieving better balancing. Our metrics are a scan response time for the closed-end system and an ability to sustain a workload without saturating for the open-end system. We arrive at two main conclusions: (1) Prefetching in main memory is inexpensive and effective for balancing and can supplement or substitute prefetching in disk cache. (2) Disk-level prefetching provides about the same performance as main memory prefetching if request queues are managed in the disk controllers rather than in the host. Checking the disk cache before queuing requests provides not only better request response time but also drastically improves balancing. A single cache performs better than a segmented cache for this method.

查看原文本刊更多论文

多磁盘系统分段磁盘缓存中的预取

本文研究了配备分段磁盘缓存的多磁盘存储系统处理多个关系扫描工作负载的性能。预取是一种常用的提高扫描性能的方法。许多现代磁盘都有多段缓存，可用于预取。我们观察到，利用聚类作为数据放置方法，在分段缓存中预取会导致多个磁盘之间的负载不平衡。单个磁盘成为瓶颈，降低整个系统的性能。磁盘队列长度的变化是不平衡的主要因素。使用精确的仿真模型，我们研究了几种实现更好平衡的方法。我们的指标是封闭端系统的扫描响应时间和开放端系统维持工作负载而不饱和的能力。我们得出两个主要结论:(1)主存预取对于平衡来说成本低廉且有效，可以补充或替代磁盘缓存中的预取。(2)如果请求队列在磁盘控制器中而不是在主机中进行管理，则磁盘级预取提供与主内存预取大致相同的性能。在排队请求之前检查磁盘缓存不仅可以提供更好的请求响应时间，还可以极大地改善平衡。对于这种方法，单个缓存的性能优于分段缓存。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Workshop on I/O in Parallel and Distributed Systems

自引率

0.00%

发文量