ACM Transactions on Storage (TOS)最新文献_第10页

ACM Transactions on Storage (TOS) Pub Date : 2018-10-03 DOI: 10.1145/3242091

Y. Won, Joontaek Oh, Jaemin Jung, Gyeongyeol Choi, Seongbae Son, J. Hwang, Sangyeun Cho

引用次数: 1

ACM Transactions on Storage (TOS) Pub Date : 2018-10-03 DOI: 10.1145/3242086

Haryadi S. Gunawi, Riza O. Suminto, R. Sears, Casey Golliher, S. Sundararaman, Xing Lin, Tim Emami, Weiguang Sheng, N. Bidokhti, C. McCaffrey, Deepthi Srinivasan, Biswaranjan Panda, A. Baptist, G. Grider, P. Fields, K. Harms, R. Ross, Andree Jacobson, R. Ricci, Kirk Webb, P. Alvaro, H. Runesha, M. Hao, Huaicheng Li

引用次数: 45

M-CLOCK M-CLOCK

ACM Transactions on Storage (TOS) Pub Date : 2018-10-03 DOI: 10.1145/3216730

Minhoe Lee, Donghyun Kang, Y. Eom

引用次数: 3

Protocol-Aware Recovery for Consensus-Based Distributed Storage 基于共识的分布式存储协议感知恢复

ACM Transactions on Storage (TOS) Pub Date : 2018-10-03 DOI: 10.1145/3241062

R. Alagappan, Aishwarya Ganesan, Eric Lee, Aws Albarghouthi, Vijay Chidambaram, A. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

引用次数: 7

Lerna

ACM Transactions on Storage (TOS) Pub Date : 2018-06-04 DOI: 10.1145/3310368

Mohamed M. Saad, R. Palmieri, B. Ravindran

引用次数: 1

REGISTOR 暂存器

ACM Transactions on Storage (TOS) Pub Date : 2018-06-04 DOI: 10.1145/3310149

Shuyi Pei, Jing Yang, Qing Yang

{"title":"REGISTOR","authors":"Shuyi Pei, Jing Yang, Qing Yang","doi":"10.1145/3310149","DOIUrl":"https://doi.org/10.1145/3310149","url":null,"abstract":"This article presents REGISTOR, a platform for regular expression grabbing inside storage. The main idea of Registor is accelerating regular expression (regex) search inside storage where large data set is stored, eliminating the I/O bottleneck problem. A special hardware engine for regex search is designed and augmented inside a flash SSD that processes data on-the-fly during data transmission from NAND flash to host. To make the speed of regex search match the internal bus speed of a modern SSD, a deep pipeline structure is designed in Registor hardware consisting of a file semantics extractor, matching candidates finder, regex matching units (REMUs), and results organizer. Furthermore, each stage of the pipeline makes the use of maximal parallelism possible. To make Registor readily usable by high-level applications, we have developed a set of APIs and libraries in Linux allowing Registor to process files in the SSD by recombining separate data blocks into files efficiently. A working prototype of Registor has been built in our newly designed NVMe-SSD. Extensive experiments and analyses have been carried out to show that Registor achieves high throughput, reduces the I/O bandwidth requirement by up to 97%, and reduces CPU utilization by as much as 82% for regex search in large datasets.","PeriodicalId":273014,"journal":{"name":"ACM Transactions on Storage (TOS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116670237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Cluster and Single-Node Analysis of Long-Term Deduplication Patterns 长期重复数据删除模式的集群和单机分析

ACM Transactions on Storage (TOS) Pub Date : 2018-05-11 DOI: 10.1145/3183890

Zhen Sun, G. Kuenning, Sonam Mandal, Philip Shilane, Vasily Tarasov, Nong Xiao, E. Zadok

{"title":"Cluster and Single-Node Analysis of Long-Term Deduplication Patterns","authors":"Zhen Sun, G. Kuenning, Sonam Mandal, Philip Shilane, Vasily Tarasov, Nong Xiao, E. Zadok","doi":"10.1145/3183890","DOIUrl":"https://doi.org/10.1145/3183890","url":null,"abstract":"Deduplication has become essential in disk-based backup systems, but there have been few long-term studies of backup workloads. Most past studies either were of a small static snapshot or covered only a short period that was not representative of how a backup system evolves over time. For this article, we first collected 21 months of data from a shared user file system; 33 users and over 4,000 snapshots are covered. We then analyzed the dataset, examining a variety of essential characteristics across two dimensions: single-node deduplication and cluster deduplication. For single-node deduplication analysis, our primary focus was individual-user data. Despite apparently similar roles and behavior among all of our users, we found significant differences in their deduplication ratios. Moreover, the data that some users share with others had a much higher deduplication ratio than average. For cluster deduplication analysis, we implemented seven published data-routing algorithms and created a detailed comparison of their performance with respect to deduplication ratio, load distribution, and communication overhead. We found that per-file routing achieves a higher deduplication ratio than routing by super-chunk (multiple consecutive chunks), but it also leads to high data skew (imbalance of space usage across nodes). We also found that large chunking sizes are better for cluster deduplication, as they significantly reduce data-routing overhead, while their negative impact on deduplication ratios is small and acceptable. We draw interesting conclusions from both single-node and cluster deduplication analysis and make recommendations for future deduplication systems design.","PeriodicalId":273014,"journal":{"name":"ACM Transactions on Storage (TOS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127644451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Empirical Evaluation and Enhancement of Enterprise Storage System Request Scheduling 企业存储系统请求调度的实证评价与改进

ACM Transactions on Storage (TOS) Pub Date : 2018-04-27 DOI: 10.1145/3193741

Deng Zhou, Vania Fang, T. Xie, Wen Pan, R. Kesavan, Tony Lin, N. Patel

{"title":"Empirical Evaluation and Enhancement of Enterprise Storage System Request Scheduling","authors":"Deng Zhou, Vania Fang, T. Xie, Wen Pan, R. Kesavan, Tony Lin, N. Patel","doi":"10.1145/3193741","DOIUrl":"https://doi.org/10.1145/3193741","url":null,"abstract":"Since little has been reported in the literature concerning enterprise storage system file-level request scheduling, we do not have enough knowledge about how various scheduling factors affect performance. Moreover, we are in lack of a good understanding on how to enhance request scheduling to adapt to the changing characteristics of workloads and hardware resources. To answer these questions, we first build a request scheduler prototype based on WAFL®, a mainstream file system running on numerous enterprise storage systems worldwide. Next, we use the prototype to quantitatively measure the impact of various scheduling configurations on performance on a NetApp®'s enterprise-class storage system. Several observations have been made. For example, we discover that in order to improve performance, the priority of write requests and non-preempted restarted requests should be boosted in some workloads. Inspired by these observations, we further propose two scheduling enhancement heuristics called SORD (size-oriented request dispatching) and QATS (queue-depth aware time slicing). Finally, we evaluate them by conducting a wide range of experiments using workloads generated by SPC-1 and SFS2014 on both HDD-based and all-flash platforms. Experimental results show that the combination of the two can noticeably reduce average request latency under some workloads.","PeriodicalId":273014,"journal":{"name":"ACM Transactions on Storage (TOS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122163306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fast Miss Ratio Curve Modeling for Storage Cache 存储缓存快速缺失率曲线建模

ACM Transactions on Storage (TOS) Pub Date : 2018-04-12 DOI: 10.1145/3185751

Xiameng Hu, Xiaolin Wang, Lan Zhou, Yingwei Luo, Zhenlin Wang, C. Ding, Chencheng Ye

引用次数: 29

Workload Characterization for Enterprise Disk Drives 企业磁盘驱动器的工作负载表征

ACM Transactions on Storage (TOS) Pub Date : 2018-04-12 DOI: 10.1145/3151847

A. Kashyap

引用次数: 5