Proceedings of the 8th Parallel Data Storage Workshop最新文献

Asynchronous object storage with QoS for scientific and commercial big data 面向科学大数据和商业大数据的具有QoS的异步对象存储

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542.2538565

Michael J. Brim, D. Dillow, S. Oral, B. Settlemyer, Feiyi Wang

引用次数: 16

SDS: a framework for scientific data services SDS:科学数据服务的框架

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542.2538563

Bin Dong, S. Byna, Kesheng Wu

{"title":"SDS: a framework for scientific data services","authors":"Bin Dong, S. Byna, Kesheng Wu","doi":"10.1145/2538542.2538563","DOIUrl":"https://doi.org/10.1145/2538542.2538563","url":null,"abstract":"Large-scale scientific applications typically write their data to parallel file systems with organizations designed to achieve fast write speeds. Analysis tasks frequently read the data in a pattern that is different from the write pattern, and therefore experience poor I/O performance. In this paper, we introduce a prototype framework for bridging the performance gap between write and read stages of data access from parallel file systems. We call this framework Scientific Data Services, or SDS for short. This initial implementation of SDS focuses on reorganizing previously written files into data layouts that benefit read patterns, and transparently directs read calls to the reorganized data. SDS follows a client-server architecture. The SDS Server manages partial or full replicas of reorganized datasets and serves SDS Clients' requests for data. The current version of the SDS client library supports HDF5 programming interface for reading data. The client library intercepts HDF5 calls using the HDF5 Virtual Object Layer (VOL) and transparently redirects them to the reorganized data. The SDS client library also provides a querying interface for reading part of the data based on user-specified selective criteria. We describe the design and implementation of the SDS client-server architecture, and evaluate the response time of the SDS Server and the performance benefits of SDS.","PeriodicalId":250653,"journal":{"name":"Proceedings of the 8th Parallel Data Storage Workshop","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122744543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 31

Proceedings of the 8th Parallel Data Storage Workshop 第八届并行数据存储研讨会论文集

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542

Dean Hildebrand, K. Schwan

引用次数: 0

Structuring PLFS for extensibility 为可扩展性构建PLFS

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542.2538564

C. Cranor, Milo Polte, Garth A. Gibson

引用次数: 8

Performance and scalability evaluation of the Ceph parallel file system Ceph并行文件系统的性能和可伸缩性评估

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542.2538562

Feiyi Wang, M. Nelson, S. Oral, S. Atchley, S. Weil, B. Settlemyer, Blake Caldwell, Jason Hill

引用次数: 25

Active data: a data-centric approach to data life-cycle management 活动数据:以数据为中心的数据生命周期管理方法

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542.2538566

Anthony Simonet, G. Fedak, M. Ripeanu, S. Al-Kiswany

引用次数: 6

Efficient transactions for parallel data movement 并行数据移动的高效事务

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542.2538567

J. Lofstead, Jai Dayal, I. Jimenez, C. Maltzahn

引用次数: 7

Fourier-assisted machine learning of hard disk drive access time models 硬盘访问时间模型的傅里叶辅助机器学习

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-11-17 DOI: 10.1145/2538542.2538561

A. Crume, C. Maltzahn, L. Ward, Thomas M. Kroeger, M. Curry, R. Oldfield

引用次数: 4

Predicting intermediate storage performance for workflow applications 预测工作流应用程序的中间存储性能

Proceedings of the 8th Parallel Data Storage Workshop Pub Date : 2013-02-19 DOI: 10.1145/2538542.2538560

L. Costa, S. Al-Kiswany, A. Barros, Hao Yang, M. Ripeanu

引用次数: 5