{"title":"pWebDAV: A Multi-Tier Storage System","authors":"Christos Filippidis, Y. Cotronis","doi":"10.1109/PDP2018.2018.00108","DOIUrl":null,"url":null,"abstract":"Experiments using the Large Hadron Collider (LHC) currently generate tens of petabytes of reduced data per year, observational and simulation data in the climate domain is expected to reach eXabytes by 2021, and light source experiments are expected to generate hundreds of terabytes per day. At such extreme scale, the substantial amount of concurrency can cause critical contention issue of the I/O system. This study introduces pWebDAV as a heterogeneous, multi-tier storage system. pWebDAV proposes a dynamically coordinated I/O architecture offering overall data flow solutions (remote-local access). The fundamental idea is to implement, for each data transfer, I/O policies on the fly. pWebDAV controls all I/O nodes, participating in the data transfer, directly regardless of the tier. pWebDAV approach can fully utilize the provided I/O & network resources and is able to minimize disk and network contention. The focus in this study is the Metadata node scalability performance.","PeriodicalId":333367,"journal":{"name":"2018 26th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 26th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDP2018.2018.00108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Experiments using the Large Hadron Collider (LHC) currently generate tens of petabytes of reduced data per year, observational and simulation data in the climate domain is expected to reach eXabytes by 2021, and light source experiments are expected to generate hundreds of terabytes per day. At such extreme scale, the substantial amount of concurrency can cause critical contention issue of the I/O system. This study introduces pWebDAV as a heterogeneous, multi-tier storage system. pWebDAV proposes a dynamically coordinated I/O architecture offering overall data flow solutions (remote-local access). The fundamental idea is to implement, for each data transfer, I/O policies on the fly. pWebDAV controls all I/O nodes, participating in the data transfer, directly regardless of the tier. pWebDAV approach can fully utilize the provided I/O & network resources and is able to minimize disk and network contention. The focus in this study is the Metadata node scalability performance.