{"title":"Data Management Practices on Large-Scale Lustre Scratch File Systems","authors":"Gary L. Rogers, Jesse Hanley, Rick Mohr","doi":"10.1145/2616498.2616545","DOIUrl":null,"url":null,"abstract":"Managing large-scale Lustre scratch file systems is a necessity on shared storage resources. The lack of proper data management can halt all computation that is contingent upon any type of output to disk. Lustre scratch areas are typically provided for users to utilize as high-performance temporary space to stage job input data and store job output data. Common techniques for managing the amount of stored temporary data involve the use of file system quotas or the enforcement of a purge policy that limits how long files can reside in the scratch space. This paper reviews the challenges of balancing usability from the end user's perspective and the administration of these scratch areas, along with the use of tools to decrease the overall amount of time required for file purges on large-scale Lustre systems.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"32 1","pages":"36:1-36:6"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2616498.2616545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Managing large-scale Lustre scratch file systems is a necessity on shared storage resources. The lack of proper data management can halt all computation that is contingent upon any type of output to disk. Lustre scratch areas are typically provided for users to utilize as high-performance temporary space to stage job input data and store job output data. Common techniques for managing the amount of stored temporary data involve the use of file system quotas or the enforcement of a purge policy that limits how long files can reside in the scratch space. This paper reviews the challenges of balancing usability from the end user's perspective and the administration of these scratch areas, along with the use of tools to decrease the overall amount of time required for file purges on large-scale Lustre systems.