File System Aging

Alex Conway, Ainesh Bakshi, Arghya Bhattacharya, Rory Bennett, Yizheng Jiao, Eric Knorr, Yang Zhan, Michael A. Bender, William Jannen, Rob Johnson, Bradley C. Kuszmaul, Donald E. Porter, Jun Yuan, Martin Farach-Colton
{"title":"File System Aging","authors":"Alex Conway, Ainesh Bakshi, Arghya Bhattacharya, Rory Bennett, Yizheng Jiao, Eric Knorr, Yang Zhan, Michael A. Bender, William Jannen, Rob Johnson, Bradley C. Kuszmaul, Donald E. Porter, Jun Yuan, Martin Farach-Colton","doi":"arxiv-2401.08858","DOIUrl":null,"url":null,"abstract":"File systems must allocate space for files without knowing what will be added\nor removed in the future. Over the life of a file system, this may cause\nsuboptimal file placement decisions that eventually lead to slower performance,\nor aging. Conventional wisdom suggests that file system aging is a solved\nproblem in the common case; heuristics to avoid aging, such as colocating\nrelated files and data blocks, are effective until a storage device fills up,\nat which point space pressure exacerbates fragmentation-based aging. However,\nthis article describes both realistic and synthetic workloads that can cause\nthese heuristics to fail, inducing large performance declines due to aging,\neven when the storage device is nearly empty. We argue that these slowdowns are caused by poor layout. We demonstrate a\ncorrelation between the read performance of a directory scan and the locality\nwithin a file system's access patterns, using a dynamic layout score. We\ncomplement these results with microbenchmarks that show that space pressure can\ncause a substantial amount of inter-file and intra-file fragmentation. However,\nour results suggest that the effect of free-space fragmentation on read\nperformance is best described as accelerating the file system aging process.\nThe effect on write performance is non-existent in some cases, and, in most\ncases, an order of magnitude smaller than the read degradation from\nfragmentation caused by normal usage. In short, many file systems are exquisitely prone to read aging after a\nvariety of write patterns. We show, however, that aging is not inevitable.\nBetrFS, a file system based on write-optimized dictionaries, exhibits almost no\naging in our experiments. We present a framework for understanding and\npredicting aging, and identify the key features of BetrFS that avoid aging.","PeriodicalId":501333,"journal":{"name":"arXiv - CS - Operating Systems","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Operating Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2401.08858","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

File systems must allocate space for files without knowing what will be added or removed in the future. Over the life of a file system, this may cause suboptimal file placement decisions that eventually lead to slower performance, or aging. Conventional wisdom suggests that file system aging is a solved problem in the common case; heuristics to avoid aging, such as colocating related files and data blocks, are effective until a storage device fills up, at which point space pressure exacerbates fragmentation-based aging. However, this article describes both realistic and synthetic workloads that can cause these heuristics to fail, inducing large performance declines due to aging, even when the storage device is nearly empty. We argue that these slowdowns are caused by poor layout. We demonstrate a correlation between the read performance of a directory scan and the locality within a file system's access patterns, using a dynamic layout score. We complement these results with microbenchmarks that show that space pressure can cause a substantial amount of inter-file and intra-file fragmentation. However, our results suggest that the effect of free-space fragmentation on read performance is best described as accelerating the file system aging process. The effect on write performance is non-existent in some cases, and, in most cases, an order of magnitude smaller than the read degradation from fragmentation caused by normal usage. In short, many file systems are exquisitely prone to read aging after a variety of write patterns. We show, however, that aging is not inevitable. BetrFS, a file system based on write-optimized dictionaries, exhibits almost no aging in our experiments. We present a framework for understanding and predicting aging, and identify the key features of BetrFS that avoid aging.
文件系统老化
文件系统必须为文件分配空间,而不知道将来会添加或删除什么文件。在文件系统的生命周期中,这可能会导致次优文件放置决策,最终导致性能降低或老化。传统观点认为,文件系统的老化在一般情况下是一个可以解决的问题;避免老化的启发式方法(如将相关文件和数据块同地放置)在存储设备填满之前是有效的,此时空间压力会加剧基于碎片的老化。然而,本文描述的现实工作负载和合成工作负载会导致这些启发式方法失效,甚至在存储设备几乎耗尽的情况下也会因老化而导致性能大幅下降。我们认为,这些性能下降是由于布局不当造成的。我们使用动态布局评分证明了目录扫描的读取性能与文件系统访问模式的本地性之间的相关性。我们使用微基准测试对这些结果进行了补充,结果表明空间压力会导致大量文件间和文件内碎片。然而,我们的结果表明,自由空间碎片对读取性能的影响最好的描述就是加速文件系统的老化过程。在某些情况下,自由空间碎片对写入性能的影响并不存在,在大多数情况下,自由空间碎片对写入性能的影响要比正常使用造成的碎片读取性能下降小一个数量级。简而言之,在各种写入模式下,许多文件系统都极易出现读老化。BetrFS 是一个基于写优化字典的文件系统,在我们的实验中几乎没有出现老化现象。我们提出了一个理解和预测老化的框架,并确定了 BetrFS 避免老化的关键特性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信