Njoud O. Al-Maaitah , Javier Garcia-Blas , Genaro Sanchez-Gallegos , Jesus Carretero , Marc-André Vef , André Brinkmann
{"title":"A comparative study of ad-hoc file systems for extreme scale computing","authors":"Njoud O. Al-Maaitah , Javier Garcia-Blas , Genaro Sanchez-Gallegos , Jesus Carretero , Marc-André Vef , André Brinkmann","doi":"10.1016/j.future.2025.107815","DOIUrl":null,"url":null,"abstract":"<div><div>High-performance computing (HPC) systems often suffer from interference caused by multiple applications accessing a shared parallel file system, which can negatively impact compute performance. One solution to this problem is to add new tiers to the HPC storage hierarchy that can absorb I/O bursts and support moving data between tiers based on its hotness. Ad-hoc file systems serve as an intermediate storage layer that leverages new storage technologies, such as non-volatile random access memory devices and flash-based solid state drives, to provide temporary storage based on application behavior in the HPC environment. A variety of ad-hoc file systems have been proposed recently. In this survey, we will explore the integration of fast storage layers into HPC storage hierarchies. We will examine various ad-hoc file systems highlighting their features and functionalities to categorize the proposed solutions into different groups.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"170 ","pages":"Article 107815"},"PeriodicalIF":6.2000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25001104","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
High-performance computing (HPC) systems often suffer from interference caused by multiple applications accessing a shared parallel file system, which can negatively impact compute performance. One solution to this problem is to add new tiers to the HPC storage hierarchy that can absorb I/O bursts and support moving data between tiers based on its hotness. Ad-hoc file systems serve as an intermediate storage layer that leverages new storage technologies, such as non-volatile random access memory devices and flash-based solid state drives, to provide temporary storage based on application behavior in the HPC environment. A variety of ad-hoc file systems have been proposed recently. In this survey, we will explore the integration of fast storage layers into HPC storage hierarchies. We will examine various ad-hoc file systems highlighting their features and functionalities to categorize the proposed solutions into different groups.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.