Chandrika Prasad , Geetha J Dr , Vinay M , Manoj V L , Bharath Vamsi , D. Malavika Dileepa
{"title":"A pyDAOS Approach for Enabling Efficient Data Handling in DAOS","authors":"Chandrika Prasad , Geetha J Dr , Vinay M , Manoj V L , Bharath Vamsi , D. Malavika Dileepa","doi":"10.1016/j.procs.2025.01.053","DOIUrl":null,"url":null,"abstract":"<div><div>High-Performance Computing (HPC) systems require efficient storage solutions to handle vast amounts of data. Distributed Asynchronous Object Storage (DAOS) is designed for such environments, offering scalable and high performance storage. Our study initially explores the capabilities and performance of DAOS using pyDAOS when integrated with a key-value store (KV store). The existing methodologies are analyzed to identify inefficiencies, particularly in handling large I/O operations. This paper proposes an enhanced approach that incorporates chunking for large I/O operations, automated resource selection, and simplified system administration via server discovery and by fetching configuration details. This results show significant improvements in efficiency, scalability, and usability of DAOS using pyDAOS, making it a robust solution for modern data intensive applications.</div></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"252 ","pages":"Pages 922-933"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877050925000535","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
High-Performance Computing (HPC) systems require efficient storage solutions to handle vast amounts of data. Distributed Asynchronous Object Storage (DAOS) is designed for such environments, offering scalable and high performance storage. Our study initially explores the capabilities and performance of DAOS using pyDAOS when integrated with a key-value store (KV store). The existing methodologies are analyzed to identify inefficiencies, particularly in handling large I/O operations. This paper proposes an enhanced approach that incorporates chunking for large I/O operations, automated resource selection, and simplified system administration via server discovery and by fetching configuration details. This results show significant improvements in efficiency, scalability, and usability of DAOS using pyDAOS, making it a robust solution for modern data intensive applications.