Richard Grunzke, Volker Hartmann, T. Jejkal, H. Kollai, C. Dressler, Julia Dolhoff, Julia Stanek, H. Herold, A. Hoffmann, R. Müller-Pfefferkorn, Torsten Schrade, S. Herres‐Pawlis, G. Meinel, W. Nagel
Title: Performance Evaluation of the Metadata-Driven MASi Research Data Management Repository Service
Published in: 2018 26th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP)
Publication date: 2018-03-21
DOI: 10.1109/PDP2018.2018.00059
Citations: 0
Abstract
Research data is increasingly important for gaining scientific insights. To foster this, research data management has to be usable, customizable, and fast. We enable this by building the MASi research data management repository service on top of the KIT DM framework. The aim is to use a single repository instance to serve multiple arbitrary community use cases. Because the data characteristics of these use cases are diverse, the MASi service has to perform well across all of them. We evaluate the performance along three initial heterogeneous use cases. Several aspects are investigated: first, the object insertion and query performance of the database as a function of the object fill level; second and third, the ingest and download performance of digital objects using real-life data sets. The results show highly favorable performance characteristics.