Albert K. Engstfeld, Johannes M. Hermann, Nicolas G. Hörmann, Julian Rüth
arXiv - CS - Databases, published 2024-09-11, doi: arxiv-2409.07083 (https://doi.org/arxiv-2409.07083)
echemdb Toolkit -- a Lightweight Approach to Getting Data Ready for Data Management Solutions
According to the FAIR (findability, accessibility, interoperability, and
reusability) principles, scientific data should always be stored with
machine-readable descriptive metadata. Existing solutions to store data with
metadata, such as electronic lab notebooks (ELN), are often very
domain-specific and not sufficiently generic for arbitrary experimental or
computational results. In this work, we present the open-source echemdb toolkit for creating and
handling data and metadata. The toolkit runs entirely at the file-system
level using a file-based approach, which facilitates integration with other
tools in a FAIR data life cycle and means that no complicated server setup is
required. This also makes the toolkit more accessible to the average researcher
since no understanding of more sophisticated database technologies is required. We showcase several aspects and applications of the toolkit: automatic
annotation of raw research data with human- and machine-readable metadata, data
conversion into standardised frictionless Data Packages, and an API for
exploring the data. We also present web frameworks for displaying the
data, using example data from research into energy conversion and storage.
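
As a rough illustration of the file-based idea described above, the following sketch stores a small table as a CSV file next to a JSON descriptor laid out like a minimal frictionless Data Package (a `resources` list whose entries carry `name`, `path`, and a `schema` with `fields`). It uses only the Python standard library and is not the echemdb API: the function names `write_data_package` and `load_data_package`, the custom `unit` and `metadata` properties, and all sample values are assumptions made for this example.

```python
import csv
import json
import tempfile
from pathlib import Path


def write_data_package(directory, name, rows, fields, metadata):
    """Write <name>.csv and a minimal Data Package descriptor <name>.json.

    Hypothetical helper for illustration; not part of the echemdb toolkit.
    """
    directory = Path(directory)
    csv_path = directory / f"{name}.csv"
    with csv_path.open("w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow([field["name"] for field in fields])  # header row
        writer.writerows(rows)

    descriptor = {
        "resources": [
            {
                "name": name,
                "path": csv_path.name,  # relative to the descriptor file
                "format": "csv",
                "schema": {"fields": fields},
                # Assumed custom property holding the descriptive metadata.
                "metadata": metadata,
            }
        ]
    }
    json_path = directory / f"{name}.json"
    json_path.write_text(json.dumps(descriptor, indent=2))
    return json_path


def load_data_package(json_path):
    """Read the descriptor and the CSV rows it points to."""
    json_path = Path(json_path)
    descriptor = json.loads(json_path.read_text())
    resource = descriptor["resources"][0]
    with (json_path.parent / resource["path"]).open(newline="") as f:
        rows = list(csv.DictReader(f))
    return resource, rows


if __name__ == "__main__":
    # Illustrative values only; "unit" is an assumed custom field property.
    fields = [
        {"name": "E", "type": "number", "unit": "V"},
        {"name": "j", "type": "number", "unit": "A / m2"},
    ]
    rows = [[0.0, 0.1], [0.1, 0.2]]
    metadata = {"description": "made-up example measurement"}

    with tempfile.TemporaryDirectory() as tmp:
        path = write_data_package(tmp, "example", rows, fields, metadata)
        resource, loaded = load_data_package(path)
        print(resource["metadata"], loaded)
```

Because both files are plain text on the file system, they can be versioned, copied, or read by other tools without any server or database setup, which is the property the abstract emphasises.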