Global overview of research data repositories: an analysis of re3data registry

IF 2.1 Q2 INFORMATION SCIENCE & LIBRARY SCIENCE
A. Khan, Fayaz Ahmad Loan, Umer Yousuf Parray, Sozia Rashid
{"title":"Global overview of research data repositories: an analysis of re3data registry","authors":"A. Khan, Fayaz Ahmad Loan, Umer Yousuf Parray, Sozia Rashid","doi":"10.1108/idd-07-2022-0069","DOIUrl":null,"url":null,"abstract":"\nPurpose\nData sharing is increasingly being recognized as an essential component of scholarly research and publishing. Sharing data improves results and propels research and discovery forward. Given the importance of data sharing, the purpose of the study is to unveil the present scenario of research data repositories (RDR) and sheds light on strategies and tactics followed by different countries for efficient organization and optimal use of scientific literature.\n\n\nDesign/methodology/approach\nThe data for the study is collected from registry of RDR (re3data registry) (re3data.org), which covers RDR from different academic disciplines and provides filtration options “Search” and “Browse” to access the repositories. Using these filtration options, the researchers collected metadata of repositories i.e. country wise contribution, content-type data, repository language interface, software usage, metadata standards and data access type. Furthermore, the data was exported to Google Sheets for analysis and visualization.\n\n\nFindings\nThe re3data registry holds a rich and diverse collection of data repositories from the majority of countries all over the world. It is revealed that English is the dominant language, and the most widely used software for the creation of data repositories are “DataVerse”, followed by “Dspace” and “MySQL”. The most frequently used metadata standards are “Dublin Core” and “Datacite metadata schema”. The majority of repositories are open, with more than half of the repositories being “disciplinary” in nature, and the most significant data sources include “scientific and statistical data” followed by “standard office documents”.\n\n\nResearch limitations/implications\nThe main limitation of the study is that the findings are based on the data collected through a single registry of repositories, and only a few characteristic features were investigated.\n\n\nOriginality/value\nThe study will benefit all countries with a small number of data repositories or no repositories at all, with tools and techniques used by the top repositories to ensure long-term storage and accessibility to research data. In addition to this, the study provides a global overview of RDR and its characteristic features.\n","PeriodicalId":43488,"journal":{"name":"Information Discovery and Delivery","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2023-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Discovery and Delivery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/idd-07-2022-0069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose Data sharing is increasingly being recognized as an essential component of scholarly research and publishing. Sharing data improves results and propels research and discovery forward. Given the importance of data sharing, the purpose of the study is to unveil the present scenario of research data repositories (RDR) and sheds light on strategies and tactics followed by different countries for efficient organization and optimal use of scientific literature. Design/methodology/approach The data for the study is collected from registry of RDR (re3data registry) (re3data.org), which covers RDR from different academic disciplines and provides filtration options “Search” and “Browse” to access the repositories. Using these filtration options, the researchers collected metadata of repositories i.e. country wise contribution, content-type data, repository language interface, software usage, metadata standards and data access type. Furthermore, the data was exported to Google Sheets for analysis and visualization. Findings The re3data registry holds a rich and diverse collection of data repositories from the majority of countries all over the world. It is revealed that English is the dominant language, and the most widely used software for the creation of data repositories are “DataVerse”, followed by “Dspace” and “MySQL”. The most frequently used metadata standards are “Dublin Core” and “Datacite metadata schema”. The majority of repositories are open, with more than half of the repositories being “disciplinary” in nature, and the most significant data sources include “scientific and statistical data” followed by “standard office documents”. Research limitations/implications The main limitation of the study is that the findings are based on the data collected through a single registry of repositories, and only a few characteristic features were investigated. Originality/value The study will benefit all countries with a small number of data repositories or no repositories at all, with tools and techniques used by the top repositories to ensure long-term storage and accessibility to research data. In addition to this, the study provides a global overview of RDR and its characteristic features.
研究数据存储库的全球概述:re3data注册表分析
数据共享越来越被认为是学术研究和出版的重要组成部分。共享数据可以改善结果,推动研究和发现向前发展。鉴于数据共享的重要性,本研究的目的是揭示研究数据存储库(RDR)的现状,并阐明不同国家为有效组织和最佳利用科学文献所遵循的战略和策略。设计/方法/方法本研究的数据收集自RDR注册表(re3data registry) (re3data.org),该注册表涵盖了不同学科的RDR,并提供“Search”和“Browse”过滤选项以访问存储库。通过这些过滤选项,研究人员收集了存储库的元数据,即国家贡献、内容类型数据、存储库语言接口、软件使用、元数据标准和数据访问类型。此外,数据导出到谷歌Sheets进行分析和可视化。re3data注册中心拥有来自世界上大多数国家的丰富多样的数据存储库集合。据透露,英语是占主导地位的语言,最广泛使用的创建数据库的软件是“DataVerse”,其次是“Dspace”和“MySQL”。最常用的元数据标准是“Dublin Core”和“Datacite metadata schema”。大多数知识库是开放的,超过一半的知识库本质上是“学科”的,最重要的数据源包括“科学和统计数据”,其次是“标准办公文档”。研究的局限性/启示本研究的主要局限性是研究结果是基于通过单一注册库收集的数据,并且只调查了少数特征。独创性/价值这项研究将使所有拥有少量数据存储库或根本没有数据存储库的国家受益,顶级存储库使用的工具和技术将确保长期存储和获取研究数据。除此之外,本研究还提供了RDR的全球概况及其特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Information Discovery and Delivery
Information Discovery and Delivery INFORMATION SCIENCE & LIBRARY SCIENCE-
CiteScore
5.40
自引率
4.80%
发文量
21
期刊介绍: Information Discovery and Delivery covers information discovery and access for digital information researchers. This includes educators, knowledge professionals in education and cultural organisations, knowledge managers in media, health care and government, as well as librarians. The journal publishes research and practice which explores the digital information supply chain ie transport, flows, tracking, exchange and sharing, including within and between libraries. It is also interested in digital information capture, packaging and storage by ‘collectors’ of all kinds. Information is widely defined, including but not limited to: Records, Documents, Learning objects, Visual and sound files, Data and metadata and , User-generated content.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信