Carole Goble, F. Bacall, S. Soiland-Reyes, Stuart Owen, Ignacio Eguinoa, Bert Droesbeke, Hervé Ménager, Laura Rodríguez-Navas, J. M. Fernández, Bjorn Gruening, Simone Leo, Luca Pireddu, M. Crusoe, Johan Gustafsson, S. Capella-Gutiérrez, Frederik Coppens
{"title":"The EOSC-Life Workflow Collaboratory for the Life Sciences","authors":"Carole Goble, F. Bacall, S. Soiland-Reyes, Stuart Owen, Ignacio Eguinoa, Bert Droesbeke, Hervé Ménager, Laura Rodríguez-Navas, J. M. Fernández, Bjorn Gruening, Simone Leo, Luca Pireddu, M. Crusoe, Johan Gustafsson, S. Capella-Gutiérrez, Frederik Coppens","doi":"10.52825/cordi.v1i.352","DOIUrl":"https://doi.org/10.52825/cordi.v1i.352","url":null,"abstract":"Workflows have become a major tool for the processing of Research Data, for example, data collection and data cleaning pipelines, data analytics, and data update feeds populating public archives. The EOSC-Life Research Infrastructure Cluster project brought together Europe’s Life Science Research Infrastructures to create an Open, Digital and Collaborative space for biological and medical research to develop a cloud-based Workflow Collaboratory. As adopting FAIR practices extends beyond data, the Workflow Collaboratory drives the implementation of FAIR computational workflows and tools. It fosters tool-focused collaborations and reuse via the sharing of data analysis workflows and offers an ecosystem of services for researchers and workflow specialists to find, use and reuse workflows. It’s web-friendly Digital Object Metadata Framework, based on RO-Crate and Bioschemas, supports the description and exchange of workflows across the services.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120967816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Markus Scheidgen, Sebastian Brückner, S. Brockhauser, L. Ghiringhelli, Felix Dietrich, Ahmed E. Mansour, José A. Márquez, Martin Albrecht, Heiko B. Weber, Silvana Botti, Martin Aeschlimann, Claudia Ambrosch-Draxl
{"title":"FAIR Research Data With NOMAD FAIRmat's Distributed, Schema-based Research-data Infrastructure to Harmonize RDM in Materials Science","authors":"Markus Scheidgen, Sebastian Brückner, S. Brockhauser, L. Ghiringhelli, Felix Dietrich, Ahmed E. Mansour, José A. Márquez, Martin Albrecht, Heiko B. Weber, Silvana Botti, Martin Aeschlimann, Claudia Ambrosch-Draxl","doi":"10.52825/cordi.v1i.376","DOIUrl":"https://doi.org/10.52825/cordi.v1i.376","url":null,"abstract":"Scientific research is becoming increasingly data centric, which requires more effort to manage, share, and publish data.NOMAD is a web-based platform that provides research data management (RDM) for materials-science data. In addition to core RDM functions like uploading and sharing files, NOMAD automatically extracts structured data from supported file formats, normalizes, and converts data from these formats. NOMAD provides an extendable framework for managing not just files, but structured machine-actionable harmonized and inter-operable data. This is the basis for a faceted search with domain-specific filters, a comprehensive API, structured data entry via customizable ELNs, integrated data-analysis and machine-learning tools. NOMAD is run as a free public service and can additionally be operated by research institutes. Connecting NOMAD installations through the public services will allow a federated data infrastructure to share data between research institutes and further harmonize RDM within a large research domain such as materials science.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114993275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tom Emery, Kasia Karpinska, Angelica Maineri, Lucas van der Meer
{"title":"The Open Data Infrastructure for Social Science and Economic Innovations (ODISSEI) Better Infrastructure, Better Science, Better Society","authors":"Tom Emery, Kasia Karpinska, Angelica Maineri, Lucas van der Meer","doi":"10.52825/cordi.v1i.393","DOIUrl":"https://doi.org/10.52825/cordi.v1i.393","url":null,"abstract":"The Open Data Infrastructure for Social Science and Economic Innovations (ODISSEI) equips social scientists in the Netherlands with the data, tools, and skills that are necessary to answer groundbreaking questions for scientific and policy making purposes. With a variety of use cases to pick from, we aim at engaging in a discussion with other Research data infrastructures to identify synergies but also challenges ahead.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127097255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lars Bernard, Christin Henzen, Auriol Degbelo, Daniel Nüst, Jörg Seegert
{"title":"NFDI4Earth: Improving Research Data Management in the Earth System Sciences","authors":"Lars Bernard, Christin Henzen, Auriol Degbelo, Daniel Nüst, Jörg Seegert","doi":"10.52825/cordi.v1i.288","DOIUrl":"https://doi.org/10.52825/cordi.v1i.288","url":null,"abstract":"The increasing availability of digital research products – typically encompassing data and software – in the Earth System Sciences (ESS) calls for new approaches to facilitate the management and reuse of these digital resources. NFDI4Earth addresses this gap and targets the harmonization of services related to digital research products in the ESS. In our initial phase, we aim to support researchers in 1) discovering and exploring relevant data and software sources, 2) exploratory spatial data analysis, 3) solving research data management problems and 4) creating and publishing information products. This abstract briefly presents concepts and the first results of the harmonization efforts within NFDI4Earth. It touches upon four points: the harmonization of resource descriptions, the harmonization of ESS-specific support for research data management (RDM), joint training activities for researchers in the ESS, and the harmonization of discovery and publishing workflows.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133916385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Determining the Similarity of Research Data by Using an Interoperable Metadata Extraction Method","authors":"Benedikt Heinrichs, M. A. Yazdi","doi":"10.52825/cordi.v1i.290","DOIUrl":"https://doi.org/10.52825/cordi.v1i.290","url":null,"abstract":"Determining the similarity of research data is not a simple task, as the formats can differ widely depending on the domain. Especially, since many formats are represented as binary files, the raw comparison of these will not yield good results. This makes it hard to accurately tell how similar certain research work is by comparing the data. With the emergence of extracted interoperable metadata, a form to describe data has been provided which is independent of the data format. Therefore, this work tries to use this extracted interoperable metadata and create a method to determine the similarity of research data based on their metadata. The produced method utilizes domain knowledge about the extracted metadata and the way they are formulated. A baseline is created, and further methods are created to compare to. The results show that our method outperforms all other methods, especially the ones which are focused on comparing the research data itself, not the metadata. Since the results are promising, we propose further investigations against other datasets and possible use cases.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133199947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dorothea Iglezakis, Džulia Terzijska, Susanne Arndt, Sophia Leimer, Johanna Hickmann, Marc Fuhrmans, Giacomo Lanza
{"title":"Modelling Scientific Processes With the m4i Ontology","authors":"Dorothea Iglezakis, Džulia Terzijska, Susanne Arndt, Sophia Leimer, Johanna Hickmann, Marc Fuhrmans, Giacomo Lanza","doi":"10.52825/cordi.v1i.271","DOIUrl":"https://doi.org/10.52825/cordi.v1i.271","url":null,"abstract":"We present an approach to document research data in a human and machine readable way by creating JSON-LD metadata files based on the m4i ontology. m4i is based on top level ontologies and reuses concepts of widely accepted ontologies to embed information modelled in m4i in larger contexts like a knowledge graph connecting research data with projects, actors, methods, tools and publications. We use a real-life research example from the engineering domain to show how to describe a research process with its object of research, the different steps with input and output data, the actors, and the used methods and tools. The resulting metadata files can serve as low-threshold documentation in a file system, as an exchange format between tools, as an input for data repositories and as a source of information to be used by scripts and tools.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133216737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gabriel Preuß, Alexander Schmidt, Mojeeb Sedeqi, Vivien Serve, Oonagh Mannix, Markus Kubin
{"title":"Monitoring the State of Open and FAIR Data in Helmholtz A Data-Harvesting and Dashboard-Approach by HMC","authors":"Gabriel Preuß, Alexander Schmidt, Mojeeb Sedeqi, Vivien Serve, Oonagh Mannix, Markus Kubin","doi":"10.52825/cordi.v1i.389","DOIUrl":"https://doi.org/10.52825/cordi.v1i.389","url":null,"abstract":"In this contribution we present an integrated approach to monitoring and assessing the state of open and FAIR data in the Helmholtz Association. The project is part of a multi-method approach by Hub Matter in the Helmholtz Metadata Collaboration (HMC). \u0000In a harvesting-approach, data published by Helmholtz researchers is found starting from literature metadata, harvested from the research centers. Data publications linked to that literature are identified using the SCHOLIX API. In a first approach to automated FAIR assessment, we adopted the F-UJI framework, as developed by the FAIRsFAIR consortium. \u0000The information collected is presented in an interactive dashboard. It allows to explore in which repositories Helmholtz researchers make their data publicly available, to engage Helmholtz communities, and to identify gaps towards improving the FAIRness of Helmholtz data. \u0000The dashboard is publicly available on https://fairdashboard.helmholtz-metadaten.de. The general approach as well as all program code are reusable by all research communities.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132169807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jens Dierkes, Birte Lindstädt, Ulrich Sax, Canan Hastik, Julia Fürst, Tanja Hörner, Sebastian Klammt, Ines Perrar, Iris Pigeot, Katja Restel, Carsten Oliver Schmidt, Aliaksandra Shutsko, Dagmar Waltemath, A. Zeleke
{"title":"Building the Next Generation of Data Savvy Biomedical Researchers","authors":"Jens Dierkes, Birte Lindstädt, Ulrich Sax, Canan Hastik, Julia Fürst, Tanja Hörner, Sebastian Klammt, Ines Perrar, Iris Pigeot, Katja Restel, Carsten Oliver Schmidt, Aliaksandra Shutsko, Dagmar Waltemath, A. Zeleke","doi":"10.52825/cordi.v1i.368","DOIUrl":"https://doi.org/10.52825/cordi.v1i.368","url":null,"abstract":"Modern research data management in biomedicine requires data literacy skills. The NFDI4Health consortium, the national research data infrastructure for personal health data, addresses this need with several different training concepts for clinical and epidemiological scientists. Both the institutional anchorage of the training and the thematic focus vary and can be assembled from a modular system as required. The aim is to enable multipliers to adapt and use the training modules through open educational resources. In addition, a competency profile for data stewards is under development to support the choice of required training in research institutions. The results will be fed into the NFDI “Education & Training” Section and will contribute to the Data Literacy Alliance in the future.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"11 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133112169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Firas Al Laban, Jan Bernoth, Michael Goedicke, Ulrike Lucke, Michael Striewe, P. Wieder, R. Yahyapour
{"title":"Establishing the Research Data Management Container in NFDIxCS","authors":"Firas Al Laban, Jan Bernoth, Michael Goedicke, Ulrike Lucke, Michael Striewe, P. Wieder, R. Yahyapour","doi":"10.52825/cordi.v1i.395","DOIUrl":"https://doi.org/10.52825/cordi.v1i.395","url":null,"abstract":"NFDIxCS is a consortium within the family of NFDI, which defines and establishes a research data management (RDM) infrastructure for Computer Science (CS). Based on a broad community process, the various types of research data, their metadata and quality criteria are agreed upon in the community. The resulting research data, along with all associated supplementary information and context such as software, metadata, and the corresponding execution environment are provided as an integral part of the overall infrastructure to meet the FAIR principles. This extended abstract addresses the interaction of the presented task area Architectures & Interfaces within the NFDIxCS consortium and the planned activities, challenges, and objectives of the recently launched project.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116344244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data Trustees - They Do Work! The Example of Research Data Centers","authors":"Daniel Fuß, Marie-Christine Laible","doi":"10.52825/cordi.v1i.241","DOIUrl":"https://doi.org/10.52825/cordi.v1i.241","url":null,"abstract":"This contribution presents the long established system of accredited Research Data Centers (RDCs). Created in the data-landscape of the social, behavioral, educational, and economic sciences, they enable access to restricted data and bridge interests of data providers and researchers. A distinctive feature of most research data in the above disciplines is the coverage of real persons. Such sensitive data require specific safeguards. The focus of this contribution is on the institutionalized processes of connecting and securing appropriate research data management strategies for this sort of data. It includes quality assurance measures through the accreditation of RDCs, a monitoring system and the regular cooperation of accredited RDCs.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114685131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}