{"title":"International Data Access Network (IDAN) for Sensitive Microdata in Humanities & Social Sciences","authors":"Dana Müller, Matthias Umkehrer","doi":"10.52825/cordi.v1i.269","DOIUrl":"https://doi.org/10.52825/cordi.v1i.269","url":null,"abstract":"In a globalized world it becomes increasingly important to provide international research data. This is particularly true in Humanities & Social Sciences. With legal frameworks changing, administrative data can increasingly be utilised both for official statistics and to facilitate new research, enabling the development of evidence-based policy for the public benefit. Secure access conditions generally apply to using these rich, highly detailed data. However, using data from various sources is difficult when they are fragmented in ‘silos’ between several Research Data Centres (RDCs). While this might be the case at a national level, it is very likely to be the case at an international level. The latter is a major obstacle for international comparative research.\nThe International Data Access Network (IDAN, https://idan.network/) aims at creating a concrete operational international framework enabling access to controlled data for research. The Network was founded in 2018, at the same time as the European General Data Protection Regulation became applicable. It currently involves six RDCs from France, Germany, the Netherlands and the United Kingdom. Within IDAN, cooperative solutions are developed step by step, taking into account the particular legal and security requirements that are at the core of both national and transnational access to confidential data. Initially, the partners’ access systems are being implemented on each partner’s premises based on bilateral agreements. This process involves combining requirements of security and surveillance for Safe Rooms. 
It sets up a new concrete environment for researchers to work remotely with data from the other partners from their local RDC.\nIDAN is a perfect match for the topic “Connecting RDM” regarding confidential data, which are in high demand within the international research community. The presentation will show the opportunities and obstacles in developing the IDAN infrastructure. Based on discussions with research experts with heterogeneous backgrounds in Europe, it will also discuss researchers’ expectations and needs to further improve access to international research data. IDAN could be a role model for other research disciplines and research infrastructures for pushing the boundaries of international data access.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121221152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Robert Huber, Naouel Karam, Oliver Koepler, Philip Strömert
{"title":"Finding a Common Ground for NFDI Terminologies Proposing I-ADOPT as a NFDI Wide Semantic Layer","authors":"Robert Huber, Naouel Karam, Oliver Koepler, Philip Strömert","doi":"10.52825/cordi.v1i.366","DOIUrl":"https://doi.org/10.52825/cordi.v1i.366","url":null,"abstract":"We present the goals and first results of the NFDI Section Metadata Working Group on Ontology Harmonisation and Mapping. The ongoing analysis of the terminologies used within the NFDI consortia suggests that agreeing on a single NFDI-wide ontological framework for very general and interdisciplinary concepts needed in the semantic annotation of research data is not feasible in the short run due to domain-specific requirements. We thus present how the Research Data Alliance (RDA) framework I-ADOPT, which focuses only on the formal description of observation variables in scientific studies, could be utilised to provide an NFDI-wide and global common ground for data interoperability.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123874301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pascal Siegers, Antonia C. May, Claudia Saalbach, Jana Nebelin, Dagmar Kern, Andreas Daniel, Ben Zapilko, Fakhri Momeni, Knut Wenzig, Jan Goebel
{"title":"Linked Open Research Data for Social Science A Concept Registry for Granular Data Documentation","authors":"Pascal Siegers, Antonia C. May, Claudia Saalbach, Jana Nebelin, Dagmar Kern, Andreas Daniel, Ben Zapilko, Fakhri Momeni, Knut Wenzig, Jan Goebel","doi":"10.52825/cordi.v1i.300","DOIUrl":"https://doi.org/10.52825/cordi.v1i.300","url":null,"abstract":"The re-use of research data is an integral part of research practice in the social and economic sciences. To find relevant data, researchers need adequate search facilities. However, a comprehensive, thematic search for research data is made more difficult by inconsistent or missing semantic indexing of data at the level of social science concepts (e.g., representing the theory language). Either the data is not documented at a granular level, or primary investigators use their own ad-hoc terminology to describe their data. Consequently, researchers have to make great efforts to find relevant or comparable data. From the user's perspective, the lack of theory language in data documentation impedes effective data searches and thus significantly limits the research potential of existing data collections. Because there is currently no semantic model for indexing the data content, the specific challenge for improving data search lies in establishing concept-based indexing of research data. Research infrastructures need technology for the harmonized semantic indexing of their research data. The LORD concept registry aims at closing this gap by developing a registry of sociological and economic concepts and, following the FAIR principles, making this concept registry generally available to the scientific community. As a first step, we developed a basic data model for the Concept Registry using the Unified Modeling Language (UML). All links are created and managed in the form of so-called RDF triples. An annotation application allows for linking specific questions/variables to concepts. 
The application also includes the two SKOS-compliant thesauri, \"Thesaurus Social Sciences\" (TheSoz) and \"Standard Thesaurus Economics\" (STW) but could be extended to other resources like ELSST.\nWe illustrate the application of the LORD concept registry with examples from three large-scale survey programmes (German Socio-Economic Panel, German General Social Survey, National Academics Panel Study). The initial focus is on variables and questions with overlapping content in the three survey programmes, as they form a sound basis for cross-linking with concepts.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"215 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122845984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. Wissing, Basavaraja Bheemalingappa Sagar, Markus Blank-Burian, A. Drabent, Sören Fleischer, Oliver Freyermuth, M. Giffels, Andreas Henkel, M. Hoeft, J. Künsemöller, Nicola Malavasi, Cristina Manzano, B. Roland, Hubert Simma, Dominik Schwarz, Kilian Schwarz, Nandini Suvvi Neelakantaia, Luka Vomberg, Christian Voss, Vadim Vybornov, Michael Wigard, S. Wozniewski
{"title":"Distributed Computing and Storage Infrastructure for PUNCH4NFDI","authors":"C. Wissing, Basavaraja Bheemalingappa Sagar, Markus Blank-Burian, A. Drabent, Sören Fleischer, Oliver Freyermuth, M. Giffels, Andreas Henkel, M. Hoeft, J. Künsemöller, Nicola Malavasi, Cristina Manzano, B. Roland, Hubert Simma, Dominik Schwarz, Kilian Schwarz, Nandini Suvvi Neelakantaia, Luka Vomberg, Christian Voss, Vadim Vybornov, Michael Wigard, S. Wozniewski","doi":"10.52825/cordi.v1i.261","DOIUrl":"https://doi.org/10.52825/cordi.v1i.261","url":null,"abstract":"The PUNCH4NFDI consortium brings together scientists from the German particle physics, hadron and nuclear physics, astronomy, and astro-particle physics communities to improve the management and (re-)use of scientific data from these interrelated communities. The PUNCH sciences have a long tradition of building large instruments that are planned, constructed and operated by international collaborations. While the large collaborations typically employ advanced tools for data management and distribution, smaller-scale experiments often suffer from very limited resources to address these aspects. One of the aims of the consortium is to evaluate and enable or adopt existing solutions. Instances of a prototype federated and distributed computing and storage infrastructure have been set up at a handful of sites in Germany. 
This prototype is used to gain experience in running scientific workflows to further guide the development of the Science Data Platform, which is an overarching goal of the consortium.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116526341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nina Leonie Weisweiler, Roland Bertelmann, Steffi Genderjahn, Heinz Pampel
{"title":"Connecting the Dots","authors":"Nina Leonie Weisweiler, Roland Bertelmann, Steffi Genderjahn, Heinz Pampel","doi":"10.52825/cordi.v1i.402","DOIUrl":"https://doi.org/10.52825/cordi.v1i.402","url":null,"abstract":"The Helmholtz Centers operate scientific-technical infrastructures that produce a high volume of digital research data, making Helmholtz a hub for expertise in research data. With the rapid digital transformation and growing volume of data, Helmholtz has implemented relevant policies and engaged in NFDI in various ways to manage and use research data effectively. The Helmholtz Information & Data Science Incubator, in particular the HMC and HIFIS platforms, contributes to these networking activities. The Helmholtz Open Science Office organized several internal forums for Helmholtz members to discuss NFDI; their findings will be presented at the Conference on Research Data Infrastructure (CoRDI). Helmholtz' multifaceted commitment to research data infrastructure is closely related to NFDI activities. The presentation at CoRDI will demonstrate how a large national research data ecosystem can be successfully connected to the NFDI network, highlighting opportunities for future collaboration.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135049002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Castellum - A Data Protection-Compliant Web Application for the Subject Management of Human Science Studies","authors":"K. Mader, Maike Kleemeyer","doi":"10.52825/cordi.v1i.325","DOIUrl":"https://doi.org/10.52825/cordi.v1i.325","url":null,"abstract":"Research institutions, especially in the human sciences, have been confronted with strict guidelines for the processing of personal data since the European General Data Protection Regulation came into force in 2018. This presents them with new challenges in recruiting and managing study participants and processing the associated data. To meet these challenges, Castellum has been developed at the Max Planck Institute for Human Development since 2016 and has been used successfully since May 2020. Castellum is a turnkey open-source web application for the data protection-compliant management of volunteer data. Among other things, Castellum simplifies study recruitment, appointment management and study implementation. Various institutions have expressed interest in Castellum in the recent past. On the one hand, this may be due to the fact that no comparable open source project exists. On the other hand, Castellum was explicitly designed to be so flexible and expandable that it can be adapted to the workflows and processes of other research institutions with relatively little effort. The use of Castellum has so far been particularly successful for institutions that conduct several studies in parallel, that want to proactively recruit participants from an internal pool of people interested in the study, and that generate data when dealing with these subjects. 
Since Castellum is subject to the AGPL licence, the software may be used free of charge without restrictions.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"100 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132236428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Harald Sack, Torsten Schrade, O. Bruns, Etienne Posthumus, Tabea Tietz, Ebrahim Norouzi, J. Waitelonis, Heike Fliegl, Linnaea Söhn, Julia Tolksdorf, Jonatan Jalle Steller, Abril Azócar Guzmán, Said Fathalla, Ahmad Zainul Ihsan, Volker Hofmann, Stefan Sandfeld, F. Fritzen, A. Laadhar, Sonja Schimmler, Peter Mutschke
{"title":"Knowledge Graph Based RDM Solutions NFDI4Culture - NFDI-MatWerk - NFDI4DataScience","authors":"Harald Sack, Torsten Schrade, O. Bruns, Etienne Posthumus, Tabea Tietz, Ebrahim Norouzi, J. Waitelonis, Heike Fliegl, Linnaea Söhn, Julia Tolksdorf, Jonatan Jalle Steller, Abril Azócar Guzmán, Said Fathalla, Ahmad Zainul Ihsan, Volker Hofmann, Stefan Sandfeld, F. Fritzen, A. Laadhar, Sonja Schimmler, Peter Mutschke","doi":"10.52825/cordi.v1i.371","DOIUrl":"https://doi.org/10.52825/cordi.v1i.371","url":null,"abstract":"Based on our experience within the NFDI4Culture and NFDI-MatWerk projects we propose generalized knowledge graph based research data management solutions, which are applicable to other consortia. Our solution covers the construction of a common NFDI core ontology adapted to specific domains via domain extensions as a basis for a knowledge graph (KG) providing information about a consortium and its related research data and software resources. This KG serves as a backend for the web portal that enables interactive access and management of this data. Already implemented for NFDI4Culture and to be adapted by NFDI-MatWerk, this solution might serve as an example solution also for other consortia. We are synchronizing our efforts with ongoing work to implement knowledge graph based research data management in NFDI4DataScience.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114775893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tunc Kayikcioglu, Sanjay Kumar Srikakulam, Wolfgang Maier, Björn A. Grüning
{"title":"Interactive Tools (IT) in Galaxy Combining Synchronous and Asynchronous Workflows","authors":"Tunc Kayikcioglu, Sanjay Kumar Srikakulam, Wolfgang Maier, Björn A. Grüning","doi":"10.52825/cordi.v1i.419","DOIUrl":"https://doi.org/10.52825/cordi.v1i.419","url":null,"abstract":"Galaxy [1] is a GUI-based scientific data management and analysis platform that aims to expand the target audience of scientific computing to scientists without advanced computer literacy or access to computing infrastructure. During the operation of a typical asynchronous workflow, the interaction of the user triggers a job submission to a hosted computing system, the results of which are then reported back to the user upon completion of the task. However, such an asynchronous interaction scheme is not compatible with scientific software that requires user interaction during the execution itself (synchronous), e.g. many data visualisation tools. Interactive Tools on Galaxy (IT; [2]) leverage tool and job components of Galaxy by taking advantage of the latest containerization solutions to provide access to interactive executable software. With ITs, Galaxy is enabling the more and more common use-case of asynchronous workflows while keeping the possibility to bridge at any time to synchronous workflows and reusing all the other Galaxy data management features.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116712620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Manuela Richter, Johannes Putzke, Thomas M. Schimmer, Anett Mehler-Bicher
{"title":"We are Still Here, too! Research Data Management at Universities of Applied Sciences Approaches From the Project \"FDM@HAW.rlp\" in the German State Rhineland-Palatinate","authors":"Manuela Richter, Johannes Putzke, Thomas M. Schimmer, Anett Mehler-Bicher","doi":"10.52825/cordi.v1i.363","DOIUrl":"https://doi.org/10.52825/cordi.v1i.363","url":null,"abstract":"The objective of the German non-profit association NFDI (German short form for “National Research Data Infrastructure”) is to make the data stock of the entire German science system accessible to the public. To do so, it should involve all stakeholders. However, currently the Universities of Applied Sciences (UAS) are underrepresented in the NFDI, and there is a danger of neglecting their needs. Therefore, we present the project “Research Data Management at Universities of Applied Sciences in the State of Rhineland-Palatinate” (FDM@HAW.rlp), which is funded by the German Federal Ministry of Education and Research (BMBF) and financed within the Recovery and Resilience Facility of the European Union. 
In the project, seven public UAS in Rhineland-Palatinate and the Catholic University of Applied Sciences (CUAS) Mainz follow a common goal: They intend to establish an institutional RDM within a period of three years by building up competencies at the UAS, setting up services for researchers and finding solutions for a common technical infrastructure.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116054611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Amanda Wein, Jan Reinkensmeier, Anke Weidlich, Johan Lilliestam, V. Hagenmeyer, Mascha Richter, Sören Auer, Astrid Niesse, Sebastian Lehnhoff
{"title":"FAIR Data for Energy System Research An Overview of NFDI4Energy Task Area 4","authors":"Amanda Wein, Jan Reinkensmeier, Anke Weidlich, Johan Lilliestam, V. Hagenmeyer, Mascha Richter, Sören Auer, Astrid Niesse, Sebastian Lehnhoff","doi":"10.52825/cordi.v1i.364","DOIUrl":"https://doi.org/10.52825/cordi.v1i.364","url":null,"abstract":"The NFDI4Energy consortium aims to establish new services filling a variety of needs for the energy system research community, from making FAIR research data easily accessible to promoting collaboration among community entities. Seven Task Areas (TAs) have been defined to achieve the consortium’s objectives, each with a specific focus. Task Area 4 (TA4): FAIR Data for Energy System Research shall develop ontologies, metadata standards, and services to promote semantic consistency and improve interoperability of energy research projects, thereby supporting the harmonization of data management among various institutions and research fields.","PeriodicalId":359879,"journal":{"name":"Proceedings of the Conference on Research Data Infrastructure","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114946055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}