Development and Implementation of an Open, Modular, and Participatory Toolchain for Distributed IT Development in Healthcare Research - Lessons Learned.
Daniel Neumann, Richard Gebler, Jana Kiederle, Jördis Beck, Fabio Aubele, Alexander Struebing, Florian Schmidt, Matthias Reusche, Helene Koester, Markus Loeffler, Sebastian Staeubert
{"title":"Development and Implementation of an Open, Modular, and Participatory Toolchain for Distributed IT Development in Healthcare Research - Lessons Learned.","authors":"Daniel Neumann, Richard Gebler, Jana Kiederle, Jördis Beck, Fabio Aubele, Alexander Struebing, Florian Schmidt, Matthias Reusche, Helene Koester, Markus Loeffler, Sebastian Staeubert","doi":"10.3233/SHTI251418","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Distributed healthcare research infrastructures face significant challenges when translating routine clinical data into harmonized, research-ready formats using HL7 FHIR standards.</p><p><strong>State of the art: </strong>Existing FHIR-based pipelines such as the SMART/HL7 FHIR Bulk Data Access API, FHIR-to-OMOP mappings, and analytical services like Pathling demonstrate technical feasibility. However, most assume semantically valid FHIR data, operate within single-institution settings, and lack practical guidance for deployment across heterogeneous, regulated environments. Technical Framework and Deployment: Within the German Medical Informatics Initiative (MII) and the INTERPOLAR project, we developed an open, modular, and participatory toolchain for decentralized FHIR-based data transformation and export across multiple Data Integration Centers (DICs). The toolchain supports FHIR extraction, profile-based transformation, REDCap integration, and OMOP-compatible export. Deployment required adapting to local infrastructures, regulatory boundaries (e.g., de-identified FHIR stores, restricted network access), and clinical domain needs. Configurable modules, proxy support, and site-specific adaptations were essential for integration into operational hospital workflows.</p><p><strong>Lessons learned: </strong>Key lessons include the necessity of early access to real data, the limitations of synthetic test data, the value of joint workshops for profile interpretation, and the need for adaptable validation tooling. Organizational knowledge gaps, inconsistent FHIR implementations, and performance issues in resource flattening were addressed through co-design and iterative rollout strategies. Validator modules are essential across technical, content, and cross-site consistency levels.</p><p><strong>Conclusion: </strong>Centralized development paired with decentralized, participatory deployment enables scalable, GDPR-compliant infrastructures for embedded clinical research. This approach offers a replicable framework for future multi-site initiatives aiming to leverage real-world data across diverse environments.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"378-385"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251418","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Distributed healthcare research infrastructures face significant challenges when translating routine clinical data into harmonized, research-ready formats using HL7 FHIR standards.
State of the art: Existing FHIR-based pipelines such as the SMART/HL7 FHIR Bulk Data Access API, FHIR-to-OMOP mappings, and analytical services like Pathling demonstrate technical feasibility. However, most assume semantically valid FHIR data, operate within single-institution settings, and lack practical guidance for deployment across heterogeneous, regulated environments. Technical Framework and Deployment: Within the German Medical Informatics Initiative (MII) and the INTERPOLAR project, we developed an open, modular, and participatory toolchain for decentralized FHIR-based data transformation and export across multiple Data Integration Centers (DICs). The toolchain supports FHIR extraction, profile-based transformation, REDCap integration, and OMOP-compatible export. Deployment required adapting to local infrastructures, regulatory boundaries (e.g., de-identified FHIR stores, restricted network access), and clinical domain needs. Configurable modules, proxy support, and site-specific adaptations were essential for integration into operational hospital workflows.
Lessons learned: Key lessons include the necessity of early access to real data, the limitations of synthetic test data, the value of joint workshops for profile interpretation, and the need for adaptable validation tooling. Organizational knowledge gaps, inconsistent FHIR implementations, and performance issues in resource flattening were addressed through co-design and iterative rollout strategies. Validator modules are essential across technical, content, and cross-site consistency levels.
Conclusion: Centralized development paired with decentralized, participatory deployment enables scalable, GDPR-compliant infrastructures for embedded clinical research. This approach offers a replicable framework for future multi-site initiatives aiming to leverage real-world data across diverse environments.