Woo Yeon Park, Teri Sippel Schmidt, Gabriel Salvador, Kevin O'Donnell, Brad Genereaux, Kyulee Jeon, Seng Chan You, Blake E Dewey, Paul Nagy
{"title":"Breaking data silos: incorporating the DICOM imaging standard into the OMOP CDM to enable multimodal research.","authors":"Woo Yeon Park, Teri Sippel Schmidt, Gabriel Salvador, Kevin O'Donnell, Brad Genereaux, Kyulee Jeon, Seng Chan You, Blake E Dewey, Paul Nagy","doi":"10.1093/jamia/ocaf091","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This work incorporates the Digital Imaging Communications in Medicine (DICOM) Standard into the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) to standardize and accurately represent imaging studies, such as acquisition parameters, in multimodal research studies.</p><p><strong>Materials and methods: </strong>DICOM is the internationally adopted standard that defines entities and relationships for biomedical imaging data used for clinical imaging studies. Most of the complexity in the DICOM data structure centers around the metadata. This metadata contains information about the patient and the modality acquisition parameters. We parsed the DICOM vocabularies in Parts 3, 6, and 16 to obtain structured metadata definitions and added these as custom concepts in the OMOP CDM vocabulary. To validate our pipeline, we harvested and transformed DICOM metadata from magnetic resonance images in the Alzheimer's Disease Neuroimaging Initiative (ADNI) study.</p><p><strong>Results: </strong>We extracted and added 5183 attributes and 3628 coded values from the DICOM standard as custom concepts to the OMOP CDM vocabulary. We ingested 545 ADNI imaging studies containing 4756 series and harvested 691 224 metadata values. They were filtered, transformed, and loaded in the OMOP CDM imaging extension using the OMOP concepts for the DICOM attributes and values.</p><p><strong>Discussion: </strong>This work is adaptable to clinical DICOM data. Future work will validate scalability and incorporate outcomes from automated analysis to provide a complete characterization research study within the OMOP framework.</p><p><strong>Conclusion: </strong>The incorporation of medical imaging into clinical observational studies has been a barrier to multi model research. This work demonstrates detailed phenotypes and paves the way for observational multimodal research.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1093/jamia/ocaf091","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: This work incorporates the Digital Imaging Communications in Medicine (DICOM) Standard into the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) to standardize and accurately represent imaging studies, such as acquisition parameters, in multimodal research studies.
Materials and methods: DICOM is the internationally adopted standard that defines entities and relationships for biomedical imaging data used for clinical imaging studies. Most of the complexity in the DICOM data structure centers around the metadata. This metadata contains information about the patient and the modality acquisition parameters. We parsed the DICOM vocabularies in Parts 3, 6, and 16 to obtain structured metadata definitions and added these as custom concepts in the OMOP CDM vocabulary. To validate our pipeline, we harvested and transformed DICOM metadata from magnetic resonance images in the Alzheimer's Disease Neuroimaging Initiative (ADNI) study.
Results: We extracted and added 5183 attributes and 3628 coded values from the DICOM standard as custom concepts to the OMOP CDM vocabulary. We ingested 545 ADNI imaging studies containing 4756 series and harvested 691 224 metadata values. They were filtered, transformed, and loaded in the OMOP CDM imaging extension using the OMOP concepts for the DICOM attributes and values.
Discussion: This work is adaptable to clinical DICOM data. Future work will validate scalability and incorporate outcomes from automated analysis to provide a complete characterization research study within the OMOP framework.
Conclusion: The incorporation of medical imaging into clinical observational studies has been a barrier to multi model research. This work demonstrates detailed phenotypes and paves the way for observational multimodal research.
期刊介绍:
JAMIA is AMIA''s premier peer-reviewed journal for biomedical and health informatics. Covering the full spectrum of activities in the field, JAMIA includes informatics articles in the areas of clinical care, clinical research, translational science, implementation science, imaging, education, consumer health, public health, and policy. JAMIA''s articles describe innovative informatics research and systems that help to advance biomedical science and to promote health. Case reports, perspectives and reviews also help readers stay connected with the most important informatics developments in implementation, policy and education.