From Scanner to Science: Reusing Clinically Acquired Medical Images for Research.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science Pub Date : 2025-06-10 eCollection Date: 2025-01-01

Jenna M Schabdach, Remo M S Williams, Joseph Logan, Viveknarayanan Padmanabhan, Russell D'Aiello Iii, Johnny Mclaughlin, Alexander Gonzalez, Edward M Krause, Gregory E Tasian, Susan Sotardi, Aaron F Alexander-Bloch

{"title":"From Scanner to Science: Reusing Clinically Acquired Medical Images for Research.","authors":"Jenna M Schabdach, Remo M S Williams, Joseph Logan, Viveknarayanan Padmanabhan, Russell D'Aiello Iii, Johnny Mclaughlin, Alexander Gonzalez, Edward M Krause, Gregory E Tasian, Susan Sotardi, Aaron F Alexander-Bloch","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Growth in the field of medical imaging research has revealed a need for larger volume and variety in available data. This need could be met using curated clinically acquired data, but the process for getting this data from the scanners to the scientists is complex and lengthy. We present a manifest-driven modular Extract, Transform, and Load (ETL) process named Locutus designed to appropriately handle difficulties present in the process of reusing clinically acquired medical imaging data. The design of Locutus was based on four foundational assumptions about medical data, research data, and communication. All parts of a workflow must communicate with each other and be adaptable to unique data delivery requests. In addition, the workflow must be robust to possible errors and uncertainties in clinically-acquired data, which may require human intervention to resolve. With these assumptions in mind,Locutus presents a five-phase workflow for downloading, deidentifying, and delivering unique requests for imaging data. The phases include initialization, data preparation, extraction of data from the research server to a pre-deidentification data warehouse, transformation into deidentified space, and loading into post-deidentification data warehouse. To date, this workflow has been used to process 32,962 imaging accessions for research use. This number is expected to grow as technical challenges are addressed and the role of humans is expected to shift from frequent intervention to regular monitoring.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"471-480"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150695/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Growth in the field of medical imaging research has revealed a need for larger volume and variety in available data. This need could be met using curated clinically acquired data, but the process for getting this data from the scanners to the scientists is complex and lengthy. We present a manifest-driven modular Extract, Transform, and Load (ETL) process named Locutus designed to appropriately handle difficulties present in the process of reusing clinically acquired medical imaging data. The design of Locutus was based on four foundational assumptions about medical data, research data, and communication. All parts of a workflow must communicate with each other and be adaptable to unique data delivery requests. In addition, the workflow must be robust to possible errors and uncertainties in clinically-acquired data, which may require human intervention to resolve. With these assumptions in mind,Locutus presents a five-phase workflow for downloading, deidentifying, and delivering unique requests for imaging data. The phases include initialization, data preparation, extraction of data from the research server to a pre-deidentification data warehouse, transformation into deidentified space, and loading into post-deidentification data warehouse. To date, this workflow has been used to process 32,962 imaging accessions for research use. This number is expected to grow as technical challenges are addressed and the role of humans is expected to shift from frequent intervention to regular monitoring.

本刊更多论文

从扫描仪到科学：重新使用临床获得的医学图像进行研究。

医学影像研究领域的增长表明，需要更大的数量和种类的可用数据。这种需求可以通过收集临床数据来满足，但是将这些数据从扫描仪传送给科学家的过程是复杂而漫长的。我们提出了一个清单驱动的模块化提取、转换和加载（ETL）过程，命名为Locutus，旨在适当地处理在重复使用临床获得的医学成像数据过程中存在的困难。loctus的设计基于关于医疗数据、研究数据和通信的四个基本假设。工作流的所有部分必须相互通信，并适应独特的数据传递请求。此外，工作流必须对临床数据中可能出现的错误和不确定性具有鲁棒性，这可能需要人工干预来解决。考虑到这些假设，loctus提出了一个五阶段的工作流程，用于下载、去识别和提供独特的成像数据请求。这些阶段包括初始化、数据准备、从研究服务器提取数据到预去识别数据仓库、转换到去识别空间以及加载到后去识别数据仓库。迄今为止，该工作流程已用于处理32,962个用于研究用途的成像接入。随着技术挑战的解决，以及人类的作用预计将从频繁干预转变为定期监测，这一数字预计还会增长。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

自引率

0.00%

发文量