Data Flow Construction and Quality Evaluation of Electronic Source Data in Clinical Trials: Pilot Study Based on Hospital Electronic Medical Records in China
{"title":"Data Flow Construction and Quality Evaluation of Electronic Source Data in Clinical Trials: Pilot Study Based on Hospital Electronic Medical Records in China","authors":"Yannan Yuan, Yun Mei, Shuhua Zhao, Shenglong Dai, Xiaohong Liu, Xiaojing Sun, Zhiying Fu, Liheng Zhou, Jie Ai, Liheng Ma, Min Jiang","doi":"10.2196/52934","DOIUrl":null,"url":null,"abstract":"Background: The traditional clinical trial data collection process requires a Clinical Research Coordinator (CRC) who is authorized by the investigators to read from the hospital electronic medical record. Using electronic source data opens a new path to extract subjects' data from EHR and transfer directly to EDC (often the method is referred to as eSource ).The eSource technology in clinical trial data flow can improve data quality without compromising timeliness. At the same time, improved data collection efficiency reduces clinical trial costs. Objective: Explore how to extract clinical trial-related data from hospital electronic health record system (EHR), transform the data into an electronic data capture system (EDC) required format, and transfer it into sponsor's environment. Evaluate the transferred datasets to validate the availability, completeness, and accuracy of building eSource dataflow. Methods: A prospective clinical trial study registered on the \"Drug Clinical Trial Registration and Information Disclosure Platform (http://www.chinadrugtrials.org.cn/) \" was selected, and the production data environment of EHR relied on to extract the structured data of four Case Report Form(CRF) data modules: demographics, vital signs, local laboratory, and concomitant medications from EHR. Extracted data was mapped & transformed, de-identified, and transferred to the sponsor’s environments. Data validation was performed based on availability, completeness and accuracy. Results: In a secure and controlled data environment, clinical trial data was successfully transferred from a hospital EHR to sponsor's environment with 100% transcriptional accuracy, but availability and completeness could be improved. Conclusions: Data availability is low due to some fields required in EDC not being available directly in the EHR. Concurrently, some data is still in unstructured data format and paper-based medical record data, therefore data completeness in the EHR is low. The top-level design of eSource and the construction of hospital electronic data standards should help lay a foundation for full electronic data flow from EHR to EDC in future.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"31 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/52934","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The traditional clinical trial data collection process requires a Clinical Research Coordinator (CRC) who is authorized by the investigators to read from the hospital electronic medical record. Using electronic source data opens a new path to extract subjects' data from EHR and transfer directly to EDC (often the method is referred to as eSource ).The eSource technology in clinical trial data flow can improve data quality without compromising timeliness. At the same time, improved data collection efficiency reduces clinical trial costs. Objective: Explore how to extract clinical trial-related data from hospital electronic health record system (EHR), transform the data into an electronic data capture system (EDC) required format, and transfer it into sponsor's environment. Evaluate the transferred datasets to validate the availability, completeness, and accuracy of building eSource dataflow. Methods: A prospective clinical trial study registered on the "Drug Clinical Trial Registration and Information Disclosure Platform (http://www.chinadrugtrials.org.cn/) " was selected, and the production data environment of EHR relied on to extract the structured data of four Case Report Form(CRF) data modules: demographics, vital signs, local laboratory, and concomitant medications from EHR. Extracted data was mapped & transformed, de-identified, and transferred to the sponsor’s environments. Data validation was performed based on availability, completeness and accuracy. Results: In a secure and controlled data environment, clinical trial data was successfully transferred from a hospital EHR to sponsor's environment with 100% transcriptional accuracy, but availability and completeness could be improved. Conclusions: Data availability is low due to some fields required in EDC not being available directly in the EHR. Concurrently, some data is still in unstructured data format and paper-based medical record data, therefore data completeness in the EHR is low. The top-level design of eSource and the construction of hospital electronic data standards should help lay a foundation for full electronic data flow from EHR to EDC in future.
背景:传统的临床试验数据采集过程需要临床研究协调员(CRC)经研究者授权从医院电子病历中读取数据。在临床试验数据流中使用电子源数据开辟了一条从电子病历中提取受试者数据并直接传输到 EDC 的新途径(通常这种方法被称为 eSource)。同时,数据收集效率的提高还能降低临床试验成本。目标探索如何从医院电子病历系统(EHR)中提取临床试验相关数据,将数据转换为电子数据采集系统(EDC)要求的格式,并将其传输到申办者的环境中。评估传输的数据集,以验证构建 eSource 数据流的可用性、完整性和准确性。方法选择一项在 "药物临床试验注册与信息公开平台(http://www.chinadrugtrials.org.cn/)"上注册的前瞻性临床试验研究,依托电子病历的生产数据环境,从电子病历中提取病例报告表(CRF)四个数据模块的结构化数据:人口统计学、生命体征、当地实验室和伴随药物。提取的数据经过映射和转换、去标识化后传输到赞助商的环境中。根据可用性、完整性和准确性进行数据验证。结果在安全可控的数据环境中,临床试验数据成功地从医院电子病历传输到赞助商的环境中,转录准确率达到 100%,但可用性和完整性有待提高。结论:由于 EDC 所需的某些字段无法直接在 EHR 中使用,因此数据可用性较低。同时,一些数据仍是非结构化数据格式和纸质病历数据,因此电子病历中的数据完整性较低。eSource 的顶层设计和医院电子数据标准的建设,应有助于为今后从电子健康记 录到电子病历数据库的全面电子数据流奠定基础。
期刊介绍:
JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals.
Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.