电子病历的自动数据提取:数据挖掘构建胃肠病学临床试验资格研究数据库的有效性。

IF 1.5 4区 医学 Q2 MEDICINE, GENERAL & INTERNAL
Upsala journal of medical sciences Pub Date : 2022-01-27 eCollection Date: 2022-01-01 DOI:10.48101/ujms.v127.8260
Nora Joseph, Ida Lindblad, Sara Zaker, Sharareh Elfversson, Maria Albinzon, Øyvind Ødegård, Li Hantler, Per M Hellström
{"title":"电子病历的自动数据提取:数据挖掘构建胃肠病学临床试验资格研究数据库的有效性。","authors":"Nora Joseph,&nbsp;Ida Lindblad,&nbsp;Sara Zaker,&nbsp;Sharareh Elfversson,&nbsp;Maria Albinzon,&nbsp;Øyvind Ødegård,&nbsp;Li Hantler,&nbsp;Per M Hellström","doi":"10.48101/ujms.v127.8260","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP).</p><p><strong>Methods: </strong>The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data.</p><p><strong>Results: </strong>From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6-101.1% over the years with 96.1% weighted average sensitivity.</p><p><strong>Conclusion: </strong>Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials.</p>","PeriodicalId":23458,"journal":{"name":"Upsala journal of medical sciences","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2022-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8809051/pdf/","citationCount":"3","resultStr":"{\"title\":\"Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.\",\"authors\":\"Nora Joseph,&nbsp;Ida Lindblad,&nbsp;Sara Zaker,&nbsp;Sharareh Elfversson,&nbsp;Maria Albinzon,&nbsp;Øyvind Ødegård,&nbsp;Li Hantler,&nbsp;Per M Hellström\",\"doi\":\"10.48101/ujms.v127.8260\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP).</p><p><strong>Methods: </strong>The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data.</p><p><strong>Results: </strong>From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6-101.1% over the years with 96.1% weighted average sensitivity.</p><p><strong>Conclusion: </strong>Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials.</p>\",\"PeriodicalId\":23458,\"journal\":{\"name\":\"Upsala journal of medical sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2022-01-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8809051/pdf/\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Upsala journal of medical sciences\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.48101/ujms.v127.8260\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Upsala journal of medical sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.48101/ujms.v127.8260","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 3

摘要

背景:电子医疗记录(emr)被用于存储与患者相关的医疗保健信息。使用数据挖掘技术,可以有效地利用大量数据并从中获益。我们的目的是评估自定义提取程序(CXP)提取的数据的有效性。方法:CXP在快速标准化流程中提取和组织数据。CXP被编程为使用已定义的国际疾病分类-10 (ICD-10)代码从emr中提取tnf α-原生活动性溃疡性结肠炎(UC)患者。提取的数据与人工评估EMR同时读取,以与cxp提取的数据进行比较。结果:从完整的EMR集中,提取了2802例代码为K51 (UC)的患者。然后,CXP按照纳入和排除标准提取332例患者。其中,97.5%被正确识别,最终有320例符合研究条件。将cxp提取的数据与人工评估的emr进行比较,多年来的回收率为95.6-101.1%,加权平均灵敏度为96.1%。结论:利用CXP软件可有效提取相关EMR数据,且无明显误差。因此,通过从电子病历中提取,CXP可以准确地识别患者,并有能力通过查找具有所要求代码的患者来促进研究和临床试验,并根据指定的纳入和排除标准筛选分项个人。除此之外,医疗程序和实验室数据可以从电子病历中快速检索,以创建定制的提取材料数据库,以便在临床试验中立即使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.

Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.

Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.

Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.

Background: Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP).

Methods: The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data.

Results: From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6-101.1% over the years with 96.1% weighted average sensitivity.

Conclusion: Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Upsala journal of medical sciences
Upsala journal of medical sciences 医学-医学:内科
CiteScore
5.60
自引率
0.00%
发文量
31
审稿时长
6-12 weeks
期刊介绍: Upsala Journal of Medical Sciences is published for the Upsala Medical Society. It has been published since 1865 and is one of the oldest medical journals in Sweden. The journal publishes clinical and experimental original works in the medical field. Although focusing on regional issues, the journal always welcomes contributions from outside Sweden. Specially extended issues are published occasionally, dealing with special topics, congress proceedings and academic dissertations.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信