How to (Semi)-Automatically Spot Prescreening Oriented Eligibility Criteria.

Morgan Vaterkowski, Nadir Ammour, Christel Daniel, Emmanuelle Kempf
{"title":"How to (Semi)-Automatically Spot Prescreening Oriented Eligibility Criteria.","authors":"Morgan Vaterkowski, Nadir Ammour, Christel Daniel, Emmanuelle Kempf","doi":"10.3233/SHTI251546","DOIUrl":null,"url":null,"abstract":"<p><p>Clinical Trial (CT) Recruitment Support Systems (CTRSS) querying Electronic Health Records (EHR) for patient-trial matching during CT execution have been expanding. Since free text CT eligibility criteria (EC) are not readily suitable for the automation of the EHR querying, the configuration of EHR-based CTRSS requires a time-consuming and usually manual processing of EC focusing on those that are the most relevant at the pre-inclusion (prescreening) step. The aim of this study is to provide a methodological approach to semi-automatically detect Prescreening-Oriented Eligibility Criteria (POEC) and build a library of POEC usable in the context of the development and evaluation of EHR-based Clinical Trial Recruitment Support Systems (CTRSS). We proposed an approach for decomposing free text EC into standardized elements and developing a rule-based algorithm to semi-automatically detect POEC. In addition, this paper describes the characteristics of a publicly available POEC library usable for CTRSS evaluation. An annotation framework consisting in 96 patterns of elementary EC categorized in 17 domains was used to annotate 381 free text EC from 20 CT dedicated to various cancer types. This training dataset was used to develop a rule-based algorithm detecting POEC. This study provides a methodological approach to (semi)-automatically spot POEC and store them in a library considering advances in the field of CTRSS. The PENELOPE-C2Q pipeline is designed to feed the PENELOPE POEC library, both having the potential to facilitate the reuse of EHR data for better participation of patients to research.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"332 ","pages":"288-292"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251546","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Clinical Trial (CT) Recruitment Support Systems (CTRSS) querying Electronic Health Records (EHR) for patient-trial matching during CT execution have been expanding. Since free text CT eligibility criteria (EC) are not readily suitable for the automation of the EHR querying, the configuration of EHR-based CTRSS requires a time-consuming and usually manual processing of EC focusing on those that are the most relevant at the pre-inclusion (prescreening) step. The aim of this study is to provide a methodological approach to semi-automatically detect Prescreening-Oriented Eligibility Criteria (POEC) and build a library of POEC usable in the context of the development and evaluation of EHR-based Clinical Trial Recruitment Support Systems (CTRSS). We proposed an approach for decomposing free text EC into standardized elements and developing a rule-based algorithm to semi-automatically detect POEC. In addition, this paper describes the characteristics of a publicly available POEC library usable for CTRSS evaluation. An annotation framework consisting in 96 patterns of elementary EC categorized in 17 domains was used to annotate 381 free text EC from 20 CT dedicated to various cancer types. This training dataset was used to develop a rule-based algorithm detecting POEC. This study provides a methodological approach to (semi)-automatically spot POEC and store them in a library considering advances in the field of CTRSS. The PENELOPE-C2Q pipeline is designed to feed the PENELOPE POEC library, both having the potential to facilitate the reuse of EHR data for better participation of patients to research.

如何(半)自动点预选导向的资格标准。
临床试验(CT)招募支持系统(CTRSS)在CT执行期间查询电子健康记录(EHR)以进行患者-试验匹配已得到扩展。由于自由文本CT资格标准(EC)不容易适用于EHR查询的自动化,因此基于EHR的CTRSS的配置需要耗时且通常是手动处理EC,重点关注那些在预纳入(预筛选)步骤中最相关的EC。本研究的目的是提供一种半自动检测面向预筛选的资格标准(POEC)的方法学方法,并建立一个POEC库,用于基于ehr的临床试验招募支持系统(CTRSS)的开发和评估。提出了一种将自由文本语料分解为标准化元素的方法,并开发了一种基于规则的算法来半自动检测语料。此外,本文还描述了可用于CTRSS评估的公开可用POEC库的特征。用一个包含17个域的96种基本EC模式的注释框架,对来自20个不同癌症类型CT的381个自由文本EC进行了注释。该训练数据集用于开发基于规则的POEC检测算法。考虑到CTRSS领域的进展,本研究提供了一种(半)自动识别POEC并将其存储在库中的方法方法。PENELOPE- c2q管道是为PENELOPE POEC库设计的,两者都有可能促进EHR数据的重用,以更好地参与患者的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信