Improving the Reporting Quality of Studies on Information Extraction From Clinical Texts: Protocol for the Development of a Consensus-Based Reporting Guideline.

IF 1.5 Q3 HEALTH CARE SCIENCES & SERVICES

JMIR Research Protocols Pub Date : 2025-09-24 DOI:10.2196/76776

Daniel Reichenpfader, Henning Müller, Kerstin Denecke

{"title":"Improving the Reporting Quality of Studies on Information Extraction From Clinical Texts: Protocol for the Development of a Consensus-Based Reporting Guideline.","authors":"Daniel Reichenpfader, Henning Müller, Kerstin Denecke","doi":"10.2196/76776","DOIUrl":null,"url":null,"abstract":"Background: Information extraction (IE) from clinical texts is increasingly important in health care; yet, reporting practices remain inconsistent. Existing guidelines do not fully address the unique challenges of IE studies. IE methods vary widely in their design, ranging from rule-based systems to advanced large language models, contributing to heterogeneity in reporting. While several reporting frameworks exist for applications of artificial intelligence in health care, they primarily focus on prediction modeling or clinical trials and associated protocols rather than text-based IE.Objective: This study aims to develop the Clinical Information Extraction (CINEX) guideline, a consensus-based reporting guideline for studies on clinical IE.Methods: The CINEX guideline is developed following an established guideline methodology, including a 3-round electronic Delphi (eDelphi) study with domain experts and a final in-person consensus meeting. The eDelphi process includes feedback loops and predefined consensus thresholds, with items rated on a 10-point scale for both relevance and maturity. The final consensus meeting is held as a hybrid workshop at the MEDINFO 2025 conference and focuses on finalizing the items that reached consensus.Results: Our results will provide a validated reporting guideline for studies on clinical IE. A preliminary set of 28 reporting items was drafted from a scoping review and existing frameworks. The draft guidelines include 5 key dimensions: information model, architecture, data, annotation, and outcome. This draft guideline will be refined through the eDelphi process. It is designed to be technology-agnostic and applicable across diverse IE approaches, including not only large language models but also traditional machine learning methods and rule-based and hybrid systems.Conclusions: The CINEX guideline provides structured, expert-validated guidance for reporting clinical IE studies, improving transparency, reproducibility, and comparability. The final guideline will be disseminated alongside an explanatory document to support adoption and implementation.","PeriodicalId":14755,"journal":{"name":"JMIR Research Protocols","volume":"14 ","pages":"e76776"},"PeriodicalIF":1.5000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12459736/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Research Protocols","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/76776","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Information extraction (IE) from clinical texts is increasingly important in health care; yet, reporting practices remain inconsistent. Existing guidelines do not fully address the unique challenges of IE studies. IE methods vary widely in their design, ranging from rule-based systems to advanced large language models, contributing to heterogeneity in reporting. While several reporting frameworks exist for applications of artificial intelligence in health care, they primarily focus on prediction modeling or clinical trials and associated protocols rather than text-based IE.

Objective: This study aims to develop the Clinical Information Extraction (CINEX) guideline, a consensus-based reporting guideline for studies on clinical IE.

Methods: The CINEX guideline is developed following an established guideline methodology, including a 3-round electronic Delphi (eDelphi) study with domain experts and a final in-person consensus meeting. The eDelphi process includes feedback loops and predefined consensus thresholds, with items rated on a 10-point scale for both relevance and maturity. The final consensus meeting is held as a hybrid workshop at the MEDINFO 2025 conference and focuses on finalizing the items that reached consensus.

Results: Our results will provide a validated reporting guideline for studies on clinical IE. A preliminary set of 28 reporting items was drafted from a scoping review and existing frameworks. The draft guidelines include 5 key dimensions: information model, architecture, data, annotation, and outcome. This draft guideline will be refined through the eDelphi process. It is designed to be technology-agnostic and applicable across diverse IE approaches, including not only large language models but also traditional machine learning methods and rule-based and hybrid systems.

Conclusions: The CINEX guideline provides structured, expert-validated guidance for reporting clinical IE studies, improving transparency, reproducibility, and comparability. The final guideline will be disseminated alongside an explanatory document to support adoption and implementation.

查看原文本刊更多论文

提高临床文献信息提取研究的报告质量：制定基于共识的报告指南的协议。

背景：临床文本信息提取（IE）在卫生保健中越来越重要；然而，报告实践仍然不一致。现有的指导方针并没有完全解决IE研究的独特挑战。IE方法的设计差异很大，从基于规则的系统到先进的大型语言模型，这导致了报告的异质性。虽然存在一些人工智能在医疗保健中的应用报告框架，但它们主要侧重于预测建模或临床试验和相关协议，而不是基于文本的IE。目的：本研究旨在制定临床IE研究的共识报告指南——临床信息提取（CINEX）指南。方法：CINEX指南是根据既定的指南方法制定的，包括与领域专家进行的3轮电子德尔菲（eDelphi）研究和最终的面对面共识会议。eDelphi流程包括反馈循环和预定义的共识阈值，并对项目的相关性和成熟度进行10分制评分。最终共识会议作为混合研讨会在2025年MEDINFO会议上举行，重点是最后确定达成共识的项目。结果：我们的研究结果将为临床IE研究提供一个有效的报告指南。根据范围审查和现有框架起草了一套初步的28个报告项目。指南草案包括5个关键维度：信息模型、体系结构、数据、注释和结果。本指南草案将通过eDelphi过程进行完善。它的设计与技术无关，适用于各种IE方法，不仅包括大型语言模型，还包括传统的机器学习方法和基于规则的混合系统。结论：CINEX指南为临床IE研究报告提供了结构化的、专家验证的指导，提高了透明度、可重复性和可比性。最终准则将与一份解释性文件一起散发，以支持通过和执行。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊