Guideline-driven clinical decision support for colonoscopy patients using the hierarchical multi-label deep learning method.

IF 7.5 3区医学 Q1 MEDICINE, GENERAL & INTERNAL

Chinese Medical Journal Pub Date : 2025-05-23 DOI:10.1097/CM9.0000000000003469

Junling Wu, Jun Chen, Hanwen Zhang, Zhe Luan, Yiming Zhao, Mengxuan Sun, Shufang Wang, Congyong Li, Zhizhuang Zhao, Wei Zhang, Yi Chen, Jiaqi Zhang, Yansheng Li, Kejia Liu, Jinghao Niu, Gang Sun

{"title":"Guideline-driven clinical decision support for colonoscopy patients using the hierarchical multi-label deep learning method.","authors":"Junling Wu, Jun Chen, Hanwen Zhang, Zhe Luan, Yiming Zhao, Mengxuan Sun, Shufang Wang, Congyong Li, Zhizhuang Zhao, Wei Zhang, Yi Chen, Jiaqi Zhang, Yansheng Li, Kejia Liu, Jinghao Niu, Gang Sun","doi":"10.1097/CM9.0000000000003469","DOIUrl":null,"url":null,"abstract":"Background: Over 20 million colonoscopies are performed in China annually. An automatic clinical decision support system (CDSS) with accurate semantic recognition of colonoscopy reports and guideline-based is helpful to relieve the increasing medical burden and standardize the healthcare. In this study, the CDSS was built under a hierarchical-label interpretable classification framework, trained by a state-of-the-art transformer-based model, and validated in a multi-center style.Methods: We conducted stratified sampling on a previously established dataset containing 302,965 electronic colonoscopy reports with pathology, identified 2041 records representative of overall features, and randomly divided into the training and testing sets (7:3). A total of 5 main labels and 22 sublabels were applied to annotate each record on a network platform, and the data were trained respectively by three pre-training models on Chinese corpus website, including BERT-base-Chinese (BC), the BERT-wwm-ext-Chinese (BWEC), and ernie-3.0-base-zh (E3BZ). The performance of trained models was subsequently compared with a randomly initialized model, and the preferred model was selected. Model fine-tuning was applied to further enhance the capacity. The system was validated in five other hospitals with 3177 consecutive colonoscopy cases.Results: The E3BZ pre-trained model exhibited the best performance, with a 90.18% accuracy and a 69.14% Macro-F1 score overall. The model achieved 100% accuracy in identifying cancer cases and 99.16% for normal cases. In external validation, the model exhibited favorable consistency and good performance among five hospitals.Conclusions: The novel CDSS possesses high-level semantic recognition of colonoscopy reports, provides appropriate recommendations, and holds the potential to be a powerful tool for physicians and patients. The hierarchical multi-label strategy and pre-training method should be amendable to manage more medical text in the future.","PeriodicalId":10183,"journal":{"name":"Chinese Medical Journal","volume":" ","pages":""},"PeriodicalIF":7.5000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Medical Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/CM9.0000000000003469","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Over 20 million colonoscopies are performed in China annually. An automatic clinical decision support system (CDSS) with accurate semantic recognition of colonoscopy reports and guideline-based is helpful to relieve the increasing medical burden and standardize the healthcare. In this study, the CDSS was built under a hierarchical-label interpretable classification framework, trained by a state-of-the-art transformer-based model, and validated in a multi-center style.

Methods: We conducted stratified sampling on a previously established dataset containing 302,965 electronic colonoscopy reports with pathology, identified 2041 records representative of overall features, and randomly divided into the training and testing sets (7:3). A total of 5 main labels and 22 sublabels were applied to annotate each record on a network platform, and the data were trained respectively by three pre-training models on Chinese corpus website, including BERT-base-Chinese (BC), the BERT-wwm-ext-Chinese (BWEC), and ernie-3.0-base-zh (E3BZ). The performance of trained models was subsequently compared with a randomly initialized model, and the preferred model was selected. Model fine-tuning was applied to further enhance the capacity. The system was validated in five other hospitals with 3177 consecutive colonoscopy cases.

Results: The E3BZ pre-trained model exhibited the best performance, with a 90.18% accuracy and a 69.14% Macro-F1 score overall. The model achieved 100% accuracy in identifying cancer cases and 99.16% for normal cases. In external validation, the model exhibited favorable consistency and good performance among five hospitals.

Conclusions: The novel CDSS possesses high-level semantic recognition of colonoscopy reports, provides appropriate recommendations, and holds the potential to be a powerful tool for physicians and patients. The hierarchical multi-label strategy and pre-training method should be amendable to manage more medical text in the future.

查看原文本刊更多论文

使用分层多标签深度学习方法的指南驱动的结肠镜检查患者临床决策支持。

背景：中国每年有超过2000万例结肠镜检查。一个对结肠镜检查报告进行准确语义识别并基于指南的自动临床决策支持系统（CDSS）有助于减轻日益增加的医疗负担和规范医疗保健。在本研究中，CDSS是在一个层次标签可解释的分类框架下建立的，由最先进的基于变压器的模型训练，并以多中心的方式进行验证。方法：我们对先前建立的包含302,965份带病理的电子结肠镜检查报告的数据集进行分层抽样，识别出具有总体特征的2041份记录，并随机分为训练集和测试集（7:3）。使用5个主标签和22个子标签对网络平台上的每条记录进行标注，并在中文语料库网站上分别使用BERT-base-Chinese （BC）、bert - wm-ext-Chinese （BWEC）和erne -3.0-base-zh （E3BZ）三种预训练模型对数据进行训练。随后将训练后的模型与随机初始化的模型进行性能比较，选择首选模型。采用模型微调进一步提高了容量。该系统在其他五家医院进行了3177例连续结肠镜检查。结果：E3BZ预训练模型表现最佳，准确率为90.18%，宏观f1总分为69.14%。该模型识别癌症病例的准确率为100%，正常病例的准确率为99.16%。在外部验证中，模型在五家医院间具有较好的一致性和良好的性能。结论：新的CDSS对结肠镜检查报告具有高水平的语义识别，提供适当的建议，并有可能成为医生和患者的有力工具。分级多标签策略和预训练方法有待改进，以在未来管理更多的医学文本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Chinese Medical Journal 医学-医学：内科

CiteScore

9.80

自引率

4.90%

发文量

19245

审稿时长

6 months

期刊介绍： The Chinese Medical Journal (CMJ) is published semimonthly in English by the Chinese Medical Association, and is a peer reviewed general medical journal for all doctors, researchers, and health workers regardless of their medical specialty or type of employment. Established in 1887, it is the oldest medical periodical in China and is distributed worldwide. The journal functions as a window into China’s medical sciences and reflects the advances and progress in China’s medical sciences and technology. It serves the objective of international academic exchange. The journal includes Original Articles, Editorial, Review Articles, Medical Progress, Brief Reports, Case Reports, Viewpoint, Clinical Exchange, Letter,and News,etc. CMJ is abstracted or indexed in many databases including Biological Abstracts, Chemical Abstracts, Index Medicus/Medline, Science Citation Index (SCI), Current Contents, Cancerlit, Health Plan & Administration, Embase, Social Scisearch, Aidsline, Toxline, Biocommercial Abstracts, Arts and Humanities Search, Nuclear Science Abstracts, Water Resources Abstracts, Cab Abstracts, Occupation Safety & Health, etc. In 2007, the impact factor of the journal by SCI is 0.636, and the total citation is 2315.