Junling Wu, Jun Chen, Hanwen Zhang, Zhe Luan, Yiming Zhao, Mengxuan Sun, Shufang Wang, Congyong Li, Zhizhuang Zhao, Wei Zhang, Yi Chen, Jiaqi Zhang, Yansheng Li, Kejia Liu, Jinghao Niu, Gang Sun
Title: Guideline-driven clinical decision support for colonoscopy patients using the hierarchical multi-label deep learning method
Journal: Chinese Medical Journal (impact factor 7.5; JCR Q1, Medicine, General & Internal)
Published: 2025-05-23
DOI: 10.1097/CM9.0000000000003469 (https://doi.org/10.1097/CM9.0000000000003469)
Citations: 0
Abstract
Background: Over 20 million colonoscopies are performed in China annually. An automatic, guideline-based clinical decision support system (CDSS) that accurately recognizes the semantics of colonoscopy reports could help relieve the growing medical burden and standardize care. In this study, the CDSS was built on an interpretable hierarchical-label classification framework, trained with a state-of-the-art transformer-based model, and validated across multiple centers.
Methods: We performed stratified sampling on a previously established dataset of 302,965 electronic colonoscopy reports with pathology, identified 2041 records representative of the overall features, and randomly divided them into training and testing sets (7:3). Each record was annotated on an online platform with 5 main labels and 22 sublabels. Three models pre-trained on Chinese corpora were then trained on the data separately: BERT-base-Chinese (BC), BERT-wwm-ext-Chinese (BWEC), and ernie-3.0-base-zh (E3BZ). Their performance was compared with that of a randomly initialized model, and the preferred model was selected. Fine-tuning was applied to further improve its performance. The system was validated on 3177 consecutive colonoscopy cases from five other hospitals.
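The sampling and splitting steps described in the Methods can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the stratum key (`finding`), the toy record pool, the sampling fraction, and both function names are hypothetical.

```python
import random
from collections import defaultdict


def stratified_sample(records, key, fraction, seed=0):
    """Draw roughly `fraction` of the records from every stratum."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for rec in records:
        strata[key(rec)].append(rec)
    sample = []
    for _, group in sorted(strata.items()):
        k = max(1, round(len(group) * fraction))  # keep at least one per stratum
        sample.extend(rng.sample(group, k))
    return sample


def random_split(records, train_ratio=0.7, seed=0):
    """Shuffle the records and cut them into training and testing sets."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    cut = round(len(shuffled) * train_ratio)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical toy pool: 250 "polyp" reports and 750 "normal" reports.
pool = [{"id": i, "finding": "polyp" if i % 4 == 0 else "normal"}
        for i in range(1000)]
subset = stratified_sample(pool, key=lambda r: r["finding"], fraction=0.1)
train, test = random_split(subset, train_ratio=0.7)
print(len(subset), len(train), len(test))
```

Sampling within each stratum keeps rare report types in the subset in proportion to their share of the full pool, which a plain random draw does not guarantee.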
Results: The E3BZ pre-trained model performed best, with an overall accuracy of 90.18% and a Macro-F1 score of 69.14%. It achieved 100% accuracy in identifying cancer cases and 99.16% accuracy for normal cases. In external validation, the model showed favorable consistency and good performance across the five hospitals.
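The gap between accuracy and Macro-F1 reported above is typical of multi-label problems with rare labels, because Macro-F1 weights every label equally regardless of how often it occurs. The sketch below illustrates this on made-up data. The abstract does not state how its accuracy was computed, so the per-decision definition used here is an assumption, as are the function names and the toy labels.

```python
def f1_per_label(y_true, y_pred, n_labels):
    """Return the F1 score of each label column (0.0 if a label never occurs)."""
    scores = []
    for j in range(n_labels):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 1)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 0 and p[j] == 1)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 0)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return scores


def macro_f1(y_true, y_pred, n_labels):
    """Unweighted mean of the per-label F1 scores."""
    scores = f1_per_label(y_true, y_pred, n_labels)
    return sum(scores) / len(scores)


def label_accuracy(y_true, y_pred):
    """Fraction of individual label decisions that match (assumed definition)."""
    pairs = [(t, p) for ts, ps in zip(y_true, y_pred) for t, p in zip(ts, ps)]
    return sum(1 for t, p in pairs if t == p) / len(pairs)


# Toy data: label 0 is common and always right; label 1 is rare and missed once.
y_true = [[1, 0]] * 4 + [[0, 0]] * 4 + [[0, 1]] * 2
y_pred = [[1, 0]] * 4 + [[0, 0]] * 4 + [[0, 1], [0, 0]]

print(f"{label_accuracy(y_true, y_pred):.3f}")  # 0.950
print(f"{macro_f1(y_true, y_pred, 2):.3f}")     # 0.833
```

A single missed case on the rare label costs one decision out of twenty in accuracy but pulls that label's F1 down to 2/3, which is why the two numbers diverge.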
Conclusions: The novel CDSS achieves high-level semantic recognition of colonoscopy reports, provides appropriate recommendations, and has the potential to be a powerful tool for physicians and patients. The hierarchical multi-label strategy and pre-training method should be amenable to handling other medical text in the future.
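One way to picture a hierarchical multi-label strategy is as a decoding rule in which a sublabel is reported only when its parent main label is also predicted. The sketch below assumes that rule. The label names, the 0.5 threshold, and the `decode` function are hypothetical and are not taken from the paper, which uses 5 main labels and 22 sublabels.

```python
# Hypothetical hierarchy: names are illustrative, not the paper's label set.
HIERARCHY = {
    "polyp": ["polyp_small", "polyp_large"],
    "inflammation": ["inflammation_mild", "inflammation_severe"],
    "normal": [],
}


def decode(scores, threshold=0.5):
    """Keep main labels above the threshold, then only their own sublabels."""
    result = {}
    for main, subs in HIERARCHY.items():
        if scores.get(main, 0.0) >= threshold:
            # Sublabels are considered only when the parent label is active.
            result[main] = [s for s in subs if scores.get(s, 0.0) >= threshold]
    return result


# Made-up per-label scores, e.g. sigmoid outputs of a classifier head.
scores = {
    "polyp": 0.92, "polyp_small": 0.81, "polyp_large": 0.10,
    "inflammation": 0.20, "inflammation_severe": 0.70,
    "normal": 0.03,
}
print(decode(scores))  # {'polyp': ['polyp_small']}
```

Here `inflammation_severe` scores 0.70 but is dropped because its parent `inflammation` falls below the threshold, so the output never contains a sublabel without its main label.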
About the journal:
The Chinese Medical Journal (CMJ) is published semimonthly in English by the Chinese Medical Association. It is a peer-reviewed general medical journal for all doctors, researchers, and health workers, regardless of their medical specialty or type of employment. Established in 1887, it is the oldest medical periodical in China and is distributed worldwide. The journal functions as a window into China’s medical sciences, reflects advances in China’s medical sciences and technology, and serves the objective of international academic exchange. The journal includes Original Articles, Editorial, Review Articles, Medical Progress, Brief Reports, Case Reports, Viewpoint, Clinical Exchange, Letter, and News, etc. CMJ is abstracted or indexed in many databases, including Biological Abstracts, Chemical Abstracts, Index Medicus/Medline, Science Citation Index (SCI), Current Contents, Cancerlit, Health Plan & Administration, Embase, Social Scisearch, Aidsline, Toxline, Biocommercial Abstracts, Arts and Humanities Search, Nuclear Science Abstracts, Water Resources Abstracts, Cab Abstracts, Occupation Safety & Health, etc. In 2007, the journal's SCI impact factor was 0.636, with 2315 total citations.