多任务人工智能辅助小肠胶囊内窥镜系统的开发与验证。

IF 2.1 4区医学 Q2 MEDICINE, GENERAL & INTERNAL

International Journal of General Medicine Pub Date : 2025-05-12 eCollection Date: 2025-01-01 DOI:10.2147/IJGM.S522587

Jian Chen, Hongwei Wang, Zihao Zhang, Kaijian Xia, Fuli Gao, Xiaodan Xu, Ganhong Wang

{"title":"多任务人工智能辅助小肠胶囊内窥镜系统的开发与验证。","authors":"Jian Chen, Hongwei Wang, Zihao Zhang, Kaijian Xia, Fuli Gao, Xiaodan Xu, Ganhong Wang","doi":"10.2147/IJGM.S522587","DOIUrl":null,"url":null,"abstract":"Objective: To develop a multi-task artificial intelligence-assisted system for small bowel capsule endoscopy (SBCE) based on various Transformer neural network architectures. The system integrates lesion recognition, cumulative time statistics, and progress bar marking functions to enhance the efficiency and accuracy of endoscopic image interpretation while effectively reducing missed diagnoses.Methods: A dataset comprising 12 annotated categories of images captured by three different brands of capsule endoscopy devices was collected. Transfer learning and fine-tuning were conducted on five pre-trained Transformer models. Performance metrics, including accuracy, sensitivity, specificity, and recognition speed, were evaluated to select the best-performing model. The optimal model was converted from PyTorch to Open Neural Network Exchange (ONNX) format. Using OpenCV and MMCV tools, a multi-task SBCE-assisted reading system was developed.Results: A total of 34,799 images were included in the study. The best-performing model, FocalNet, achieved a weighted average sensitivity of 85.69%, specificity of 98.58%, accuracy of 85.69%, and an AUC of 0.98 across all categories. Its diagnostic accuracy outperformed junior physicians (χ²=17.26, p<0.05) and showed no statistical difference compared to senior physicians (χ²=0.0716, p>0.05). The multi-task AI-assisted reading system, \"FocalCE-Master\", developed based on FocalNet, achieved a diagnostic speed of 592.40 frames per second, significantly faster than endoscopists. By integrating cumulative time bar charts with progress bar marking functionality, the system enables rapid localization and review of lesions, effectively streamlining the diagnostic workflow of SBCE.Conclusion: The multi-task SBCE-assisted reading system developed using Transformer networks demonstrated rapid and accurate classification of various small bowel lesions. It holds significant potential in enhancing diagnostic efficiency and image review speed for endoscopists. However, the AI system has not yet been validated in prospective clinical trials, and further real-world studies are needed to confirm its clinical applicability.","PeriodicalId":14131,"journal":{"name":"International Journal of General Medicine","volume":"18 ","pages":"2521-2536"},"PeriodicalIF":2.1000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083486/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and Validation of a Multi-Task Artificial Intelligence-Assisted System for Small Bowel Capsule Endoscopy.\",\"authors\":\"Jian Chen, Hongwei Wang, Zihao Zhang, Kaijian Xia, Fuli Gao, Xiaodan Xu, Ganhong Wang\",\"doi\":\"10.2147/IJGM.S522587\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective: To develop a multi-task artificial intelligence-assisted system for small bowel capsule endoscopy (SBCE) based on various Transformer neural network architectures. The system integrates lesion recognition, cumulative time statistics, and progress bar marking functions to enhance the efficiency and accuracy of endoscopic image interpretation while effectively reducing missed diagnoses.Methods: A dataset comprising 12 annotated categories of images captured by three different brands of capsule endoscopy devices was collected. Transfer learning and fine-tuning were conducted on five pre-trained Transformer models. Performance metrics, including accuracy, sensitivity, specificity, and recognition speed, were evaluated to select the best-performing model. The optimal model was converted from PyTorch to Open Neural Network Exchange (ONNX) format. Using OpenCV and MMCV tools, a multi-task SBCE-assisted reading system was developed.Results: A total of 34,799 images were included in the study. The best-performing model, FocalNet, achieved a weighted average sensitivity of 85.69%, specificity of 98.58%, accuracy of 85.69%, and an AUC of 0.98 across all categories. Its diagnostic accuracy outperformed junior physicians (χ²=17.26, p<0.05) and showed no statistical difference compared to senior physicians (χ²=0.0716, p>0.05). The multi-task AI-assisted reading system, \\\"FocalCE-Master\\\", developed based on FocalNet, achieved a diagnostic speed of 592.40 frames per second, significantly faster than endoscopists. By integrating cumulative time bar charts with progress bar marking functionality, the system enables rapid localization and review of lesions, effectively streamlining the diagnostic workflow of SBCE.Conclusion: The multi-task SBCE-assisted reading system developed using Transformer networks demonstrated rapid and accurate classification of various small bowel lesions. It holds significant potential in enhancing diagnostic efficiency and image review speed for endoscopists. However, the AI system has not yet been validated in prospective clinical trials, and further real-world studies are needed to confirm its clinical applicability.\",\"PeriodicalId\":14131,\"journal\":{\"name\":\"International Journal of General Medicine\",\"volume\":\"18 \",\"pages\":\"2521-2536\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083486/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of General Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2147/IJGM.S522587\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of General Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/IJGM.S522587","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

摘要

目的：开发一种基于各种Transformer神经网络架构的多任务小肠胶囊内窥镜人工智能辅助系统。系统集成病变识别、累积时间统计、进度条标注等功能，提高内镜图像判读的效率和准确性，同时有效减少漏诊。方法：收集由三种不同品牌胶囊内窥镜设备捕获的12个带注释的图像类别的数据集。对5个预训练的Transformer模型进行迁移学习和微调。评估性能指标，包括准确性、灵敏度、特异性和识别速度，以选择性能最佳的模型。将最优模型从PyTorch转换为Open Neural Network Exchange （ONNX）格式。利用OpenCV和MMCV工具，开发了一个多任务sbce辅助阅读系统。结果：共纳入34,799张图像。表现最好的模型FocalNet的加权平均灵敏度为85.69%，特异性为98.58%，准确率为85.69%，所有类别的AUC为0.98。其诊断正确率优于初级医师（χ²=17.26,pχ²=0.0716,p < 0.05）。基于FocalNet开发的多任务人工智能辅助阅读系统“FocalCE-Master”的诊断速度达到592.40帧/秒，明显快于内窥镜医生。通过集成累积时间条形图和进度条标记功能，该系统能够快速定位和检查病变，有效地简化了SBCE的诊断工作流程。结论：使用Transformer网络开发的多任务sbce辅助阅读系统能够快速准确地分类各种小肠病变。它在提高内窥镜医生的诊断效率和图像审查速度方面具有重要的潜力。然而，人工智能系统尚未在前瞻性临床试验中得到验证，需要进一步的现实世界研究来证实其临床适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Development and Validation of a Multi-Task Artificial Intelligence-Assisted System for Small Bowel Capsule Endoscopy.

Objective: To develop a multi-task artificial intelligence-assisted system for small bowel capsule endoscopy (SBCE) based on various Transformer neural network architectures. The system integrates lesion recognition, cumulative time statistics, and progress bar marking functions to enhance the efficiency and accuracy of endoscopic image interpretation while effectively reducing missed diagnoses.

Methods: A dataset comprising 12 annotated categories of images captured by three different brands of capsule endoscopy devices was collected. Transfer learning and fine-tuning were conducted on five pre-trained Transformer models. Performance metrics, including accuracy, sensitivity, specificity, and recognition speed, were evaluated to select the best-performing model. The optimal model was converted from PyTorch to Open Neural Network Exchange (ONNX) format. Using OpenCV and MMCV tools, a multi-task SBCE-assisted reading system was developed.

Results: A total of 34,799 images were included in the study. The best-performing model, FocalNet, achieved a weighted average sensitivity of 85.69%, specificity of 98.58%, accuracy of 85.69%, and an AUC of 0.98 across all categories. Its diagnostic accuracy outperformed junior physicians (χ²=17.26, p<0.05) and showed no statistical difference compared to senior physicians (χ²=0.0716, p>0.05). The multi-task AI-assisted reading system, "FocalCE-Master", developed based on FocalNet, achieved a diagnostic speed of 592.40 frames per second, significantly faster than endoscopists. By integrating cumulative time bar charts with progress bar marking functionality, the system enables rapid localization and review of lesions, effectively streamlining the diagnostic workflow of SBCE.

Conclusion: The multi-task SBCE-assisted reading system developed using Transformer networks demonstrated rapid and accurate classification of various small bowel lesions. It holds significant potential in enhancing diagnostic efficiency and image review speed for endoscopists. However, the AI system has not yet been validated in prospective clinical trials, and further real-world studies are needed to confirm its clinical applicability.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of General Medicine Medicine-General Medicine

自引率

0.00%

发文量

1113

审稿时长

16 weeks

期刊介绍： The International Journal of General Medicine is an international, peer-reviewed, open access journal that focuses on general and internal medicine, pathogenesis, epidemiology, diagnosis, monitoring and treatment protocols. The journal is characterized by the rapid reporting of reviews, original research and clinical studies across all disease areas. A key focus of the journal is the elucidation of disease processes and management protocols resulting in improved outcomes for the patient. Patient perspectives such as satisfaction, quality of life, health literacy and communication and their role in developing new healthcare programs and optimizing clinical outcomes are major areas of interest for the journal. As of 1st April 2019, the International Journal of General Medicine will no longer consider meta-analyses for publication.