YOLOv8-BCD: a real-time deep learning framework for pulmonary nodule detection in computed tomography imaging.

IF 2.3 2区 医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Quantitative Imaging in Medicine and Surgery Pub Date : 2025-09-01 Epub Date: 2025-08-12 DOI:10.21037/qims-2025-824
Wenjun Zhu, Xinyue Wang, Jie Xing, Xu Steven Xu, Min Yuan
{"title":"YOLOv8-BCD: a real-time deep learning framework for pulmonary nodule detection in computed tomography imaging.","authors":"Wenjun Zhu, Xinyue Wang, Jie Xing, Xu Steven Xu, Min Yuan","doi":"10.21037/qims-2025-824","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Lung cancer remains one of the malignant tumors with the highest global morbidity and mortality rates. Detecting pulmonary nodules in computed tomography (CT) images is essential for early lung cancer screening. However, traditional detection methods often suffer from low accuracy and efficiency, limiting their clinical effectiveness. This study aims to devise an advanced deep-learning framework capable of achieving high-precision, rapid identification of pulmonary nodules in CT imaging, thereby facilitating earlier and more accurate diagnosis of lung cancer.</p><p><strong>Methods: </strong>To address these issues, this paper proposes an improved deep-learning framework named YOLOv8-BCD, based on YOLOv8 and integrating the BiFormer attention mechanism, Content-Aware ReAssembly of Features (CARAFE) up-sampling method, and Depth-wise Over-Parameterized Depth-wise Convolution (DO-DConv) enhanced convolution. To overcome common challenges such as low resolution, noise, and artifacts in lung CT images, the model employs Super-Resolution Generative Adversarial Network (SRGAN)-based image enhancement during preprocessing. The BiFormer attention mechanism is introduced into the backbone to enhance feature extraction capabilities, particularly for small nodules, while CARAFE and DO-DConv modules are incorporated into the head to optimize feature fusion efficiency and reduce computational complexity.</p><p><strong>Results: </strong>Experimental comparisons using 550 CT images from the LUng Nodule Analysis 2016 dataset (LUNA16 dataset) demonstrated that the proposed YOLOv8-BCD achieved detection accuracy and mean average precision (mAP) at an intersection over union (IoU) threshold of 0.5 (mAP<sub>0.5</sub>) of 86.4% and 88.3%, respectively, surpassing YOLOv8 by 2.2% in accuracy, 4.5% in mAP<sub>0.5</sub>. Additional evaluation on the external TianChi lung nodule dataset further confirmed the model's generalization capability, achieving an mAP<sub>0.5</sub> of 83.8% and mAP<sub>0.5-0.95</sub> of 43.9% with an inference speed of 98 frames per second (FPS).</p><p><strong>Conclusions: </strong>The YOLOv8-BCD model effectively assists clinicians by significantly reducing interpretation time, improving diagnostic accuracy, and minimizing the risk of missed diagnoses, thereby enhancing patient outcomes.</p>","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 9","pages":"8189-8204"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397618/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-2025-824","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Lung cancer remains one of the malignant tumors with the highest global morbidity and mortality rates. Detecting pulmonary nodules in computed tomography (CT) images is essential for early lung cancer screening. However, traditional detection methods often suffer from low accuracy and efficiency, limiting their clinical effectiveness. This study aims to devise an advanced deep-learning framework capable of achieving high-precision, rapid identification of pulmonary nodules in CT imaging, thereby facilitating earlier and more accurate diagnosis of lung cancer.

Methods: To address these issues, this paper proposes an improved deep-learning framework named YOLOv8-BCD, based on YOLOv8 and integrating the BiFormer attention mechanism, Content-Aware ReAssembly of Features (CARAFE) up-sampling method, and Depth-wise Over-Parameterized Depth-wise Convolution (DO-DConv) enhanced convolution. To overcome common challenges such as low resolution, noise, and artifacts in lung CT images, the model employs Super-Resolution Generative Adversarial Network (SRGAN)-based image enhancement during preprocessing. The BiFormer attention mechanism is introduced into the backbone to enhance feature extraction capabilities, particularly for small nodules, while CARAFE and DO-DConv modules are incorporated into the head to optimize feature fusion efficiency and reduce computational complexity.

Results: Experimental comparisons using 550 CT images from the LUng Nodule Analysis 2016 dataset (LUNA16 dataset) demonstrated that the proposed YOLOv8-BCD achieved detection accuracy and mean average precision (mAP) at an intersection over union (IoU) threshold of 0.5 (mAP0.5) of 86.4% and 88.3%, respectively, surpassing YOLOv8 by 2.2% in accuracy, 4.5% in mAP0.5. Additional evaluation on the external TianChi lung nodule dataset further confirmed the model's generalization capability, achieving an mAP0.5 of 83.8% and mAP0.5-0.95 of 43.9% with an inference speed of 98 frames per second (FPS).

Conclusions: The YOLOv8-BCD model effectively assists clinicians by significantly reducing interpretation time, improving diagnostic accuracy, and minimizing the risk of missed diagnoses, thereby enhancing patient outcomes.

Abstract Image

Abstract Image

Abstract Image

YOLOv8-BCD:计算机断层成像中肺结节检测的实时深度学习框架。
背景:肺癌是全球发病率和死亡率最高的恶性肿瘤之一。在计算机断层扫描(CT)图像中发现肺结节是早期肺癌筛查的必要条件。然而,传统的检测方法往往准确性和效率较低,限制了其临床效果。本研究旨在设计一种先进的深度学习框架,能够在CT影像中实现对肺结节的高精度、快速识别,从而更早、更准确地诊断肺癌。方法:为了解决这些问题,本文提出了一个改进的深度学习框架,名为YOLOv8- bcd,该框架基于YOLOv8并集成了BiFormer注意机制、内容感知特征重组(CARAFE)上采样方法和深度感知过参数化深度感知卷积(DO-DConv)增强卷积。为了克服肺部CT图像中的低分辨率、噪声和伪影等常见问题,该模型在预处理过程中采用了基于超分辨率生成对抗网络(SRGAN)的图像增强技术。在主干中引入了BiFormer注意机制,以增强特征提取能力,特别是对于小结节,而在头部中加入了CARAFE和DO-DConv模块,以优化特征融合效率并降低计算复杂度。结果:使用肺结节分析2016数据集(LUNA16数据集)的550张CT图像进行实验比较,结果表明,所提出的YOLOv8- bcd在0.5 (mAP0.5)交叉点(IoU)阈值下的检测精度和平均精度(mAP)分别为86.4%和88.3%,准确度比YOLOv8高2.2%,比mAP0.5高4.5%。对外部天池肺结节数据集的进一步评价进一步证实了该模型的泛化能力,mAP0.5为83.8%,mAP0.5-0.95为43.9%,推理速度为98帧/秒(FPS)。结论:YOLOv8-BCD模型有效地帮助临床医生显著减少解释时间,提高诊断准确性,最大限度地降低漏诊风险,从而提高患者的预后。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Quantitative Imaging in Medicine and Surgery
Quantitative Imaging in Medicine and Surgery Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
4.20
自引率
17.90%
发文量
252
期刊介绍: Information not localized
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信