YOLOv8-BCD: a real-time deep learning framework for pulmonary nodule detection in computed tomography imaging.

IF 2.3 2区医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Quantitative Imaging in Medicine and Surgery Pub Date : 2025-09-01 Epub Date: 2025-08-12 DOI:10.21037/qims-2025-824

Wenjun Zhu, Xinyue Wang, Jie Xing, Xu Steven Xu, Min Yuan

{"title":"YOLOv8-BCD: a real-time deep learning framework for pulmonary nodule detection in computed tomography imaging.","authors":"Wenjun Zhu, Xinyue Wang, Jie Xing, Xu Steven Xu, Min Yuan","doi":"10.21037/qims-2025-824","DOIUrl":null,"url":null,"abstract":"Background: Lung cancer remains one of the malignant tumors with the highest global morbidity and mortality rates. Detecting pulmonary nodules in computed tomography (CT) images is essential for early lung cancer screening. However, traditional detection methods often suffer from low accuracy and efficiency, limiting their clinical effectiveness. This study aims to devise an advanced deep-learning framework capable of achieving high-precision, rapid identification of pulmonary nodules in CT imaging, thereby facilitating earlier and more accurate diagnosis of lung cancer.Methods: To address these issues, this paper proposes an improved deep-learning framework named YOLOv8-BCD, based on YOLOv8 and integrating the BiFormer attention mechanism, Content-Aware ReAssembly of Features (CARAFE) up-sampling method, and Depth-wise Over-Parameterized Depth-wise Convolution (DO-DConv) enhanced convolution. To overcome common challenges such as low resolution, noise, and artifacts in lung CT images, the model employs Super-Resolution Generative Adversarial Network (SRGAN)-based image enhancement during preprocessing. The BiFormer attention mechanism is introduced into the backbone to enhance feature extraction capabilities, particularly for small nodules, while CARAFE and DO-DConv modules are incorporated into the head to optimize feature fusion efficiency and reduce computational complexity.Results: Experimental comparisons using 550 CT images from the LUng Nodule Analysis 2016 dataset (LUNA16 dataset) demonstrated that the proposed YOLOv8-BCD achieved detection accuracy and mean average precision (mAP) at an intersection over union (IoU) threshold of 0.5 (mAP0.5) of 86.4% and 88.3%, respectively, surpassing YOLOv8 by 2.2% in accuracy, 4.5% in mAP0.5. Additional evaluation on the external TianChi lung nodule dataset further confirmed the model's generalization capability, achieving an mAP0.5 of 83.8% and mAP0.5-0.95 of 43.9% with an inference speed of 98 frames per second (FPS).Conclusions: The YOLOv8-BCD model effectively assists clinicians by significantly reducing interpretation time, improving diagnostic accuracy, and minimizing the risk of missed diagnoses, thereby enhancing patient outcomes.","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 9","pages":"8189-8204"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397618/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-2025-824","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Lung cancer remains one of the malignant tumors with the highest global morbidity and mortality rates. Detecting pulmonary nodules in computed tomography (CT) images is essential for early lung cancer screening. However, traditional detection methods often suffer from low accuracy and efficiency, limiting their clinical effectiveness. This study aims to devise an advanced deep-learning framework capable of achieving high-precision, rapid identification of pulmonary nodules in CT imaging, thereby facilitating earlier and more accurate diagnosis of lung cancer.

Methods: To address these issues, this paper proposes an improved deep-learning framework named YOLOv8-BCD, based on YOLOv8 and integrating the BiFormer attention mechanism, Content-Aware ReAssembly of Features (CARAFE) up-sampling method, and Depth-wise Over-Parameterized Depth-wise Convolution (DO-DConv) enhanced convolution. To overcome common challenges such as low resolution, noise, and artifacts in lung CT images, the model employs Super-Resolution Generative Adversarial Network (SRGAN)-based image enhancement during preprocessing. The BiFormer attention mechanism is introduced into the backbone to enhance feature extraction capabilities, particularly for small nodules, while CARAFE and DO-DConv modules are incorporated into the head to optimize feature fusion efficiency and reduce computational complexity.

Results: Experimental comparisons using 550 CT images from the LUng Nodule Analysis 2016 dataset (LUNA16 dataset) demonstrated that the proposed YOLOv8-BCD achieved detection accuracy and mean average precision (mAP) at an intersection over union (IoU) threshold of 0.5 (mAP_0.5) of 86.4% and 88.3%, respectively, surpassing YOLOv8 by 2.2% in accuracy, 4.5% in mAP_0.5. Additional evaluation on the external TianChi lung nodule dataset further confirmed the model's generalization capability, achieving an mAP_0.5 of 83.8% and mAP_0.5-0.95 of 43.9% with an inference speed of 98 frames per second (FPS).

Conclusions: The YOLOv8-BCD model effectively assists clinicians by significantly reducing interpretation time, improving diagnostic accuracy, and minimizing the risk of missed diagnoses, thereby enhancing patient outcomes.

Abstract Image

查看原文本刊更多论文

YOLOv8-BCD：计算机断层成像中肺结节检测的实时深度学习框架。

背景：肺癌是全球发病率和死亡率最高的恶性肿瘤之一。在计算机断层扫描（CT）图像中发现肺结节是早期肺癌筛查的必要条件。然而，传统的检测方法往往准确性和效率较低，限制了其临床效果。本研究旨在设计一种先进的深度学习框架，能够在CT影像中实现对肺结节的高精度、快速识别，从而更早、更准确地诊断肺癌。方法：为了解决这些问题，本文提出了一个改进的深度学习框架，名为YOLOv8- bcd，该框架基于YOLOv8并集成了BiFormer注意机制、内容感知特征重组（CARAFE）上采样方法和深度感知过参数化深度感知卷积（DO-DConv）增强卷积。为了克服肺部CT图像中的低分辨率、噪声和伪影等常见问题，该模型在预处理过程中采用了基于超分辨率生成对抗网络（SRGAN）的图像增强技术。在主干中引入了BiFormer注意机制，以增强特征提取能力，特别是对于小结节，而在头部中加入了CARAFE和DO-DConv模块，以优化特征融合效率并降低计算复杂度。结果：使用肺结节分析2016数据集（LUNA16数据集）的550张CT图像进行实验比较，结果表明，所提出的YOLOv8- bcd在0.5 （mAP0.5）交叉点（IoU）阈值下的检测精度和平均精度（mAP）分别为86.4%和88.3%，准确度比YOLOv8高2.2%，比mAP0.5高4.5%。对外部天池肺结节数据集的进一步评价进一步证实了该模型的泛化能力，mAP0.5为83.8%,mAP0.5-0.95为43.9%，推理速度为98帧/秒（FPS）。结论：YOLOv8-BCD模型有效地帮助临床医生显著减少解释时间，提高诊断准确性，最大限度地降低漏诊风险，从而提高患者的预后。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊