Enhanced pulmonary nodule detection with U-Net, YOLOv8, and swin transformer.

IF 2.9 3区医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

BMC Medical Imaging Pub Date : 2025-07-01 DOI:10.1186/s12880-025-01784-0

Xing Wang, Houde Wu, Longshuang Wang, Jingxu Chen, Yi Li, Xinliu He, Ting Chen, Minghui Wang, Li Guo

{"title":"Enhanced pulmonary nodule detection with U-Net, YOLOv8, and swin transformer.","authors":"Xing Wang, Houde Wu, Longshuang Wang, Jingxu Chen, Yi Li, Xinliu He, Ting Chen, Minghui Wang, Li Guo","doi":"10.1186/s12880-025-01784-0","DOIUrl":null,"url":null,"abstract":"Rationale and objectives: Lung cancer remains the leading cause of cancer-related mortality worldwide, emphasizing the critical need for early pulmonary nodule detection to improve patient outcomes. Current methods encounter challenges in detecting small nodules and exhibit high false positive rates, placing an additional diagnostic burden on radiologists. This study aimed to develop a two-stage deep learning model integrating U-Net, Yolov8s, and the Swin transformer to enhance pulmonary nodule detection in computer tomography (CT) images, particularly for small nodules, with the goal of improving detection accuracy and reducing false positives.Materials and methods: We utilized the LUNA16 dataset (888 CT scans) and an additional 308 CT scans from Tianjin Chest Hospital. Images were preprocessed for consistency. The proposed model first employs U-Net for precise lung segmentation, followed by Yolov8s augmented with the Swin transformer for nodule detection. The Shape-aware IoU (SIoU) loss function was implemented to improve bounding box predictions.Results: For the LUNA16 dataset, the model achieved a precision of 0.898, a recall of 0.851, and a mean average precision at 50% IoU (mAP50) of 0.879, outperforming state-of-the-art models. The Tianjin Chest Hospital dataset has a precision of 0.855, a recall of 0.872, and an mAP50 of 0.862.Conclusion: This study presents a two-stage deep learning model that leverages U-Net, Yolov8s, and the Swin transformer for enhanced pulmonary nodule detection in CT images. The model demonstrates high accuracy and a reduced false positive rate, suggesting its potential as a useful tool for early lung cancer diagnosis and treatment.","PeriodicalId":9020,"journal":{"name":"BMC Medical Imaging","volume":"25 1","pages":"247"},"PeriodicalIF":2.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12880-025-01784-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Rationale and objectives: Lung cancer remains the leading cause of cancer-related mortality worldwide, emphasizing the critical need for early pulmonary nodule detection to improve patient outcomes. Current methods encounter challenges in detecting small nodules and exhibit high false positive rates, placing an additional diagnostic burden on radiologists. This study aimed to develop a two-stage deep learning model integrating U-Net, Yolov8s, and the Swin transformer to enhance pulmonary nodule detection in computer tomography (CT) images, particularly for small nodules, with the goal of improving detection accuracy and reducing false positives.

Materials and methods: We utilized the LUNA16 dataset (888 CT scans) and an additional 308 CT scans from Tianjin Chest Hospital. Images were preprocessed for consistency. The proposed model first employs U-Net for precise lung segmentation, followed by Yolov8s augmented with the Swin transformer for nodule detection. The Shape-aware IoU (SIoU) loss function was implemented to improve bounding box predictions.

Results: For the LUNA16 dataset, the model achieved a precision of 0.898, a recall of 0.851, and a mean average precision at 50% IoU (mAP50) of 0.879, outperforming state-of-the-art models. The Tianjin Chest Hospital dataset has a precision of 0.855, a recall of 0.872, and an mAP50 of 0.862.

Conclusion: This study presents a two-stage deep learning model that leverages U-Net, Yolov8s, and the Swin transformer for enhanced pulmonary nodule detection in CT images. The model demonstrates high accuracy and a reduced false positive rate, suggesting its potential as a useful tool for early lung cancer diagnosis and treatment.

查看原文本刊更多论文

U-Net、YOLOv8和swin变压器增强肺结节检测。

理由和目的：肺癌仍然是世界范围内癌症相关死亡的主要原因，强调早期肺结节检测以改善患者预后的关键必要性。目前的方法在检测小结节方面遇到了挑战，并表现出高假阳性率，给放射科医生带来了额外的诊断负担。本研究旨在开发一种集成U-Net、Yolov8s和Swin变压器的两阶段深度学习模型，以增强计算机断层扫描（CT）图像中的肺结节检测，特别是对小结节的检测，目的是提高检测准确性并减少假阳性。材料和方法：我们利用LUNA16数据集（888个CT扫描）和天津胸科医院的308个CT扫描。图像经过预处理以保持一致性。该模型首先使用U-Net进行精确的肺分割，然后使用Yolov8s增强Swin变压器进行结节检测。实现了形状感知IoU （SIoU）损失函数以改进边界盒预测。结果：对于LUNA16数据集，该模型的精度为0.898，召回率为0.851,50% IoU （mAP50）的平均精度为0.879，优于最先进的模型。天津胸科医院数据集的准确率为0.855，召回率为0.872，mAP50为0.862。结论：本研究提出了一种两阶段深度学习模型，该模型利用U-Net、Yolov8s和Swin变压器增强CT图像中的肺结节检测。该模型具有较高的准确性和较低的假阳性率，表明其有潜力成为早期肺癌诊断和治疗的有用工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.60

自引率

3.70%

发文量

198

审稿时长

27 weeks

期刊介绍： BMC Medical Imaging is an open access journal publishing original peer-reviewed research articles in the development, evaluation, and use of imaging techniques and image processing tools to diagnose and manage disease.