ndcnn：一种基于改进YOLOv7的肺结节检测模型

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biomedical Signal Processing and Control Pub Date : 2025-10-08 DOI:10.1016/j.bspc.2025.108754

Lu Meng , Lijun Zhou

{"title":"ndcnn：一种基于改进YOLOv7的肺结节检测模型","authors":"Lu Meng , Lijun Zhou","doi":"10.1016/j.bspc.2025.108754","DOIUrl":null,"url":null,"abstract":"<div><div>The automatic detection of lung nodules is crucial for cancer screening. However, lung nodules typically occupy small areas in CT images with indistinct features, posing significant detection challenges. To address this, YOLOv7 is optimized and a lung nodule detection model is proposed. Firstly, considering the problem of feature loss caused by the PAFPN (Path Aggregation Feature Pyramid Network), an Enhanced Feature Pyramid Network (E_FPN) is introduced, which employs dilated convolutions in series to adjust the receptive field size and supplement contextual information. The dilation rate is designed to fully utilize fine-grained features. Then, an adaptive fusion strategy is proposed, enabling each detection head to adaptively integrate information from other detection heads, thereby achieving complementary utilization of detection head information. Finally, considering that the IoU (Intersection over Union) metric is very sensitive to the position change of small objects, the Gaussian Wasserstein distance metric is adopted to optimize the loss function and enhance target localization accuracy. Experiments are primarily conducted on the LUNA16 dataset. The optimized model, LNDCNN (Lung Nodule Detection Convolutional Neural Network), achieves an improvement of 11.42% in mAP<sub>50</sub>, 6.17% in mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and a 14.4% increase in Recall compared to YOLOv7. At the same time, we validate the algorithm’s generalization ability on the ALIBABA TianChi dataset. LNDCNN demonstrates consistent improvements with 5.68% higher mAP<sub>50</sub>, 2.92% higher mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and 8.61% higher Recall compared to YOLOv7, demonstrating promising generalization capability. The implementation of LNDCNN is publicly available at <span><span>https://github.com/zlj-zlj/LNDCNN</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108754"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LNDCNN: A lung nodule detection model based on improved YOLOv7\",\"authors\":\"Lu Meng , Lijun Zhou\",\"doi\":\"10.1016/j.bspc.2025.108754\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The automatic detection of lung nodules is crucial for cancer screening. However, lung nodules typically occupy small areas in CT images with indistinct features, posing significant detection challenges. To address this, YOLOv7 is optimized and a lung nodule detection model is proposed. Firstly, considering the problem of feature loss caused by the PAFPN (Path Aggregation Feature Pyramid Network), an Enhanced Feature Pyramid Network (E_FPN) is introduced, which employs dilated convolutions in series to adjust the receptive field size and supplement contextual information. The dilation rate is designed to fully utilize fine-grained features. Then, an adaptive fusion strategy is proposed, enabling each detection head to adaptively integrate information from other detection heads, thereby achieving complementary utilization of detection head information. Finally, considering that the IoU (Intersection over Union) metric is very sensitive to the position change of small objects, the Gaussian Wasserstein distance metric is adopted to optimize the loss function and enhance target localization accuracy. Experiments are primarily conducted on the LUNA16 dataset. The optimized model, LNDCNN (Lung Nodule Detection Convolutional Neural Network), achieves an improvement of 11.42% in mAP<sub>50</sub>, 6.17% in mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and a 14.4% increase in Recall compared to YOLOv7. At the same time, we validate the algorithm’s generalization ability on the ALIBABA TianChi dataset. LNDCNN demonstrates consistent improvements with 5.68% higher mAP<sub>50</sub>, 2.92% higher mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and 8.61% higher Recall compared to YOLOv7, demonstrating promising generalization capability. The implementation of LNDCNN is publicly available at <span><span>https://github.com/zlj-zlj/LNDCNN</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108754\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425012650\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425012650","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

肺结节的自动检测对癌症筛查至关重要。然而，肺结节通常在CT图像中占据很小的区域，特征不明显，给检测带来了很大的挑战。针对这一问题，对YOLOv7进行了优化，提出了肺结节检测模型。首先，针对路径聚合特征金字塔网络（Path Aggregation feature Pyramid Network， PAFPN）造成的特征丢失问题，提出了一种增强特征金字塔网络（E_FPN），该网络采用扩展卷积序列来调整接收野大小并补充上下文信息。膨胀率的设计充分利用了细粒度的特点。然后，提出了一种自适应融合策略，使每个检测头能够自适应地整合来自其他检测头的信息，从而实现检测头信息的互补利用。最后，考虑到IoU （Intersection over Union）度量对小目标的位置变化非常敏感，采用高斯Wasserstein距离度量优化损失函数，提高目标定位精度。实验主要在LUNA16数据集上进行。优化后的模型LNDCNN （Lung Nodule Detection Convolutional Neural Network，肺结节检测卷积神经网络）在mAP50和mAP50:90上分别提高了11.42%和6.17%，召回率比YOLOv7提高了14.4%。同时，在阿里巴巴天池数据集上验证了算法的泛化能力。与YOLOv7相比，LNDCNN的mAP50提高了5.68%，mAP50:90提高了2.92%，召回率提高了8.61%，显示出良好的泛化能力。LNDCNN的实现可以在https://github.com/zlj-zlj/LNDCNN上公开获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

LNDCNN: A lung nodule detection model based on improved YOLOv7

The automatic detection of lung nodules is crucial for cancer screening. However, lung nodules typically occupy small areas in CT images with indistinct features, posing significant detection challenges. To address this, YOLOv7 is optimized and a lung nodule detection model is proposed. Firstly, considering the problem of feature loss caused by the PAFPN (Path Aggregation Feature Pyramid Network), an Enhanced Feature Pyramid Network (E_FPN) is introduced, which employs dilated convolutions in series to adjust the receptive field size and supplement contextual information. The dilation rate is designed to fully utilize fine-grained features. Then, an adaptive fusion strategy is proposed, enabling each detection head to adaptively integrate information from other detection heads, thereby achieving complementary utilization of detection head information. Finally, considering that the IoU (Intersection over Union) metric is very sensitive to the position change of small objects, the Gaussian Wasserstein distance metric is adopted to optimize the loss function and enhance target localization accuracy. Experiments are primarily conducted on the LUNA16 dataset. The optimized model, LNDCNN (Lung Nodule Detection Convolutional Neural Network), achieves an improvement of 11.42% in mAP₅₀, 6.17% in mAP

_{50 : 90}

, and a 14.4% increase in Recall compared to YOLOv7. At the same time, we validate the algorithm’s generalization ability on the ALIBABA TianChi dataset. LNDCNN demonstrates consistent improvements with 5.68% higher mAP₅₀, 2.92% higher mAP

_{50 : 90}

, and 8.61% higher Recall compared to YOLOv7, demonstrating promising generalization capability. The implementation of LNDCNN is publicly available at https://github.com/zlj-zlj/LNDCNN.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.