ndcnn:一种基于改进YOLOv7的肺结节检测模型

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL
Lu Meng , Lijun Zhou
{"title":"ndcnn:一种基于改进YOLOv7的肺结节检测模型","authors":"Lu Meng ,&nbsp;Lijun Zhou","doi":"10.1016/j.bspc.2025.108754","DOIUrl":null,"url":null,"abstract":"<div><div>The automatic detection of lung nodules is crucial for cancer screening. However, lung nodules typically occupy small areas in CT images with indistinct features, posing significant detection challenges. To address this, YOLOv7 is optimized and a lung nodule detection model is proposed. Firstly, considering the problem of feature loss caused by the PAFPN (Path Aggregation Feature Pyramid Network), an Enhanced Feature Pyramid Network (E_FPN) is introduced, which employs dilated convolutions in series to adjust the receptive field size and supplement contextual information. The dilation rate is designed to fully utilize fine-grained features. Then, an adaptive fusion strategy is proposed, enabling each detection head to adaptively integrate information from other detection heads, thereby achieving complementary utilization of detection head information. Finally, considering that the IoU (Intersection over Union) metric is very sensitive to the position change of small objects, the Gaussian Wasserstein distance metric is adopted to optimize the loss function and enhance target localization accuracy. Experiments are primarily conducted on the LUNA16 dataset. The optimized model, LNDCNN (Lung Nodule Detection Convolutional Neural Network), achieves an improvement of 11.42% in mAP<sub>50</sub>, 6.17% in mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and a 14.4% increase in Recall compared to YOLOv7. At the same time, we validate the algorithm’s generalization ability on the ALIBABA TianChi dataset. LNDCNN demonstrates consistent improvements with 5.68% higher mAP<sub>50</sub>, 2.92% higher mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and 8.61% higher Recall compared to YOLOv7, demonstrating promising generalization capability. The implementation of LNDCNN is publicly available at <span><span>https://github.com/zlj-zlj/LNDCNN</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108754"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LNDCNN: A lung nodule detection model based on improved YOLOv7\",\"authors\":\"Lu Meng ,&nbsp;Lijun Zhou\",\"doi\":\"10.1016/j.bspc.2025.108754\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The automatic detection of lung nodules is crucial for cancer screening. However, lung nodules typically occupy small areas in CT images with indistinct features, posing significant detection challenges. To address this, YOLOv7 is optimized and a lung nodule detection model is proposed. Firstly, considering the problem of feature loss caused by the PAFPN (Path Aggregation Feature Pyramid Network), an Enhanced Feature Pyramid Network (E_FPN) is introduced, which employs dilated convolutions in series to adjust the receptive field size and supplement contextual information. The dilation rate is designed to fully utilize fine-grained features. Then, an adaptive fusion strategy is proposed, enabling each detection head to adaptively integrate information from other detection heads, thereby achieving complementary utilization of detection head information. Finally, considering that the IoU (Intersection over Union) metric is very sensitive to the position change of small objects, the Gaussian Wasserstein distance metric is adopted to optimize the loss function and enhance target localization accuracy. Experiments are primarily conducted on the LUNA16 dataset. The optimized model, LNDCNN (Lung Nodule Detection Convolutional Neural Network), achieves an improvement of 11.42% in mAP<sub>50</sub>, 6.17% in mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and a 14.4% increase in Recall compared to YOLOv7. At the same time, we validate the algorithm’s generalization ability on the ALIBABA TianChi dataset. LNDCNN demonstrates consistent improvements with 5.68% higher mAP<sub>50</sub>, 2.92% higher mAP<span><math><msub><mrow></mrow><mrow><mn>50</mn><mo>:</mo><mn>90</mn></mrow></msub></math></span>, and 8.61% higher Recall compared to YOLOv7, demonstrating promising generalization capability. The implementation of LNDCNN is publicly available at <span><span>https://github.com/zlj-zlj/LNDCNN</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108754\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425012650\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425012650","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

肺结节的自动检测对癌症筛查至关重要。然而,肺结节通常在CT图像中占据很小的区域,特征不明显,给检测带来了很大的挑战。针对这一问题,对YOLOv7进行了优化,提出了肺结节检测模型。首先,针对路径聚合特征金字塔网络(Path Aggregation feature Pyramid Network, PAFPN)造成的特征丢失问题,提出了一种增强特征金字塔网络(E_FPN),该网络采用扩展卷积序列来调整接收野大小并补充上下文信息。膨胀率的设计充分利用了细粒度的特点。然后,提出了一种自适应融合策略,使每个检测头能够自适应地整合来自其他检测头的信息,从而实现检测头信息的互补利用。最后,考虑到IoU (Intersection over Union)度量对小目标的位置变化非常敏感,采用高斯Wasserstein距离度量优化损失函数,提高目标定位精度。实验主要在LUNA16数据集上进行。优化后的模型LNDCNN (Lung Nodule Detection Convolutional Neural Network,肺结节检测卷积神经网络)在mAP50和mAP50:90上分别提高了11.42%和6.17%,召回率比YOLOv7提高了14.4%。同时,在阿里巴巴天池数据集上验证了算法的泛化能力。与YOLOv7相比,LNDCNN的mAP50提高了5.68%,mAP50:90提高了2.92%,召回率提高了8.61%,显示出良好的泛化能力。LNDCNN的实现可以在https://github.com/zlj-zlj/LNDCNN上公开获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
LNDCNN: A lung nodule detection model based on improved YOLOv7
The automatic detection of lung nodules is crucial for cancer screening. However, lung nodules typically occupy small areas in CT images with indistinct features, posing significant detection challenges. To address this, YOLOv7 is optimized and a lung nodule detection model is proposed. Firstly, considering the problem of feature loss caused by the PAFPN (Path Aggregation Feature Pyramid Network), an Enhanced Feature Pyramid Network (E_FPN) is introduced, which employs dilated convolutions in series to adjust the receptive field size and supplement contextual information. The dilation rate is designed to fully utilize fine-grained features. Then, an adaptive fusion strategy is proposed, enabling each detection head to adaptively integrate information from other detection heads, thereby achieving complementary utilization of detection head information. Finally, considering that the IoU (Intersection over Union) metric is very sensitive to the position change of small objects, the Gaussian Wasserstein distance metric is adopted to optimize the loss function and enhance target localization accuracy. Experiments are primarily conducted on the LUNA16 dataset. The optimized model, LNDCNN (Lung Nodule Detection Convolutional Neural Network), achieves an improvement of 11.42% in mAP50, 6.17% in mAP50:90, and a 14.4% increase in Recall compared to YOLOv7. At the same time, we validate the algorithm’s generalization ability on the ALIBABA TianChi dataset. LNDCNN demonstrates consistent improvements with 5.68% higher mAP50, 2.92% higher mAP50:90, and 8.61% higher Recall compared to YOLOv7, demonstrating promising generalization capability. The implementation of LNDCNN is publicly available at https://github.com/zlj-zlj/LNDCNN.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信