Predicting malignancy in breast lesions: enhancing accuracy with fine-tuned convolutional neural network models.

IF 2.9 3区医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

BMC Medical Imaging Pub Date : 2024-11-11 DOI:10.1186/s12880-024-01484-1

Li Li, Changjie Pan, Ming Zhang, Dong Shen, Guangyuan He, Mingzhu Meng

{"title":"Predicting malignancy in breast lesions: enhancing accuracy with fine-tuned convolutional neural network models.","authors":"Li Li, Changjie Pan, Ming Zhang, Dong Shen, Guangyuan He, Mingzhu Meng","doi":"10.1186/s12880-024-01484-1","DOIUrl":null,"url":null,"abstract":"Background: This study aims to explore the accuracy of Convolutional Neural Network (CNN) models in predicting malignancy in Dynamic Contrast-Enhanced Breast Magnetic Resonance Imaging (DCE-BMRI).Methods: A total of 273 benign lesions (benign group) and 274 malignant lesions (malignant group) were collected and randomly divided into a training set (246 benign and 245 malignant lesions) and a testing set (28 benign and 28 malignant lesions) in a 9:1 ratio. An additional 53 lesions from 53 patients were designated as the validation set. Five models-VGG16, VGG19, DenseNet201, ResNet50, and MobileNetV2-were evaluated. Model performance was assessed using accuracy (Ac) in the training and testing sets, and precision (Pr), recall (Rc), F1 score (F1), and area under the receiver operating characteristic curve (AUC) in the validation set.Results: The accuracy of VGG19 on the test set (0.96) is higher than that of VGG16 (0.91), DenseNet201 (0.91), ResNet50 (0.67), and MobileNetV2 (0.88). For the validation set, VGG19 achieved higher performance metrics (Pr 0.75, Rc 0.76, F1 0.73, AUC 0.76) compared to the other models, specifically VGG16 (Pr 0.73, Rc 0.75, F1 0.70, AUC 0.73), DenseNet201 (Pr 0.71, Rc 0.74, F1 0.69, AUC 0.71), ResNet50 (Pr 0.65, Rc 0.68, F1 0.60, AUC 0.65), and MobileNetV2 (Pr 0.73, Rc 0.75, F1 0.71, AUC 0.73). S4 model achieved higher performance metrics (Pr 0.89, Rc 0.88, F1 0.87, AUC 0.89) compared to the other four fine-tuned models, specifically S1 (Pr 0.75, Rc 0.76, F1 0.74, AUC 0.75), S2 (Pr 0.77, Rc 0.79, F1 0.75, AUC 0.77), S3 (Pr 0.76, Rc 0.76, F1 0.73, AUC 0.75), and S5 (Pr 0.77, Rc 0.79, F1 0.75, AUC 0.77). Additionally, S4 model showed the lowest loss value in the testing set. Notably, the AUC of S4 for BI-RADS 3 was 0.90 and for BI-RADS 4 was 0.86, both significantly higher than the 0.65 AUC for BI-RADS 5.Conclusions: The S4 model we propose has demonstrated superior performance in predicting the likelihood of malignancy in DCE-BMRI, making it a promising candidate for clinical application in patients with breast diseases. However, further validation is essential, highlighting the need for additional data to confirm its efficacy.","PeriodicalId":9020,"journal":{"name":"BMC Medical Imaging","volume":"24 1","pages":"303"},"PeriodicalIF":2.9000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11552211/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12880-024-01484-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background: This study aims to explore the accuracy of Convolutional Neural Network (CNN) models in predicting malignancy in Dynamic Contrast-Enhanced Breast Magnetic Resonance Imaging (DCE-BMRI).

Methods: A total of 273 benign lesions (benign group) and 274 malignant lesions (malignant group) were collected and randomly divided into a training set (246 benign and 245 malignant lesions) and a testing set (28 benign and 28 malignant lesions) in a 9:1 ratio. An additional 53 lesions from 53 patients were designated as the validation set. Five models-VGG16, VGG19, DenseNet201, ResNet50, and MobileNetV2-were evaluated. Model performance was assessed using accuracy (Ac) in the training and testing sets, and precision (Pr), recall (Rc), F1 score (F1), and area under the receiver operating characteristic curve (AUC) in the validation set.

Results: The accuracy of VGG19 on the test set (0.96) is higher than that of VGG16 (0.91), DenseNet201 (0.91), ResNet50 (0.67), and MobileNetV2 (0.88). For the validation set, VGG19 achieved higher performance metrics (Pr 0.75, Rc 0.76, F1 0.73, AUC 0.76) compared to the other models, specifically VGG16 (Pr 0.73, Rc 0.75, F1 0.70, AUC 0.73), DenseNet201 (Pr 0.71, Rc 0.74, F1 0.69, AUC 0.71), ResNet50 (Pr 0.65, Rc 0.68, F1 0.60, AUC 0.65), and MobileNetV2 (Pr 0.73, Rc 0.75, F1 0.71, AUC 0.73). S4 model achieved higher performance metrics (Pr 0.89, Rc 0.88, F1 0.87, AUC 0.89) compared to the other four fine-tuned models, specifically S1 (Pr 0.75, Rc 0.76, F1 0.74, AUC 0.75), S2 (Pr 0.77, Rc 0.79, F1 0.75, AUC 0.77), S3 (Pr 0.76, Rc 0.76, F1 0.73, AUC 0.75), and S5 (Pr 0.77, Rc 0.79, F1 0.75, AUC 0.77). Additionally, S4 model showed the lowest loss value in the testing set. Notably, the AUC of S4 for BI-RADS 3 was 0.90 and for BI-RADS 4 was 0.86, both significantly higher than the 0.65 AUC for BI-RADS 5.

Conclusions: The S4 model we propose has demonstrated superior performance in predicting the likelihood of malignancy in DCE-BMRI, making it a promising candidate for clinical application in patients with breast diseases. However, further validation is essential, highlighting the need for additional data to confirm its efficacy.

查看原文本刊更多论文

预测乳腺病变的恶性程度：利用微调卷积神经网络模型提高准确性。

研究背景本研究旨在探讨卷积神经网络（CNN）模型在动态对比增强乳腺磁共振成像（DCE-BMRI）中预测恶性病变的准确性：共收集了 273 个良性病灶（良性组）和 274 个恶性病灶（恶性组），并按 9:1 的比例随机分为训练集（246 个良性病灶和 245 个恶性病灶）和测试集（28 个良性病灶和 28 个恶性病灶）。另外 53 名患者的 53 个病灶被指定为验证集。评估了五个模型-VGG16、VGG19、DenseNet201、ResNet50 和 MobileNetV2。使用训练集和测试集中的准确度（Ac）以及验证集中的精确度（Pr）、召回率（Rc）、F1 分数（F1）和接收器工作特征曲线下面积（AUC）评估模型性能：VGG19 在测试集上的精确度（0.96）高于 VGG16（0.91）、DenseNet201（0.91）、ResNet50（0.67）和 MobileNetV2（0.88）。在验证集上，VGG19 的性能指标（Pr 0.75、Rc 0.76、F1 0.73、AUC 0.76）高于其他模型，特别是 VGG16（Pr 0.73、Rc 0.75，F1 0.70，AUC 0.73）、DenseNet201（Pr 0.71，Rc 0.74，F1 0.69，AUC 0.71）、ResNet50（Pr 0.65，Rc 0.68，F1 0.60，AUC 0.65）和 MobileNetV2（Pr 0.73，Rc 0.75，F1 0.71，AUC 0.73）。与其他四个微调模型相比，S4 模型获得了更高的性能指标（Pr 0.89，Rc 0.88，F1 0.87，AUC 0.89），特别是 S1（Pr 0.75，Rc 0.76，F1 0.74，AUC 0.75）、S2（Pr 0.77，Rc 0.79，F1 0.75，AUC 0.77）、S3（Pr 0.76，Rc 0.76，F1 0.73，AUC 0.75）和 S5（Pr 0.77，Rc 0.79，F1 0.75，AUC 0.77）。此外，S4 模型在测试集中的损失值最低。值得注意的是，S4 对 BI-RADS 3 的 AUC 为 0.90，对 BI-RADS 4 的 AUC 为 0.86，均显著高于对 BI-RADS 5 的 0.65 AUC：我们提出的 S4 模型在预测 DCE-BMRI 中恶性肿瘤的可能性方面表现出色，有望在乳腺疾病患者中得到临床应用。然而，进一步的验证是必不可少的，因此需要更多的数据来证实其有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.60

自引率

3.70%

发文量

198

审稿时长

27 weeks

期刊介绍： BMC Medical Imaging is an open access journal publishing original peer-reviewed research articles in the development, evaluation, and use of imaging techniques and image processing tools to diagnose and manage disease.