鉴别乳腺良恶性肿块的机器学习模型:整合自动乳腺体积扫描肿瘤内、肿瘤周围特征和临床信息。

IF 2.9 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
DIGITAL HEALTH Pub Date : 2025-04-01 eCollection Date: 2025-01-01 DOI:10.1177/20552076251332738
Meixue Dai, Yueqiong Yan, Zhong Li, Jidong Xiao
{"title":"鉴别乳腺良恶性肿块的机器学习模型:整合自动乳腺体积扫描肿瘤内、肿瘤周围特征和临床信息。","authors":"Meixue Dai, Yueqiong Yan, Zhong Li, Jidong Xiao","doi":"10.1177/20552076251332738","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Differentiating between benign and malignant breast masses is critical for clinical decision-making. Automated breast volume scanning (ABVS) provides high-resolution three-dimensional imaging, addressing the limitations of conventional ultrasound. However, the impact of peritumoral region size on predictive performance has not been systematically studied. This study aims to optimize diagnostic performance by integrating radiomics features and clinical data using multiple machine-learning models.</p><p><strong>Methods: </strong>This retrospective study included ABVS images and clinical data from 250 patients with breast masses. Radiomics features were extracted from both intratumoral and peritumoral regions (5, 10, and 20 mm). These features, combined with clinical data, were used to develop models based on four algorithms: Support vector machine, random forest, extreme gradient boosting, and light gradient boosting machine (LGBM). Model performance was evaluated using area under the receiver operating characteristic curve (AUC), calibration curves, and decision curves, with SHapley Additive exPlanations (SHAP) analysis employed for interpretability.</p><p><strong>Results: </strong>The inclusion of peritumoral features improved the diagnostic performance to varying degrees, with the model incorporating a 10 mm peritumoral region achieving the highest overall accuracy. Combining radiomics with clinical features further enhanced predictive performance. The LGBM model outperformed the other algorithms across subgroups, achieving a maximum AUC of 0.909, an accuracy of 0.878, and an F1-score of 0.971. SHAP analysis revealed the contribution of key features, improving model interpretability.</p><p><strong>Conclusion: </strong>This study demonstrates the value of integrating radiomics and clinical features for breast mass diagnosis, with optimized peritumoral regions enhancing model performance. The LGBM model emerged as the preferred algorithm due to its superior performance. These findings provide strong support for the clinical application of ABVS imaging and future multicenter studies, highlighting the importance of microenvironmental features in diagnosis.</p>","PeriodicalId":51333,"journal":{"name":"DIGITAL HEALTH","volume":"11 ","pages":"20552076251332738"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963789/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine-learning models for differentiating benign and malignant breast masses: Integrating automated breast volume scanning intra-tumoral, peri-tumoral features, and clinical information.\",\"authors\":\"Meixue Dai, Yueqiong Yan, Zhong Li, Jidong Xiao\",\"doi\":\"10.1177/20552076251332738\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Differentiating between benign and malignant breast masses is critical for clinical decision-making. Automated breast volume scanning (ABVS) provides high-resolution three-dimensional imaging, addressing the limitations of conventional ultrasound. However, the impact of peritumoral region size on predictive performance has not been systematically studied. This study aims to optimize diagnostic performance by integrating radiomics features and clinical data using multiple machine-learning models.</p><p><strong>Methods: </strong>This retrospective study included ABVS images and clinical data from 250 patients with breast masses. Radiomics features were extracted from both intratumoral and peritumoral regions (5, 10, and 20 mm). These features, combined with clinical data, were used to develop models based on four algorithms: Support vector machine, random forest, extreme gradient boosting, and light gradient boosting machine (LGBM). Model performance was evaluated using area under the receiver operating characteristic curve (AUC), calibration curves, and decision curves, with SHapley Additive exPlanations (SHAP) analysis employed for interpretability.</p><p><strong>Results: </strong>The inclusion of peritumoral features improved the diagnostic performance to varying degrees, with the model incorporating a 10 mm peritumoral region achieving the highest overall accuracy. Combining radiomics with clinical features further enhanced predictive performance. The LGBM model outperformed the other algorithms across subgroups, achieving a maximum AUC of 0.909, an accuracy of 0.878, and an F1-score of 0.971. SHAP analysis revealed the contribution of key features, improving model interpretability.</p><p><strong>Conclusion: </strong>This study demonstrates the value of integrating radiomics and clinical features for breast mass diagnosis, with optimized peritumoral regions enhancing model performance. The LGBM model emerged as the preferred algorithm due to its superior performance. These findings provide strong support for the clinical application of ABVS imaging and future multicenter studies, highlighting the importance of microenvironmental features in diagnosis.</p>\",\"PeriodicalId\":51333,\"journal\":{\"name\":\"DIGITAL HEALTH\",\"volume\":\"11 \",\"pages\":\"20552076251332738\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963789/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DIGITAL HEALTH\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/20552076251332738\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DIGITAL HEALTH","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/20552076251332738","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

摘要

背景:鉴别乳腺肿块的良恶性是临床决策的关键。自动乳房体积扫描(ABVS)提供高分辨率的三维成像,解决传统超声的局限性。然而,肿瘤周围区域大小对预测性能的影响尚未得到系统的研究。本研究旨在通过使用多种机器学习模型整合放射组学特征和临床数据来优化诊断性能。方法:回顾性分析250例乳腺肿块患者的ABVS影像及临床资料。从肿瘤内和肿瘤周围区域(5、10和20 mm)提取放射组学特征。结合临床数据,利用这些特征建立基于支持向量机、随机森林、极端梯度增强和光梯度增强机(LGBM)四种算法的模型。使用受试者工作特征曲线(AUC)、校准曲线和决策曲线下的面积来评估模型的性能,并使用SHapley可加解释(SHAP)分析来评估模型的可解释性。结果:肿瘤周围特征的纳入不同程度地提高了诊断性能,其中包含10 mm肿瘤周围区域的模型获得了最高的整体准确性。放射组学与临床特征的结合进一步提高了预测效果。LGBM模型在子组上的表现优于其他算法,最大AUC为0.909,准确率为0.878,f1得分为0.971。SHAP分析揭示了关键特征的贡献,提高了模型的可解释性。结论:本研究证明了结合放射组学和临床特征对乳腺肿块诊断的价值,优化肿瘤周围区域可提高模型的性能。LGBM模型由于其优越的性能而成为首选算法。这些发现为ABVS成像的临床应用和未来的多中心研究提供了强有力的支持,突出了微环境特征在诊断中的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine-learning models for differentiating benign and malignant breast masses: Integrating automated breast volume scanning intra-tumoral, peri-tumoral features, and clinical information.

Background: Differentiating between benign and malignant breast masses is critical for clinical decision-making. Automated breast volume scanning (ABVS) provides high-resolution three-dimensional imaging, addressing the limitations of conventional ultrasound. However, the impact of peritumoral region size on predictive performance has not been systematically studied. This study aims to optimize diagnostic performance by integrating radiomics features and clinical data using multiple machine-learning models.

Methods: This retrospective study included ABVS images and clinical data from 250 patients with breast masses. Radiomics features were extracted from both intratumoral and peritumoral regions (5, 10, and 20 mm). These features, combined with clinical data, were used to develop models based on four algorithms: Support vector machine, random forest, extreme gradient boosting, and light gradient boosting machine (LGBM). Model performance was evaluated using area under the receiver operating characteristic curve (AUC), calibration curves, and decision curves, with SHapley Additive exPlanations (SHAP) analysis employed for interpretability.

Results: The inclusion of peritumoral features improved the diagnostic performance to varying degrees, with the model incorporating a 10 mm peritumoral region achieving the highest overall accuracy. Combining radiomics with clinical features further enhanced predictive performance. The LGBM model outperformed the other algorithms across subgroups, achieving a maximum AUC of 0.909, an accuracy of 0.878, and an F1-score of 0.971. SHAP analysis revealed the contribution of key features, improving model interpretability.

Conclusion: This study demonstrates the value of integrating radiomics and clinical features for breast mass diagnosis, with optimized peritumoral regions enhancing model performance. The LGBM model emerged as the preferred algorithm due to its superior performance. These findings provide strong support for the clinical application of ABVS imaging and future multicenter studies, highlighting the importance of microenvironmental features in diagnosis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
DIGITAL HEALTH
DIGITAL HEALTH Multiple-
CiteScore
2.90
自引率
7.70%
发文量
302
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信