Machine-learning models for differentiating benign and malignant breast masses: Integrating automated breast volume scanning intra-tumoral, peri-tumoral features, and clinical information.
{"title":"Machine-learning models for differentiating benign and malignant breast masses: Integrating automated breast volume scanning intra-tumoral, peri-tumoral features, and clinical information.","authors":"Meixue Dai, Yueqiong Yan, Zhong Li, Jidong Xiao","doi":"10.1177/20552076251332738","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Differentiating between benign and malignant breast masses is critical for clinical decision-making. Automated breast volume scanning (ABVS) provides high-resolution three-dimensional imaging, addressing the limitations of conventional ultrasound. However, the impact of peritumoral region size on predictive performance has not been systematically studied. This study aims to optimize diagnostic performance by integrating radiomics features and clinical data using multiple machine-learning models.</p><p><strong>Methods: </strong>This retrospective study included ABVS images and clinical data from 250 patients with breast masses. Radiomics features were extracted from both intratumoral and peritumoral regions (5, 10, and 20 mm). These features, combined with clinical data, were used to develop models based on four algorithms: Support vector machine, random forest, extreme gradient boosting, and light gradient boosting machine (LGBM). Model performance was evaluated using area under the receiver operating characteristic curve (AUC), calibration curves, and decision curves, with SHapley Additive exPlanations (SHAP) analysis employed for interpretability.</p><p><strong>Results: </strong>The inclusion of peritumoral features improved the diagnostic performance to varying degrees, with the model incorporating a 10 mm peritumoral region achieving the highest overall accuracy. Combining radiomics with clinical features further enhanced predictive performance. The LGBM model outperformed the other algorithms across subgroups, achieving a maximum AUC of 0.909, an accuracy of 0.878, and an F1-score of 0.971. SHAP analysis revealed the contribution of key features, improving model interpretability.</p><p><strong>Conclusion: </strong>This study demonstrates the value of integrating radiomics and clinical features for breast mass diagnosis, with optimized peritumoral regions enhancing model performance. The LGBM model emerged as the preferred algorithm due to its superior performance. These findings provide strong support for the clinical application of ABVS imaging and future multicenter studies, highlighting the importance of microenvironmental features in diagnosis.</p>","PeriodicalId":51333,"journal":{"name":"DIGITAL HEALTH","volume":"11 ","pages":"20552076251332738"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963789/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DIGITAL HEALTH","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/20552076251332738","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Differentiating between benign and malignant breast masses is critical for clinical decision-making. Automated breast volume scanning (ABVS) provides high-resolution three-dimensional imaging, addressing the limitations of conventional ultrasound. However, the impact of peritumoral region size on predictive performance has not been systematically studied. This study aims to optimize diagnostic performance by integrating radiomics features and clinical data using multiple machine-learning models.
Methods: This retrospective study included ABVS images and clinical data from 250 patients with breast masses. Radiomics features were extracted from both intratumoral and peritumoral regions (5, 10, and 20 mm). These features, combined with clinical data, were used to develop models based on four algorithms: Support vector machine, random forest, extreme gradient boosting, and light gradient boosting machine (LGBM). Model performance was evaluated using area under the receiver operating characteristic curve (AUC), calibration curves, and decision curves, with SHapley Additive exPlanations (SHAP) analysis employed for interpretability.
Results: The inclusion of peritumoral features improved the diagnostic performance to varying degrees, with the model incorporating a 10 mm peritumoral region achieving the highest overall accuracy. Combining radiomics with clinical features further enhanced predictive performance. The LGBM model outperformed the other algorithms across subgroups, achieving a maximum AUC of 0.909, an accuracy of 0.878, and an F1-score of 0.971. SHAP analysis revealed the contribution of key features, improving model interpretability.
Conclusion: This study demonstrates the value of integrating radiomics and clinical features for breast mass diagnosis, with optimized peritumoral regions enhancing model performance. The LGBM model emerged as the preferred algorithm due to its superior performance. These findings provide strong support for the clinical application of ABVS imaging and future multicenter studies, highlighting the importance of microenvironmental features in diagnosis.