{"title":"Self-Supervised Learning Method for Breast Cancer Detection with Image Feature Set and Modified U-Net Segmentation Using Whole Slide Image.","authors":"Sangishetti Karunakar, Praveen Pappula","doi":"10.1080/07357907.2025.2562535","DOIUrl":null,"url":null,"abstract":"<p><p>Breast cancer (BC) is the second most prevalent cause of death for women and the most frequently diagnosed malignancy. Early identification of this deadly illness lowers treatment costs while significantly improving survival rates. In contrast, skilled radiologists and pathologists analyze radiographic and histopathological images, respectively. In addition to being expensive, the procedure is prone to errors. The paper offers a solution to these challenges by presenting an innovative approach that combines a Modified U-Net architecture with sophisticated self-supervised learning methods to the accuracy and efficiency of breast cancer detection in WSIs. The proposed model improves the accuracy of tumor detection by integrating a multi-stage process: starting with Gaussian filtering for image preprocessing to remove noise, followed by the Modified U-Net for precise tumor segmentation including multi-scale processing and attention mechanisms. Feature extraction is achieved through the Bag of Visual Words (BoW), Improved Local Gradient and Intensity Pattern (LGIP), and Pyramidal Histogram of Oriented Gradients (PHOG) techniques to capture diverse image characteristics. The classification phase employs an Improved Self-Supervised Learning (ISSL) method, which improves feature representation via a novel loss function and an improved Multiple Instance Pooling (IMIP) mechanism. This method is designed to overcome the limitations of conventional techniques by offering clearer tumor boundaries and more accurate classifications, thereby improving the overall reliability and efficacy of breast cancer detection in clinical practice. Moreover, the ISSL strategy yielded the highest performance metrics, including an accuracy of 0.924, a sensitivity of 0.886, and a negative predictive value (NPV) of 0.943.</p>","PeriodicalId":9463,"journal":{"name":"Cancer Investigation","volume":" ","pages":"1-22"},"PeriodicalIF":1.9000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/07357907.2025.2562535","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Breast cancer (BC) is the second most prevalent cause of death for women and the most frequently diagnosed malignancy. Early identification of this deadly illness lowers treatment costs while significantly improving survival rates. In contrast, skilled radiologists and pathologists analyze radiographic and histopathological images, respectively. In addition to being expensive, the procedure is prone to errors. The paper offers a solution to these challenges by presenting an innovative approach that combines a Modified U-Net architecture with sophisticated self-supervised learning methods to the accuracy and efficiency of breast cancer detection in WSIs. The proposed model improves the accuracy of tumor detection by integrating a multi-stage process: starting with Gaussian filtering for image preprocessing to remove noise, followed by the Modified U-Net for precise tumor segmentation including multi-scale processing and attention mechanisms. Feature extraction is achieved through the Bag of Visual Words (BoW), Improved Local Gradient and Intensity Pattern (LGIP), and Pyramidal Histogram of Oriented Gradients (PHOG) techniques to capture diverse image characteristics. The classification phase employs an Improved Self-Supervised Learning (ISSL) method, which improves feature representation via a novel loss function and an improved Multiple Instance Pooling (IMIP) mechanism. This method is designed to overcome the limitations of conventional techniques by offering clearer tumor boundaries and more accurate classifications, thereby improving the overall reliability and efficacy of breast cancer detection in clinical practice. Moreover, the ISSL strategy yielded the highest performance metrics, including an accuracy of 0.924, a sensitivity of 0.886, and a negative predictive value (NPV) of 0.943.
期刊介绍:
Cancer Investigation is one of the most highly regarded and recognized journals in the field of basic and clinical oncology. It is designed to give physicians a comprehensive resource on the current state of progress in the cancer field as well as a broad background of reliable information necessary for effective decision making. In addition to presenting original papers of fundamental significance, it also publishes reviews, essays, specialized presentations of controversies, considerations of new technologies and their applications to specific laboratory problems, discussions of public issues, miniseries on major topics, new and experimental drugs and therapies, and an innovative letters to the editor section. One of the unique features of the journal is its departmentalized editorial sections reporting on more than 30 subject categories covering the broad spectrum of specialized areas that together comprise the field of oncology. Edited by leading physicians and research scientists, these sections make Cancer Investigation the prime resource for clinicians seeking to make sense of the sometimes-overwhelming amount of information available throughout the field. In addition to its peer-reviewed clinical research, the journal also features translational studies that bridge the gap between the laboratory and the clinic.