Robert B. Eshun , Marwan Bikdash , A.K.M. Kamrul Islam
{"title":"A deep convolutional neural network for the classification of imbalanced breast cancer dataset","authors":"Robert B. Eshun , Marwan Bikdash , A.K.M. Kamrul Islam","doi":"10.1016/j.health.2024.100330","DOIUrl":null,"url":null,"abstract":"<div><p>The primary procedures for breast cancer diagnosis involve the assessment of histopathological slide images by skilled patholo-gists. This procedure is prone to human subjectivity and can lead to diagnostic errors with adverse implications for patient health and welfare. Artificial intelligence-based models have yielded promising results in other medical tasks and offer tools for potentially addressing the shortcomings of traditional medical image analysis. The BreakHis breast cancer dataset suffers from insufficient data for the minority class with an imbalance ratio >0.40, which poses challenges for deep learning models. To avoid performance degradation, researchers have explored a variety of data augmentation schemes to generate adequate samples for analysis. This study designed a Deep Convolutional Neural Network (DCGAN) with specific generator and discriminator architectures to mitigate model instability and generate high-quality synthetic data for the minority class. The balanced dataset was passed to the fine-tuned ResNet50 model for breast tumor detection. The study produced high accuracy in diagnosing benign/malignancy at 40X magnification, outperforming the state-of-art. The results demonstrated that deep learning methods can potentially to support effective screening in clinical practice.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100330"},"PeriodicalIF":0.0000,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000327/pdfft?md5=9d04a7f6f58d049abde8b5a3fdbb0a8b&pid=1-s2.0-S2772442524000327-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare analytics (New York, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772442524000327","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The primary procedures for breast cancer diagnosis involve the assessment of histopathological slide images by skilled patholo-gists. This procedure is prone to human subjectivity and can lead to diagnostic errors with adverse implications for patient health and welfare. Artificial intelligence-based models have yielded promising results in other medical tasks and offer tools for potentially addressing the shortcomings of traditional medical image analysis. The BreakHis breast cancer dataset suffers from insufficient data for the minority class with an imbalance ratio >0.40, which poses challenges for deep learning models. To avoid performance degradation, researchers have explored a variety of data augmentation schemes to generate adequate samples for analysis. This study designed a Deep Convolutional Neural Network (DCGAN) with specific generator and discriminator architectures to mitigate model instability and generate high-quality synthetic data for the minority class. The balanced dataset was passed to the fine-tuned ResNet50 model for breast tumor detection. The study produced high accuracy in diagnosing benign/malignancy at 40X magnification, outperforming the state-of-art. The results demonstrated that deep learning methods can potentially to support effective screening in clinical practice.