Cristiana Moroz-Dubenco, Adél Bajcsi, Anca Andreica, Camelia Chira
{"title":"Towards an interpretable breast cancer detection and diagnosis system.","authors":"Cristiana Moroz-Dubenco, Adél Bajcsi, Anca Andreica, Camelia Chira","doi":"10.1016/j.compbiomed.2024.109520","DOIUrl":null,"url":null,"abstract":"<p><p>According to the World Health Organization, breast cancer becomes fatal only if it spreads throughout the body. Therefore, regular screening is essential. Whilst mammography is the most frequently used technique, its interpretation can be challenging and time-consuming. For this reason, computer-aided detection and diagnosis systems are increasingly being used for second opinion. However, in order for doctors to trust such systems, they need to understand their decisions. We propose an automated and interpretable system for the detection and diagnosis of breast cancer, encompassing five steps. After a robust pre-processing and an unsupervised segmentation, we analyze four feature extraction techniques, both textural and shape-based, and three methods for feature selection. To facilitate interpretation, we employ the Decision Tree algorithm for benign/malignant classification and experiment with different methods to avoid overfitting: pre-pruning, post-pruning, and ensemble-based (Random Forest classifier). Our system reaches a maximum accuracy of 95% and 100% precision and specificity when tested on images from the mini-MIAS dataset, while also offering its users the possibility to analyze each of the steps.</p>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"185 ","pages":"109520"},"PeriodicalIF":7.0000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.compbiomed.2024.109520","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
According to the World Health Organization, breast cancer becomes fatal only if it spreads throughout the body. Therefore, regular screening is essential. Whilst mammography is the most frequently used technique, its interpretation can be challenging and time-consuming. For this reason, computer-aided detection and diagnosis systems are increasingly being used for second opinion. However, in order for doctors to trust such systems, they need to understand their decisions. We propose an automated and interpretable system for the detection and diagnosis of breast cancer, encompassing five steps. After a robust pre-processing and an unsupervised segmentation, we analyze four feature extraction techniques, both textural and shape-based, and three methods for feature selection. To facilitate interpretation, we employ the Decision Tree algorithm for benign/malignant classification and experiment with different methods to avoid overfitting: pre-pruning, post-pruning, and ensemble-based (Random Forest classifier). Our system reaches a maximum accuracy of 95% and 100% precision and specificity when tested on images from the mini-MIAS dataset, while also offering its users the possibility to analyze each of the steps.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.