Maximizing AUC with Deep Learning for Classification of Imbalanced Mammogram Datasets

Eurographics Workshop on Visual Computing for Biomedicine Pub Date : 2017-01-01 DOI:10.2312/vcbm.20171246

Jeremias Sulam, Rami Ben-Ari, P. Kisilev

引用次数: 18

Abstract

Breast cancer is the second most common cause of death in women. Computer-aided diagnosis typically demand for carefully annotated data, precise tumor allocation and delineation of the boundaries, which is rarely available in the medical system. In this paper we present a new deep learning approach for classification of mammograms that requires only a global binary label. Traditional deep learning methods typically employ classification error losses, which are highly biased by class imbalance – a situation that naturally arises in medical classification problems. We hereby suggest a novel loss measure that directly maximizes the Area Under the ROC Curve (AUC), providing an unbiased loss. We validate the proposed model on two mammogram datasets: IMG, comprising of 796 patients, 80 positive (164 images) and 716 negative (1869 images), and the publicly available dataset INbreast. Our results are encouraging, as the proposed scheme achieves an AUC of 0.76 and 0.65 for IMG and INbreast,

查看原文本刊更多论文

利用深度学习最大化不平衡乳房x线照片数据集分类的AUC

乳腺癌是导致妇女死亡的第二大常见原因。计算机辅助诊断通常需要仔细注释的数据，精确的肿瘤分配和边界划定，这些在医疗系统中很少可用。在本文中，我们提出了一种新的乳房x线照片分类的深度学习方法，它只需要一个全局二值标签。传统的深度学习方法通常采用分类误差损失，这种方法由于类别不平衡而高度偏倚，这是医学分类问题中自然出现的一种情况。我们在此提出一种新的损失测量方法，可以直接最大化ROC曲线下的面积(AUC)，从而提供无偏损失。我们在两个乳房x线照片数据集上验证了所提出的模型:IMG，包括796名患者，80例阳性(164张)和716例阴性(1869张)，以及公开可用的数据集INbreast。我们的结果是令人鼓舞的，因为所提出的方案在IMG和INbreast上实现了0.76和0.65的AUC。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Eurographics Workshop on Visual Computing for Biomedicine

自引率

0.00%

发文量