利用多任务对抗cnn学习肿瘤可解释的微观特征以提高泛化

The journal of machine learning for biomedical imaging Pub Date : 2020-08-04 DOI:10.59275/j.melba.2023-3462

Mara Graziani, Sebastian Otálora, S. Marchand-Maillet, H. Muller, V. Andrearczyk

{"title":"利用多任务对抗cnn学习肿瘤可解释的微观特征以提高泛化","authors":"Mara Graziani, Sebastian Otálora, S. Marchand-Maillet, H. Muller, V. Andrearczyk","doi":"10.59275/j.melba.2023-3462","DOIUrl":null,"url":null,"abstract":"Adopting Convolutional Neural Networks (CNNs) in the daily routine of primary diagnosis requires not only near-perfect precision, but also a sufficient degree of generalization to data acquisition shifts and transparency. Existing CNN models act as black boxes, not ensuring to the physicians that important diagnostic features are used by the model. Building on top of successfully existing techniques such as multi-task learning, domain adversarial training and concept-based interpretability, this paper addresses the challenge of introducing diagnostic factors in the training objectives. Here we show that our architecture, by learning end-to-end an uncertainty-based weighting combination of multi-task and adversarial losses, is encouraged to focus on pathology features such as density and pleomorphism of nuclei, e.g. variations in size and appearance, while discarding misleading features such as staining differences. Our results on breast lymph node tissue show significantly improved generalization in the detection of tumorous tissue, with best average AUC 0.89 (0.01) against the baseline AUC 0.86 (0.005). By applying the interpretability technique of linearly probing intermediate representations, we also demonstrate that interpretable pathology features such as nuclei density are learned by the proposed CNN architecture, confirming the increased transparency of this model. This result is a starting point towards building interpretable multi-task architectures that are robust to data heterogeneity. Our code is available at https://github.com/maragraziani/multitask_adversarial","PeriodicalId":75083,"journal":{"name":"The journal of machine learning for biomedical imaging","volume":"4 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning Interpretable Microscopic Features of Tumor by Multi-task Adversarial CNNs To Improve Generalization\",\"authors\":\"Mara Graziani, Sebastian Otálora, S. Marchand-Maillet, H. Muller, V. Andrearczyk\",\"doi\":\"10.59275/j.melba.2023-3462\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Adopting Convolutional Neural Networks (CNNs) in the daily routine of primary diagnosis requires not only near-perfect precision, but also a sufficient degree of generalization to data acquisition shifts and transparency. Existing CNN models act as black boxes, not ensuring to the physicians that important diagnostic features are used by the model. Building on top of successfully existing techniques such as multi-task learning, domain adversarial training and concept-based interpretability, this paper addresses the challenge of introducing diagnostic factors in the training objectives. Here we show that our architecture, by learning end-to-end an uncertainty-based weighting combination of multi-task and adversarial losses, is encouraged to focus on pathology features such as density and pleomorphism of nuclei, e.g. variations in size and appearance, while discarding misleading features such as staining differences. Our results on breast lymph node tissue show significantly improved generalization in the detection of tumorous tissue, with best average AUC 0.89 (0.01) against the baseline AUC 0.86 (0.005). By applying the interpretability technique of linearly probing intermediate representations, we also demonstrate that interpretable pathology features such as nuclei density are learned by the proposed CNN architecture, confirming the increased transparency of this model. This result is a starting point towards building interpretable multi-task architectures that are robust to data heterogeneity. Our code is available at https://github.com/maragraziani/multitask_adversarial\",\"PeriodicalId\":75083,\"journal\":{\"name\":\"The journal of machine learning for biomedical imaging\",\"volume\":\"4 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-08-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The journal of machine learning for biomedical imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.59275/j.melba.2023-3462\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of machine learning for biomedical imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59275/j.melba.2023-3462","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在日常的初级诊断中采用卷积神经网络(cnn)不仅需要接近完美的精度，而且需要对数据采集偏移和透明度有足够的泛化程度。现有的CNN模型就像黑盒子一样，不能向医生保证模型使用了重要的诊断特征。本文以现有的多任务学习、领域对抗训练和基于概念的可解释性等成功技术为基础，解决了在训练目标中引入诊断因素的挑战。在这里，我们展示了我们的架构，通过端到端学习基于不确定性的多任务加权组合和对抗性损失，被鼓励关注病理特征，如细胞核的密度和多形性，例如大小和外观的变化，同时丢弃误导性特征，如染色差异。我们在乳腺淋巴结组织的结果显示，肿瘤组织检测的通用性显著提高，最佳平均AUC为0.89(0.01)，而基线AUC为0.86(0.005)。通过应用线性探测中间表征的可解释性技术，我们还证明了可解释的病理特征，如核密度被提出的CNN架构所学习，证实了该模型的透明度增加。该结果是构建可解释的多任务体系结构的起点，该体系结构对数据异构具有鲁棒性。我们的代码可在https://github.com/maragraziani/multitask_adversarial上获得

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning Interpretable Microscopic Features of Tumor by Multi-task Adversarial CNNs To Improve Generalization

Adopting Convolutional Neural Networks (CNNs) in the daily routine of primary diagnosis requires not only near-perfect precision, but also a sufficient degree of generalization to data acquisition shifts and transparency. Existing CNN models act as black boxes, not ensuring to the physicians that important diagnostic features are used by the model. Building on top of successfully existing techniques such as multi-task learning, domain adversarial training and concept-based interpretability, this paper addresses the challenge of introducing diagnostic factors in the training objectives. Here we show that our architecture, by learning end-to-end an uncertainty-based weighting combination of multi-task and adversarial losses, is encouraged to focus on pathology features such as density and pleomorphism of nuclei, e.g. variations in size and appearance, while discarding misleading features such as staining differences. Our results on breast lymph node tissue show significantly improved generalization in the detection of tumorous tissue, with best average AUC 0.89 (0.01) against the baseline AUC 0.86 (0.005). By applying the interpretability technique of linearly probing intermediate representations, we also demonstrate that interpretable pathology features such as nuclei density are learned by the proposed CNN architecture, confirming the increased transparency of this model. This result is a starting point towards building interpretable multi-task architectures that are robust to data heterogeneity. Our code is available at https://github.com/maragraziani/multitask_adversarial

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The journal of machine learning for biomedical imaging

自引率

0.00%

发文量