Comparison of attention models and post-hoc explanation methods for embryo stage identification: a case study

ICPR Workshops Pub Date : 2022-05-13 DOI:10.48550/arXiv.2205.06546

T. Gomez, Thomas Fr'eour, H. Mouchère

引用次数: 2

Abstract

An important limitation to the development of AI-based solutions for In Vitro Fertilization (IVF) is the black-box nature of most state-of-the-art models, due to the complexity of deep learning architectures, which raises potential bias and fairness issues. The need for interpretable AI has risen not only in the IVF field but also in the deep learning community in general. This has started a trend in literature where authors focus on designing objective metrics to evaluate generic explanation methods. In this paper, we study the behavior of recently proposed objective faithfulness metrics applied to the problem of embryo stage identification. We benchmark attention models and post-hoc methods using metrics and further show empirically that (1) the metrics produce low overall agreement on the model ranking and (2) depending on the metric approach, either post-hoc methods or attention models are favored. We conclude with general remarks about the difficulty of defining faithfulness and the necessity of understanding its relationship with the type of approach that is favored.

查看原文本刊更多论文

胚胎阶段识别的注意模型与事后解释方法之比较:个案研究

基于人工智能的体外受精(IVF)解决方案发展的一个重要限制是，由于深度学习架构的复杂性，大多数最先进模型的黑箱性质，这引发了潜在的偏见和公平性问题。不仅在试管婴儿领域，而且在整个深度学习社区，对可解释人工智能的需求都在上升。这已经在文献中开始了一种趋势，作者专注于设计客观的指标来评估通用的解释方法。在本文中，我们研究了最近提出的用于胚胎阶段识别问题的客观忠实度度量的行为。我们使用指标对注意力模型和事后方法进行基准测试，并进一步通过经验表明:(1)指标对模型排名的总体一致性较低;(2)根据指标方法的不同，事后方法或注意力模型更受青睐。我们总结了关于定义忠诚的困难以及理解其与所青睐的方法类型之间关系的必要性的一般性评论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ICPR Workshops

自引率

0.00%

发文量