{"title":"Constructing Global Coherence Representations: Identifying Interpretability and Coherences of Transformer Attention in Time Series Data","authors":"Leonid Schwenke, Martin Atzmueller","doi":"10.1109/DSAA53316.2021.9564126","DOIUrl":null,"url":null,"abstract":"Transformer models have shown significant advances recently based on the general concept of Attention — to focus on specifically important and relevant parts of the input data. However, methods for enhancing their interpretability and explainability are still lacking. This is the problem which we tackle in this paper, to make Multi-Headed Attention more interpretable and explainable for time series classification. We present a method for constructing global coherence representations from Multi-Headed Attention of Transformer architectures. Accordingly, we present abstraction and interpretation methods, leading to intuitive visualizations of the respective attention patterns. We evaluate our proposed approach and the presented methods on several datasets demonstrating their efficacy.","PeriodicalId":129612,"journal":{"name":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSAA53316.2021.9564126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Transformer models have recently achieved significant advances based on the general concept of attention: focusing on the most important and relevant parts of the input data. However, methods for enhancing their interpretability and explainability are still lacking. This is the problem we tackle in this paper, aiming to make Multi-Headed Attention more interpretable and explainable for time series classification. We present a method for constructing global coherence representations from the Multi-Headed Attention of Transformer architectures. Accordingly, we present abstraction and interpretation methods that lead to intuitive visualizations of the respective attention patterns. We evaluate the proposed approach and the presented methods on several datasets, demonstrating their efficacy.
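To make the core idea more concrete, the sketch below illustrates one plausible way to aggregate Multi-Headed Attention into a single global view over a set of time series. This is a minimal illustrative assumption, not the authors' actual construction: the function name, the tensor shapes, and the choice of simple averaging over samples and heads are all hypothetical.

```python
import numpy as np

def global_coherence_map(attentions):
    """Aggregate per-head attention matrices into one global map.

    attentions: array of shape (n_samples, n_heads, seq_len, seq_len)
    holding the softmax attention weights for each input series.
    Returns a (seq_len, seq_len) matrix averaged over samples and heads.
    """
    # Averaging over samples (axis 0) and heads (axis 1) yields a single
    # coherence matrix over the time steps, which can then be visualized
    # (e.g., as a heatmap) to inspect global attention patterns.
    return attentions.mean(axis=(0, 1))

# Hypothetical usage with random stand-in weights instead of a trained model:
rng = np.random.default_rng(0)
attn = rng.random((8, 4, 50, 50))          # 8 series, 4 heads, length 50
attn /= attn.sum(axis=-1, keepdims=True)   # normalize rows like softmax
coherence = global_coherence_map(attn)
print(coherence.shape)                      # (50, 50)
```

In practice the attention weights would be collected from a Transformer classifier during inference; the paper's abstraction and interpretation steps go beyond this naive mean, but the sketch shows the kind of head-and-sample aggregation a global coherence representation builds on.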