Position Information in Transformers: An Overview

IF 5.3 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Computational Linguistics Pub Date : 2021-02-22 DOI:10.1162/coli_a_00445

Philipp Dufter, Martin Schmitt, Hinrich Schütze

引用次数: 69

Abstract

Abstract Transformers are arguably the main workhorse in recent natural language processing research. By definition, a Transformer is invariant with respect to reordering of the input. However, language is inherently sequential and word order is essential to the semantics and syntax of an utterance. In this article, we provide an overview and theoretical comparison of existing methods to incorporate position information into Transformer models. The objectives of this survey are to (1) showcase that position information in Transformer is a vibrant and extensive research area; (2) enable the reader to compare existing methods by providing a unified notation and systematization of different approaches along important model dimensions; (3) indicate what characteristics of an application should be taken into account when selecting a position encoding; and (4) provide stimuli for future research.

查看原文本刊更多论文

变压器中的位置信息：综述

摘要在最近的自然语言处理研究中，变形器可以说是主要的主力。根据定义，Transformer对于输入的重新排序是不变的。然而，语言本质上是顺序的，语序对话语的语义和句法至关重要。在本文中，我们提供了一个概述和理论比较现有的方法，以纳入位置信息到Transformer模型。本次调查的目的是:(1)展示Transformer中的位置信息是一个充满活力和广泛的研究领域;(2)根据重要的模型维度为不同的方法提供统一的符号和系统化，使读者能够比较现有的方法;(3)表明在选择位置编码时应考虑应用程序的哪些特征;(4)为今后的研究提供激励。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computational Linguistics 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

0.00%

发文量

审稿时长

>12 weeks

期刊介绍： Computational Linguistics, the longest-running publication dedicated solely to the computational and mathematical aspects of language and the design of natural language processing systems, provides university and industry linguists, computational linguists, AI and machine learning researchers, cognitive scientists, speech specialists, and philosophers with the latest insights into the computational aspects of language research.