Implements of Transformer in NLP and DKT

2022 4th International Conference on Artificial Intelligence and Advanced Manufacturing (AIAM) Pub Date : 2022-10-01 DOI:10.1109/AIAM57466.2022.00163

Haotong Gong

引用次数: 0

Abstract

Transformer is a strong model proposed by Google team in 2017. It was a huge improvement that it entirely abandons the mechanism of Recurrent Neural Network (RNN) and Convolutional Neural Network (RNN). As a result, soon it became a popular choice in a diversity of scenarios. A typical implement of Transformer is for handling text-like input sequences, such as Natural Language Process (NLP) and Knowledge Tracing (KT). Although Transformer is a strong model, it still has a number of improvements. Some state-of-the-art Deep-Learning-based models (e.g., BERT in [2], SAINT in [3], etc.) are based on Transformer. In this paper, I give some examples of application of Transformer or Transformer-based models and summarize the pros and cons of Transformer.

查看原文本刊更多论文

NLP和DKT中变压器的实现

Transformer是谷歌团队在2017年提出的一个强模型。这是一个巨大的进步，它完全抛弃了循环神经网络(RNN)和卷积神经网络(RNN)的机制。因此，它很快成为各种场景中的流行选择。Transformer的典型实现是用于处理类似文本的输入序列，例如自然语言处理(NLP)和知识跟踪(KT)。尽管Transformer是一个强大的模型，但它仍然有许多改进。一些最先进的基于深度学习的模型(例如，[2]中的BERT，[3]中的SAINT等)是基于Transformer的。本文给出了Transformer或基于Transformer的模型的一些应用实例，并总结了Transformer的优缺点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 4th International Conference on Artificial Intelligence and Advanced Manufacturing (AIAM)

自引率

0.00%

发文量