Scaling Native Language Identification with Transformer Adapters

International Conference on Natural Language and Speech Processing Pub Date : 2022-11-18 DOI:10.48550/arXiv.2211.10117

Ahmet Uluslu, G. Schneider

引用次数: 2

Abstract

Native language identification (NLI) is the task of automatically identifying the native language (L1) of an individual based on their language production in a learned language. It is useful for a variety of purposes including marketing, security and educational applications. NLI is usually framed as a multi-label classification task, where numerous designed features are combined to achieve state-of-the-art results. Recently deep generative approach based on transformer decoders (GPT-2) outperformed its counterparts and achieved the best results on the NLI benchmark datasets. We investigate this approach to determine the practical implications compared to traditional state-of-the-art NLI systems. We introduce transformer adapters to address memory limitations and improve training/inference speed to scale NLI applications for production.

查看原文本刊更多论文

使用变压器适配器扩展本地语言标识

母语识别(NLI)是一种基于个体在学习语言中产生的语言来自动识别其母语(L1)的任务。它对各种用途都很有用，包括营销、安全和教育应用。NLI通常被定义为一个多标签分类任务，其中许多设计特征被结合起来以获得最先进的结果。近年来，基于变压器解码器的深度生成方法(GPT-2)在NLI基准数据集上取得了较好的效果。我们研究了这种方法，以确定与传统的最先进的NLI系统相比的实际意义。我们引入变压器适配器来解决内存限制和提高训练/推理速度，以扩展NLI应用的生产。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Natural Language and Speech Processing

自引率

0.00%

发文量