使用变压器适配器扩展本地语言标识

International Conference on Natural Language and Speech Processing Pub Date : 2022-11-18 DOI:10.48550/arXiv.2211.10117

Ahmet Uluslu, G. Schneider

{"title":"使用变压器适配器扩展本地语言标识","authors":"Ahmet Uluslu, G. Schneider","doi":"10.48550/arXiv.2211.10117","DOIUrl":null,"url":null,"abstract":"Native language identification (NLI) is the task of automatically identifying the native language (L1) of an individual based on their language production in a learned language. It is useful for a variety of purposes including marketing, security and educational applications. NLI is usually framed as a multi-label classification task, where numerous designed features are combined to achieve state-of-the-art results. Recently deep generative approach based on transformer decoders (GPT-2) outperformed its counterparts and achieved the best results on the NLI benchmark datasets. We investigate this approach to determine the practical implications compared to traditional state-of-the-art NLI systems. We introduce transformer adapters to address memory limitations and improve training/inference speed to scale NLI applications for production.","PeriodicalId":405017,"journal":{"name":"International Conference on Natural Language and Speech Processing","volume":"109 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Scaling Native Language Identification with Transformer Adapters\",\"authors\":\"Ahmet Uluslu, G. Schneider\",\"doi\":\"10.48550/arXiv.2211.10117\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Native language identification (NLI) is the task of automatically identifying the native language (L1) of an individual based on their language production in a learned language. It is useful for a variety of purposes including marketing, security and educational applications. NLI is usually framed as a multi-label classification task, where numerous designed features are combined to achieve state-of-the-art results. Recently deep generative approach based on transformer decoders (GPT-2) outperformed its counterparts and achieved the best results on the NLI benchmark datasets. We investigate this approach to determine the practical implications compared to traditional state-of-the-art NLI systems. We introduce transformer adapters to address memory limitations and improve training/inference speed to scale NLI applications for production.\",\"PeriodicalId\":405017,\"journal\":{\"name\":\"International Conference on Natural Language and Speech Processing\",\"volume\":\"109 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Natural Language and Speech Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2211.10117\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Natural Language and Speech Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2211.10117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

母语识别(NLI)是一种基于个体在学习语言中产生的语言来自动识别其母语(L1)的任务。它对各种用途都很有用，包括营销、安全和教育应用。NLI通常被定义为一个多标签分类任务，其中许多设计特征被结合起来以获得最先进的结果。近年来，基于变压器解码器的深度生成方法(GPT-2)在NLI基准数据集上取得了较好的效果。我们研究了这种方法，以确定与传统的最先进的NLI系统相比的实际意义。我们引入变压器适配器来解决内存限制和提高训练/推理速度，以扩展NLI应用的生产。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Scaling Native Language Identification with Transformer Adapters

Native language identification (NLI) is the task of automatically identifying the native language (L1) of an individual based on their language production in a learned language. It is useful for a variety of purposes including marketing, security and educational applications. NLI is usually framed as a multi-label classification task, where numerous designed features are combined to achieve state-of-the-art results. Recently deep generative approach based on transformer decoders (GPT-2) outperformed its counterparts and achieved the best results on the NLI benchmark datasets. We investigate this approach to determine the practical implications compared to traditional state-of-the-art NLI systems. We introduce transformer adapters to address memory limitations and improve training/inference speed to scale NLI applications for production.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Natural Language and Speech Processing

自引率

0.00%

发文量