The Transformer Neural Network Architecture for Part-of-Speech Tagging
A. A. Maksutov, Vladimir I. Zamyatovskiy, Viacheslav O. Morozov, S. Dmitriev
2021 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (ElConRus), published 2021-01-26
DOI: 10.1109/ElConRus51938.2021.9396231
Part-of-speech tagging (POS tagging) is one of the most important tasks in natural language processing. It consists of determining the part of speech of, and assigning an appropriate tag to, each word in a given sentence. The resulting tag sequence can be used on its own or as part of more complex tasks such as dependency and constituency parsing. POS tagging is a sequence-to-sequence task, and multilayer bidirectional LSTM networks are commonly used for it. Such networks are rather slow to train and to apply to large amounts of data, because each time step of the input sequence must be computed sequentially. This paper focuses on developing an accurate POS tagging model based on the original Transformer neural network architecture.
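The abstract's key contrast is that an LSTM must process tokens one time step at a time, while a Transformer computes every position from the whole sequence at once via self-attention. The following is a minimal pure-Python sketch of scaled dot-product self-attention over a toy "sentence" of token embeddings; it is an illustration of the general mechanism only, not the paper's implementation, and for simplicity it uses identity query/key/value projections.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def self_attention(X):
    """Scaled dot-product self-attention over a list of d-dim vectors.

    Unlike an LSTM, every output position is computed directly from all
    input positions -- there is no step-by-step recurrence, so the
    positions could be processed in parallel.
    """
    d = len(X[0])
    out = []
    for q in X:  # one query per token position
        # Attention scores of this query against every key (here, every token).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in X]
        w = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out.append([sum(wj * vj[i] for wj, vj in zip(w, X))
                    for i in range(d)])
    return out

# Toy 3-token "sentence" with 2-dimensional embeddings.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
Y = self_attention(X)
```

In a real POS tagger, each output vector `Y[i]` would then be fed through a per-token classification layer that predicts the tag of token `i`, so the tag sequence has the same length as the input sentence.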