N. Bogdanova-Beglarian, O. Blinova, Tatiana Y. Sherstinova, G. Martynenko, Kristina Zaides
{"title":"Pragmatic Markers in Russian Spoken Speech: an Experience of Systematization and Annotation for the Improvement of NLP Tasks","authors":"N. Bogdanova-Beglarian, O. Blinova, Tatiana Y. Sherstinova, G. Martynenko, Kristina Zaides","doi":"10.23919/FRUCT.2018.8588101","DOIUrl":null,"url":null,"abstract":"Pragmatic markers are an integral part of spontaneous spoken speech, however, they still have no systematic scientific description. These speech elements perform mostly pragmatic functions and are characterized by almost complete absence (or significant weakening) of lexical and/or grammatical meaning. The frequency of pragmatic markers in speech exceeds that of almost all content words. Because of that, for the improvement of many current NLP tasks, it is very important to obtain proper systematization of pragmatic markers and to develop effective and reliable schemes for their annotation. In current research, we describe the preliminary set of pragmatic markers categories and present the results of two stages of their pilot annotation made independently by a group of experts.","PeriodicalId":183812,"journal":{"name":"2018 23rd Conference of Open Innovations Association (FRUCT)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 23rd Conference of Open Innovations Association (FRUCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/FRUCT.2018.8588101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Pragmatic markers are an integral part of spontaneous spoken speech, however, they still have no systematic scientific description. These speech elements perform mostly pragmatic functions and are characterized by almost complete absence (or significant weakening) of lexical and/or grammatical meaning. The frequency of pragmatic markers in speech exceeds that of almost all content words. Because of that, for the improvement of many current NLP tasks, it is very important to obtain proper systematization of pragmatic markers and to develop effective and reliable schemes for their annotation. In current research, we describe the preliminary set of pragmatic markers categories and present the results of two stages of their pilot annotation made independently by a group of experts.