N. Bogdanova-Beglarian, O. Blinova, Tatiana Y. Sherstinova, G. Martynenko, Kristina Zaides
2018 23rd Conference of Open Innovations Association (FRUCT), published 2018-11-01
DOI: 10.23919/FRUCT.2018.8588101 (https://doi.org/10.23919/FRUCT.2018.8588101)
Pragmatic Markers in Russian Spoken Speech: an Experience of Systematization and Annotation for the Improvement of NLP Tasks
Pragmatic markers are an integral part of spontaneous spoken speech; however, they still lack a systematic scientific description. These speech elements perform mostly pragmatic functions and are characterized by an almost complete absence (or significant weakening) of lexical and/or grammatical meaning. The frequency of pragmatic markers in speech exceeds that of almost all content words. For this reason, improving many current NLP tasks requires a proper systematization of pragmatic markers and effective, reliable schemes for their annotation. In the current research, we describe a preliminary set of pragmatic marker categories and present the results of two stages of their pilot annotation, carried out independently by a group of experts.