Russian Sign Language Database for Clinical Use: Data and Annotation Peculiarities

NSU Vestnik. Series: Linguistics and Intercultural Communication Pub Date : 2022-10-14 DOI:10.25205/1818-7935-2022-20-3-90-108

I. Kagirov, D. Ryumin

{"title":"Russian Sign Language Database for Clinical Use: Data and Annotation Peculiarities","authors":"I. Kagirov, D. Ryumin","doi":"10.25205/1818-7935-2022-20-3-90-108","DOIUrl":null,"url":null,"abstract":"This work has been carried out by the members of the Laboratory of Speech and Multimodal Interfaces of the St. Petersburg Federal Research Center of the Russian Academy of Sciences within an interdisciplinary research project aimed at the creation of an automatic Russian sign language translation system. The paper presents the design of a Russian sign language digital database for a specific subject area, namely, «The first visit to doctor».Our database is meant to be used first of all a dataset for training neural network-based systems of automatic translation from Russian sign language. But also, it can be of interest for linguistics of sign languages in general since the approach elucidates solution of a number of distorting phenomena typical for continuous sign language tracking, such as epenthesis, assimilation, reduction, and hold deletion.The principal difference between the presented video data and other datasets developed for similar purposes is the use of continuous sign utterances and elements of Russian sign language proper instead of the so-called “signed Russian” (that is, the visual form of the Russian spoken language), popular in deaf schooling.One of the most challenging problems, dealt with in the paper, is an automatic segmentation of sign speech into separate meaningful units with clearly defined boundaries. Its solution is hampered by signs’ deformations in continuous speech. In this paper, the segmentation is carried out within the framework of the movement-hold model. This approach allows extraction of their functional core as well as annotation of possible changes likely to appear in different signs’ segments. Accordingly, each utterance was subdivided into segments of motion and hold, and the resulting sign schemes were then entered into a separate directory containing information about the main parameters of each hold, including the possibilities of change due to immediate environment. The result of this work is a full decomposition of the signs, forming the database which can find its application in different statistical models of Russian Sign Language.Among other linguistic peculiarities of the database should be noted lexical variability of signs, mouthing and code switching between Russian sign language and “signed Russian”. Also, our signers learned Russian sign language in different regions of Russia, thus the collected data is potentially a source for research in dialectal variability of Russian sign language.","PeriodicalId":434662,"journal":{"name":"NSU Vestnik. Series: Linguistics and Intercultural Communication","volume":"191 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NSU Vestnik. Series: Linguistics and Intercultural Communication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25205/1818-7935-2022-20-3-90-108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This work has been carried out by the members of the Laboratory of Speech and Multimodal Interfaces of the St. Petersburg Federal Research Center of the Russian Academy of Sciences within an interdisciplinary research project aimed at the creation of an automatic Russian sign language translation system. The paper presents the design of a Russian sign language digital database for a specific subject area, namely, «The first visit to doctor».Our database is meant to be used first of all a dataset for training neural network-based systems of automatic translation from Russian sign language. But also, it can be of interest for linguistics of sign languages in general since the approach elucidates solution of a number of distorting phenomena typical for continuous sign language tracking, such as epenthesis, assimilation, reduction, and hold deletion.The principal difference between the presented video data and other datasets developed for similar purposes is the use of continuous sign utterances and elements of Russian sign language proper instead of the so-called “signed Russian” (that is, the visual form of the Russian spoken language), popular in deaf schooling.One of the most challenging problems, dealt with in the paper, is an automatic segmentation of sign speech into separate meaningful units with clearly defined boundaries. Its solution is hampered by signs’ deformations in continuous speech. In this paper, the segmentation is carried out within the framework of the movement-hold model. This approach allows extraction of their functional core as well as annotation of possible changes likely to appear in different signs’ segments. Accordingly, each utterance was subdivided into segments of motion and hold, and the resulting sign schemes were then entered into a separate directory containing information about the main parameters of each hold, including the possibilities of change due to immediate environment. The result of this work is a full decomposition of the signs, forming the database which can find its application in different statistical models of Russian Sign Language.Among other linguistic peculiarities of the database should be noted lexical variability of signs, mouthing and code switching between Russian sign language and “signed Russian”. Also, our signers learned Russian sign language in different regions of Russia, thus the collected data is potentially a source for research in dialectal variability of Russian sign language.

查看原文本刊更多论文

临床用俄语手语数据库:数据和注释特点

这项工作是由俄罗斯科学院圣彼得堡联邦研究中心语音和多模态接口实验室的成员在一个跨学科研究项目中进行的，该项目旨在创建一个自动俄语手语翻译系统。本文介绍了针对特定主题领域的俄语手语数字数据库的设计，即“第一次看医生”。我们的数据库首先是用来训练基于神经网络的俄罗斯手语自动翻译系统。但是，它也可以引起一般手语语言学的兴趣，因为该方法阐明了连续手语跟踪中典型的一些扭曲现象的解决方案，例如扩大，同化，减少和保留删除。所呈现的视频数据与为类似目的开发的其他数据集之间的主要区别在于使用了连续的手势话语和俄罗斯手语本身的元素，而不是所谓的“手语俄语”(即俄语口语的视觉形式)，这在聋哑学校中很流行。本文研究的最具挑战性的问题之一是将手语语音自动分割成具有明确边界的独立有意义的单元。它的解决受到连续语音中符号变形的阻碍。在本文中，分割是在移动保持模型的框架内进行的。这种方法可以提取它们的功能核心，并注释可能出现在不同标志部分的可能变化。因此，每个话语被细分为运动和保持的片段，然后产生的标志方案被输入到一个单独的目录中，其中包含关于每个保持的主要参数的信息，包括由于直接环境而变化的可能性。本工作的结果是对这些符号进行了全面的分解，形成了一个数据库，该数据库可以应用于俄语手语的不同统计模型。在该数据库的其他语言特点中，应该注意到俄语手语和“手语俄语”之间的词法变化，口型和代码转换。此外，我们的签名者在俄罗斯不同地区学习俄语手语，因此收集的数据可能是研究俄语手语方言变异性的潜在来源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NSU Vestnik. Series: Linguistics and Intercultural Communication

自引率

0.00%

发文量