Styling Words: A Simple and Natural Way to Increase Variability in Training Data Collection for Gesture Recognition
Woojin Kang, Intaek Jung, Daeho Lee, Jin-Hyuk Hong
Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI '21), May 6, 2021
DOI: https://doi.org/10.1145/3411764.3445457
Citations: 2
Abstract
Due to advances in deep learning, gestures have become a more common tool for human-computer interaction. When trained with a large amount of data, deep learning models show remarkable performance in gesture recognition. Since it is expensive and time-consuming to collect gesture data from people, we are often confronted with a practicality issue when managing the quantity and quality of training data. It is well known that increasing training data variability can help to improve the generalization performance of machine learning models. Thus, we directly intervene in the collection of gesture data to increase human gesture variability by adding some words (called styling words) into the data collection instructions, e.g., giving the instruction "perform gesture #1 faster" as opposed to "perform gesture #1." Through an in-depth analysis of gesture features and video-based gesture recognition, we have confirmed the advantageous use of styling words in gesture training data collection.
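
To make the instruction-styling idea concrete, the short Python sketch below shows one way a data-collection script could attach styling words to gesture prompts so that each participant receives varied instructions. This is only an illustrative sketch, not the authors' tooling: the gesture identifiers and the particular styling-word set (faster, slower, bigger, smaller) are assumptions made for the example.

    import random

    # Hypothetical styling words; the set used in the paper may differ.
    STYLING_WORDS = ["", "faster", "slower", "bigger", "smaller"]

    # Placeholder gesture identifiers for illustration.
    GESTURES = ["gesture #1", "gesture #2", "gesture #3"]

    def build_instructions(gestures, styling_words, repetitions=5, seed=0):
        """Generate collection prompts, pairing each gesture with a randomly
        chosen styling word for each repetition."""
        rng = random.Random(seed)
        prompts = []
        for gesture in gestures:
            for _ in range(repetitions):
                word = rng.choice(styling_words)
                # An empty styling word yields the plain, unstyled instruction.
                prompts.append(f"perform {gesture} {word}".strip())
        return prompts

    for prompt in build_instructions(GESTURES, STYLING_WORDS):
        print(prompt)

Running the sketch prints prompts such as "perform gesture #1 faster" alongside plain ones such as "perform gesture #1", matching the style of instruction described in the abstract.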