Multimodal optimal matching and augmentation method for small sample gesture recognition.

IF 5 4区生物学 Q1 BIOLOGY

Bioscience trends Pub Date : 2025-03-06 Epub Date: 2025-01-25 DOI:10.5582/bst.2024.01370

Wenli Zhang, Bo Liu, Tingsong Zhao, Shuyan Qie

{"title":"Multimodal optimal matching and augmentation method for small sample gesture recognition.","authors":"Wenli Zhang, Bo Liu, Tingsong Zhao, Shuyan Qie","doi":"10.5582/bst.2024.01370","DOIUrl":null,"url":null,"abstract":"<p><p>In human-computer interaction, gesture recognition based on physiological signals offers advantages such as a more natural and fast interaction mode and less constrained by the environment than visual-based. Surface electromyography-based gesture recognition has significantly progressed. However, since individuals have physical differences, researchers must collect data multiple times from each user to train the deep learning model. This data acquisition process can be particularly burdensome for non-healthy users. Researchers are currently exploring transfer learning and data augmentation techniques to enhance the accuracy of small-sample gesture recognition models. However, challenges persist, such as negative transfer and limited diversity in training samples, leading to suboptimal recognition performance. Therefore, We introduce motion information into sEMG-based recognition and propose a multimodal optimal matching and augmentation method for small sample gesture recognition, achieving efficient gesture recognition with only one acquisition per gesture. Firstly, this method utilizes the optimal matching signal selection module to select the most similar signals from the existing data to the new user as the training set, reducing inter-domain differences. Secondly, the similarity calculation augmentation module enhances the diversity of the training set. Finally, the Modal-type embedding enhances the information interaction between each mode signal. We evaluated the effectiveness on Self-collected Stroke Patient, the Ninapro DB1 dataset and the Ninapro DB5 dataset and achieved accuracies of 93.69%, 91.65% and 98.56%, respectively. These results demonstrate that the method achieved performance comparable to traditional recognition models while significantly reducing the collected data.</p>","PeriodicalId":8957,"journal":{"name":"Bioscience trends","volume":" ","pages":"125-139"},"PeriodicalIF":5.0000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioscience trends","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.5582/bst.2024.01370","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/25 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

In human-computer interaction, gesture recognition based on physiological signals offers advantages such as a more natural and fast interaction mode and less constrained by the environment than visual-based. Surface electromyography-based gesture recognition has significantly progressed. However, since individuals have physical differences, researchers must collect data multiple times from each user to train the deep learning model. This data acquisition process can be particularly burdensome for non-healthy users. Researchers are currently exploring transfer learning and data augmentation techniques to enhance the accuracy of small-sample gesture recognition models. However, challenges persist, such as negative transfer and limited diversity in training samples, leading to suboptimal recognition performance. Therefore, We introduce motion information into sEMG-based recognition and propose a multimodal optimal matching and augmentation method for small sample gesture recognition, achieving efficient gesture recognition with only one acquisition per gesture. Firstly, this method utilizes the optimal matching signal selection module to select the most similar signals from the existing data to the new user as the training set, reducing inter-domain differences. Secondly, the similarity calculation augmentation module enhances the diversity of the training set. Finally, the Modal-type embedding enhances the information interaction between each mode signal. We evaluated the effectiveness on Self-collected Stroke Patient, the Ninapro DB1 dataset and the Ninapro DB5 dataset and achieved accuracies of 93.69%, 91.65% and 98.56%, respectively. These results demonstrate that the method achieved performance comparable to traditional recognition models while significantly reducing the collected data.

查看原文本刊更多论文

小样本手势识别的多模态优化匹配与增强方法。

在人机交互中，基于生理信号的手势识别比基于视觉的手势识别具有更自然、更快速的交互方式和更少的环境约束等优点。基于表面肌电图的手势识别已经取得了显著的进展。然而，由于个体存在生理差异，研究人员必须从每个用户那里多次收集数据来训练深度学习模型。对于不健康的用户来说，这种数据获取过程可能特别繁重。研究人员目前正在探索迁移学习和数据增强技术，以提高小样本手势识别模型的准确性。然而，挑战仍然存在，例如训练样本的负迁移和有限的多样性，导致次优识别性能。因此，我们将运动信息引入到基于表面肌电信号的识别中，并提出了一种多模态最优匹配和增强方法，用于小样本手势识别，实现了每个手势仅需一次采集的高效手势识别。首先，该方法利用最优匹配信号选择模块，从现有数据中选择与新用户最相似的信号作为训练集，减少域间差异；其次，相似度计算增强模块增强了训练集的多样性；最后，模态型嵌入增强了各模态信号之间的信息交互。我们对Self-collected Stroke患者、Ninapro DB1数据集和Ninapro DB5数据集进行了有效性评估，准确率分别为93.69%、91.65%和98.56%。这些结果表明，该方法在显著减少收集数据的同时，取得了与传统识别模型相当的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Bioscience trends BIOLOGY-

CiteScore

13.60

自引率

1.80%

发文量

审稿时长

>12 weeks

期刊介绍： BioScience Trends (Print ISSN 1881-7815, Online ISSN 1881-7823) is an international peer-reviewed journal. BioScience Trends devotes to publishing the latest and most exciting advances in scientific research. Articles cover fields of life science such as biochemistry, molecular biology, clinical research, public health, medical care system, and social science in order to encourage cooperation and exchange among scientists and clinical researchers.