American sign language fingerspelling recognition with phonological feature-based tandem models

2012 IEEE Spoken Language Technology Workshop (SLT) Pub Date : 2012-12-01 DOI:10.1109/SLT.2012.6424208

Taehwan Kim, Karen Livescu, Gregory Shakhnarovich

引用次数: 20

Abstract

We study the recognition of fingerspelling sequences in American Sign Language from video using tandem-style models, in which the outputs of multilayer perceptron (MLP) classifiers are used as observations in a hidden Markov model (HMM)-based recognizer. We compare a baseline HMM-based recognizer, a tandem recognizer using MLP letter classifiers, and a tandem recognizer using MLP classifiers of phonological features. We present experiments on a database of fingerspelling videos. We find that the tandem approaches outperform an HMM-based baseline, and that phonological feature-based tandem models outperform letter-based tandem models.

查看原文本刊更多论文

基于语音特征串联模型的美国手语手势语拼写识别

我们使用串联式模型研究了美国手语视频中指纹拼写序列的识别，其中多层感知器(MLP)分类器的输出作为基于隐马尔可夫模型(HMM)的识别器的观察值。我们比较了基于基线hmm的识别器、使用MLP字母分类器的串联识别器和使用MLP语音特征分类器的串联识别器。我们在一个指纹拼写视频数据库上进行实验。我们发现串联方法优于基于hmm的基线，并且基于语音特征的串联模型优于基于字母的串联模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 IEEE Spoken Language Technology Workshop (SLT)

自引率

0.00%

发文量