Architecture for Low Power Large Vocabulary Speech Recognition

2006 IEEE International SOC Conference Pub Date : 2006-09-01 DOI:10.1109/SOCC.2006.283836

Dhruba Chandra, U. Pazhayaveetil, P. Franzon

引用次数: 10

Abstract

This paper proposes an architecture for real-time large vocabulary speech recognition on a mobile embedded device. The speech recognition system is based on Hidden Markov Model (HMM), which involves complex mathematical operations such as probability estimation and Viterbi decoding. This computational nature makes it power hungry and realtime recognition is not achieved by porting software solutions on embedded device. Our system architecture has a low power embedded processor and dedicated ASIC units for complex computations. These units operate at a low frequency of 50 MHz thus consuming low power. The system uses RAM for the intermediate values and flash memory to store acoustic and language models for speech recognition.

查看原文本刊更多论文

低功耗大词汇量语音识别体系结构

提出了一种基于移动嵌入式设备的实时大词汇量语音识别体系结构。语音识别系统基于隐马尔可夫模型(HMM)，涉及到概率估计和维特比解码等复杂的数学运算。这种计算性质使得它非常耗电，并且通过在嵌入式设备上移植软件解决方案无法实现实时识别。我们的系统架构具有低功耗嵌入式处理器和专用的ASIC单元，用于复杂的计算。这些装置工作在50兆赫的低频率，因此消耗低功率。该系统使用RAM作为中间值，并使用闪存存储语音识别的声学和语言模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2006 IEEE International SOC Conference

自引率

0.00%

发文量