A low memory bandwidth Gaussian mixture model (GMM) processor for 20,000-word real-time speech recognition FPGA system

2008 International Conference on Field-Programmable Technology Pub Date : 2008-12-01 DOI:10.1109/FPT.2008.4762413

Kazuo Miura, Hiroki Noguchi, H. Kawaguchi, M. Yoshimoto

引用次数: 19

Abstract

We propose a GMM processor for large vocabulary real-time continuous speech recognition. This processor achieves low operating frequency and low memory bandwidth using parallelization and vector look-ahead schemes, which are suitable to FPGA implementation. We designed the proposed processor on a Celoxica RC250 FPGA board, and confirmed that the required frequency and memory bandwidth for real-time operation are reduced by 89.8% and 84.2%, respectively. The 20,000-word real-time GMM computation is made at a frequency of 30.4 MHz and memory bandwidth of 47 Mbps, on the prototype.

查看原文本刊更多论文

用于2万字实时语音识别的低存储带宽高斯混合模型(GMM)处理器

提出了一种用于大词汇量实时连续语音识别的GMM处理器。该处理器采用并行化和矢量预查方案，实现了低工作频率和低存储带宽，适合FPGA实现。我们在Celoxica RC250 FPGA板上设计了所提出的处理器，并证实实时操作所需的频率和内存带宽分别降低了89.8%和84.2%。2万字的实时GMM计算在原型机上以30.4 MHz的频率和47mbps的内存带宽进行。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 International Conference on Field-Programmable Technology

自引率

0.00%

发文量