片上英语语音识别系统*

IF 5.2 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Tsinghua Science and Technology Pub Date : 2011-02-01 DOI:10.1016/S1007-0214(11)70015-3

Liu Hong (刘鸿), Qian Yanmin (钱彦旻), Liu Jia (刘加)

{"title":"片上英语语音识别系统*","authors":"Liu Hong (刘鸿), Qian Yanmin (钱彦旻), Liu Jia (刘加)","doi":"10.1016/S1007-0214(11)70015-3","DOIUrl":null,"url":null,"abstract":"<div><p>An English speech recognition system was implemented on a chip, called speech system-on-chip (SoC). The SoC included an application specific integrated circuit with a vector accelerator to improve performance. The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system, with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%.</p></div>","PeriodicalId":60306,"journal":{"name":"Tsinghua Science and Technology","volume":null,"pages":null},"PeriodicalIF":5.2000,"publicationDate":"2011-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1007-0214(11)70015-3","citationCount":"8","resultStr":"{\"title\":\"English Speech Recognition System on Chip*\",\"authors\":\"Liu Hong (刘鸿), Qian Yanmin (钱彦旻), Liu Jia (刘加)\",\"doi\":\"10.1016/S1007-0214(11)70015-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>An English speech recognition system was implemented on a chip, called speech system-on-chip (SoC). The SoC included an application specific integrated circuit with a vector accelerator to improve performance. The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system, with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%.</p></div>\",\"PeriodicalId\":60306,\"journal\":{\"name\":\"Tsinghua Science and Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2011-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/S1007-0214(11)70015-3\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tsinghua Science and Technology\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1007021411700153\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsinghua Science and Technology","FirstCategoryId":"1093","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1007021411700153","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 8

摘要

一个英语语音识别系统被实现在一个芯片上，称为语音系统芯片(SoC)。该SoC包括一个带有矢量加速器的特定应用集成电路，以提高性能。基于连续密度隐马尔可夫模型识别算法的子词模型在一个非常便宜的语音芯片上运行。该算法采用两阶段固定宽度波束搜索基线系统，采用变波束宽度剪枝策略和帧同步字级剪枝策略，显著缩短了识别时间。测试表明，该方法与原系统相比，识别时间缩短了近6倍，内存大小减少了近2倍，对于600字的识别任务，准确率下降不到1%，识别准确率约为98%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

English Speech Recognition System on Chip*

An English speech recognition system was implemented on a chip, called speech system-on-chip (SoC). The SoC included an application specific integrated circuit with a vector accelerator to improve performance. The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system, with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Tsinghua Science and Technology

CiteScore

12.10

自引率

0.00%

发文量

2340