The GTM-UVIGO System for Albayzin 2018 Speech-to-Text Evaluation

IberSPEECH Conference Pub Date : 2018-11-21 DOI:10.21437/IBERSPEECH.2018-58

Laura Docío Fernández, C. García-Mateo

引用次数: 2

Abstract

This paper describes the Speech-to-Text system developed by the Multimedia Technologies Group (GTM) of the atlanTTic research center at the University of Vigo, for the Albayzin Speech-to-Text Challenge (S2T) organized in the Iberspeech 2018 conference. The large vocabulary automatic speech recognition system is built using the Kaldi toolkit. It uses an hybrid Deep Neural Network - Hidden Markov Model (DNN-HMM) for acoustic modeling, and a rescoring of a trigram based word-lattices, obtained in a ﬁrst decoding stage, with a fourgram language model or a language model based on a recurrent neural network. The system was evaluated only on the open set training condition.

查看原文本刊更多论文

Albayzin 2018语音到文本评价的GTM-UVIGO系统

本文描述了由维戈大学大西洋研究中心的多媒体技术小组(GTM)为Iberspeech 2018会议组织的Albayzin语音到文本挑战(S2T)开发的语音到文本系统。利用Kaldi工具箱构建了大词汇自动语音识别系统。它使用混合深度神经网络-隐马尔可夫模型(DNN-HMM)进行声学建模，并使用四格语言模型或基于循环神经网络的语言模型对在第一解码阶段获得的基于三格的词格进行重新评分。系统仅在开放集训练条件下进行评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IberSPEECH Conference

自引率

0.00%

发文量