LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic challenge

2016 IEEE Spoken Language Technology Workshop (SLT) Pub Date : 2016-12-01 DOI:10.1109/SLT.2016.7846278

N. Tomashenko, Kevin Vythelingum, Anthony Rousseau, Y. Estève

引用次数: 17

Abstract

This paper describes the automatic speech recognition (ASR) systems developed by LIUM in the framework of the 2016 Multi-Genre Broadcast (MGB-2) Challenge in the Arabic language. LIUM participated in the first of the two proposed tasks, namely the speech-to-text transcription of Aljazeera recordings. We present the approaches and details found in our systems, as well as our results in the evaluation campaign: the primary LIUM ASR system attained the second position. The main aspects come from the use of GMM-derived features for training a DNN, combined with the use of time-delay neural networks for acoustic models, the use of two different approaches in order to automatically phonetize Arabic words, and finally, the training data selection strategy for acoustic and language models.

查看原文本刊更多论文

2016年多类型广播阿拉伯语挑战赛LIUM ASR系统

本文介绍了LIUM在2016年阿拉伯语多类型广播(MGB-2)挑战赛框架下开发的自动语音识别(ASR)系统。LIUM参与了两项拟议任务中的第一项，即半岛电视台录音的语音到文本转录。我们介绍了在我们的系统中发现的方法和细节，以及我们在评估活动中的结果:主要的LIUM ASR系统获得了第二名。主要方面来自使用gmm衍生的特征来训练DNN，结合使用时滞神经网络进行声学模型，使用两种不同的方法来自动拼音阿拉伯语单词，最后是声学和语言模型的训练数据选择策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 IEEE Spoken Language Technology Workshop (SLT)

自引率

0.00%

发文量