VLSP 2021 - ASR Challenge for Vietnamese Automatic Speech Recognition

VNU Journal of Science: Computer Science and Communication Engineering. Pub Date: 2022-06-30. DOI: 10.25073/2588-1086/vnucsce.356

Van Hai Do


Abstract

Vietnamese speech recognition has recently attracted attention from research groups in both academia and industry. This paper presents a Vietnamese automatic speech recognition (ASR) challenge for the eighth annual workshop on Vietnamese Language and Speech Processing (VLSP 2021). The challenge has two sub-tasks. The first, ASR-Task1, focuses on developing a full ASR pipeline from scratch using both labeled and unlabeled training data provided by the organizer. The second, ASR-Task2, focuses on spontaneous speech in different real-world scenarios, e.g., meeting conversations and lectures. In ASR-Task2, participants may use any available data sources to develop their models, without limitation. Model quality is evaluated by the Syllable Error Rate (SyER) metric.
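
The abstract does not define SyER. As a rough illustration only, the sketch below assumes it is computed like word error rate, but over syllables: the minimum number of syllable substitutions, deletions, and insertions needed to turn the hypothesis into the reference, divided by the number of reference syllables. It also assumes that syllables are separated by whitespace (as in standard written Vietnamese) and that comparison is case-insensitive. The function name syllable_error_rate is hypothetical and is not taken from the challenge's scoring tools.

```python
def syllable_error_rate(reference: str, hypothesis: str) -> float:
    """Illustrative SyER: syllable-level edit distance / reference length.

    Assumption (not from the paper): syllables are whitespace-separated
    tokens and matching is case-insensitive.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one syllable")

    # prev[j] holds the edit distance between the reference syllables
    # processed so far and the first j hypothesis syllables.
    prev = list(range(len(hyp) + 1))
    for i, ref_syl in enumerate(ref, start=1):
        curr = [i]  # aligning i reference syllables to nothing: i deletions
        for j, hyp_syl in enumerate(hyp, start=1):
            cost = 0 if ref_syl == hyp_syl else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference syllable missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis syllable)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One of four reference syllables is missing from the hypothesis.
    print(syllable_error_rate("xin chào các bạn", "xin chào bạn"))
```

Under these assumptions, a hypothesis that drops one syllable from a four-syllable reference scores 0.25. An official scorer may normalize text differently (punctuation, numbers, foreign words), so values from this sketch are not comparable to reported challenge results.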
