Topic dependent language modelling for spoken term detection

2014 22nd European Signal Processing Conference (EUSIPCO) Pub Date : 2014-11-13 DOI:10.5281/ZENODO.44201

Shahram Kalantari, David Dean, S. Sridharan, R. Wallace

引用次数: 5

Abstract

This paper investigates the effect of topic dependent language models (TDLM) on phonetic spoken term detection (STD) using dynamic match lattice spotting (DMLS). Phonetic STD consists of two steps: indexing and search. The accuracy of indexing audio segments into phone sequences using phone recognition methods directly affects the accuracy of the final STD system. If the topic of a document in known, recognizing the spoken words and indexing them to an intermediate representation is an easier task and consequently, detecting a search word in it will be more accurate and robust. In this paper, we propose the use of TDLMs in the indexing stage to improve the accuracy of STD in situations where the topic of the audio document is known in advance. It is shown that using TDLMs instead of the traditional general language model (GLM) improves STD performance according to figure of merit (FOM) criteria.

查看原文本刊更多论文

基于主题的口语术语检测语言模型

本文研究了主题相关语言模型(TDLM)对动态匹配点阵(DMLS)语音口语词检测(STD)的影响。语音STD包括两个步骤:索引和搜索。利用电话识别方法将音频片段编入电话序列的准确性直接影响到最终STD系统的准确性。如果文档的主题是已知的，那么识别口语单词并将其索引到中间表示是一项更容易的任务，因此，在其中检测搜索词将更加准确和健壮。在本文中，我们建议在索引阶段使用tdlm，以提高在预先知道音频文档主题的情况下STD的准确性。结果表明，使用tdlm代替传统的通用语言模型(GLM)可以根据优点图(FOM)标准提高STD性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 22nd European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量