Evaluation Framework Design of Spoken Term Detection Study at the NTCIR-9 IR for Spoken Documents Task

Journal of Information Processing. Pub Date: 2012-12-14. DOI: 10.5715/JNLP.19.329

H. Nishizaki, T. Akiba, K. Aikawa, Tatsuya Kawahara, T. Matsui

Citations: 4

Abstract

This paper describes the design of the spoken term detection (STD) studies and their evaluation framework in the STD sub-task of the NTCIR-9 IR for Spoken Documents (SpokenDoc) task. STD is one of the information access technologies for spoken documents. The goal of the STD sub-task is to rapidly detect occurrences of a given query term, consisting of a word or a sequence of a few words, in the spoken documents included in the Corpus of Spontaneous Japanese. To run the sub-task successfully, we designed the sub-task and its evaluation methods and arranged the task schedule. In the end, seven teams participated in the STD sub-task and submitted 18 STD results. This paper explains the details of the STD sub-task we conducted, the data used in it, how the speech recognition transcriptions were produced for distribution, the evaluation measures, the participants' techniques, and the participants' evaluation results.
