Balancing data-driven and rule-based approaches in the context of a Multimodal Conversational System

2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721) Pub Date : 2003-11-30 DOI:10.1109/ASRU.2003.1318444

S. Bangalore, Michael Johnston

引用次数: 35

Abstract

We address the issue of combining data-driven and grammar-based models for rapid prototyping of a multimodal conversational system. Moderate-sized rule-based spoken language models for recognition and understanding are easy to develop and provide the ability to prototype conversational applications rapidly. However, scalability of such systems is a bottleneck due to the heavy cost of authoring and maintenance of rule sets and inevitable brittleness due to lack of coverage in the rule sets. In contrast, data-driven approaches are robust and the procedure for model building is usually simple. However, the lack of data in an application context limits the ability to build data-driven models, especially in multimodal systems. We also present methods that reuse data from different domains and investigate the limits of such models in the context of an application domain.

查看原文本刊更多论文

在多模态会话系统中平衡数据驱动和基于规则的方法

我们解决了结合数据驱动和基于语法的模型来快速构建多模态会话系统的问题。用于识别和理解的中等大小的基于规则的口语模型很容易开发，并提供快速构建会话应用程序原型的能力。然而，由于编写和维护规则集的高昂成本，以及由于规则集缺乏覆盖而不可避免的脆弱性，此类系统的可伸缩性是一个瓶颈。相反，数据驱动的方法是健壮的，模型构建的过程通常很简单。然而，在应用程序上下文中缺少数据限制了构建数据驱动模型的能力，特别是在多模态系统中。我们还提出了重用来自不同领域的数据的方法，并研究了这些模型在应用领域上下文中的局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)

自引率

0.00%

发文量