在多模态会话系统中平衡数据驱动和基于规则的方法

2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721) Pub Date : 2003-11-30 DOI:10.1109/ASRU.2003.1318444

S. Bangalore, Michael Johnston

{"title":"在多模态会话系统中平衡数据驱动和基于规则的方法","authors":"S. Bangalore, Michael Johnston","doi":"10.1109/ASRU.2003.1318444","DOIUrl":null,"url":null,"abstract":"We address the issue of combining data-driven and grammar-based models for rapid prototyping of a multimodal conversational system. Moderate-sized rule-based spoken language models for recognition and understanding are easy to develop and provide the ability to prototype conversational applications rapidly. However, scalability of such systems is a bottleneck due to the heavy cost of authoring and maintenance of rule sets and inevitable brittleness due to lack of coverage in the rule sets. In contrast, data-driven approaches are robust and the procedure for model building is usually simple. However, the lack of data in an application context limits the ability to build data-driven models, especially in multimodal systems. We also present methods that reuse data from different domains and investigate the limits of such models in the context of an application domain.","PeriodicalId":394174,"journal":{"name":"2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":"{\"title\":\"Balancing data-driven and rule-based approaches in the context of a Multimodal Conversational System\",\"authors\":\"S. Bangalore, Michael Johnston\",\"doi\":\"10.1109/ASRU.2003.1318444\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We address the issue of combining data-driven and grammar-based models for rapid prototyping of a multimodal conversational system. Moderate-sized rule-based spoken language models for recognition and understanding are easy to develop and provide the ability to prototype conversational applications rapidly. However, scalability of such systems is a bottleneck due to the heavy cost of authoring and maintenance of rule sets and inevitable brittleness due to lack of coverage in the rule sets. In contrast, data-driven approaches are robust and the procedure for model building is usually simple. However, the lack of data in an application context limits the ability to build data-driven models, especially in multimodal systems. We also present methods that reuse data from different domains and investigate the limits of such models in the context of an application domain.\",\"PeriodicalId\":394174,\"journal\":{\"name\":\"2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"35\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASRU.2003.1318444\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASRU.2003.1318444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 35

摘要

我们解决了结合数据驱动和基于语法的模型来快速构建多模态会话系统的问题。用于识别和理解的中等大小的基于规则的口语模型很容易开发，并提供快速构建会话应用程序原型的能力。然而，由于编写和维护规则集的高昂成本，以及由于规则集缺乏覆盖而不可避免的脆弱性，此类系统的可伸缩性是一个瓶颈。相反，数据驱动的方法是健壮的，模型构建的过程通常很简单。然而，在应用程序上下文中缺少数据限制了构建数据驱动模型的能力，特别是在多模态系统中。我们还提出了重用来自不同领域的数据的方法，并研究了这些模型在应用领域上下文中的局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Balancing data-driven and rule-based approaches in the context of a Multimodal Conversational System

We address the issue of combining data-driven and grammar-based models for rapid prototyping of a multimodal conversational system. Moderate-sized rule-based spoken language models for recognition and understanding are easy to develop and provide the ability to prototype conversational applications rapidly. However, scalability of such systems is a bottleneck due to the heavy cost of authoring and maintenance of rule sets and inevitable brittleness due to lack of coverage in the rule sets. In contrast, data-driven approaches are robust and the procedure for model building is usually simple. However, the lack of data in an application context limits the ability to build data-driven models, especially in multimodal systems. We also present methods that reuse data from different domains and investigate the limits of such models in the context of an application domain.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)

自引率

0.00%

发文量