The added value of multimodality in the NESPOLE! speech-to-speech translation system: an experimental study

Proceedings. Fourth IEEE International Conference on Multimodal Interfaces. Pub Date: 2002-10-14. DOI: 10.1109/ICMI.2002.1166999

E. Costantini, F. Pianesi, Susanne Burger

Citations: 9

Abstract

Multimodal interfaces, which combine two or more input modes (speech, pen, touch, ...), are expected to be more efficient, natural and usable than single-input interfaces. However, the advantage of multimodal input has only been ascertained in highly controlled experimental conditions (S.L. Oviatt, 1997; 1999); in particular, we lack data about what happens with "real" human-human, multilingual communication systems. We discuss the results of an experiment that evaluates the added value of multimodality in a "true" speech-to-speech translation system, the NESPOLE! system. NESPOLE! provides multilingual and multimodal communication in the tourism domain, allowing users to interact over the Internet by sharing maps, Web pages and pen-based gestures. We compared two experimental conditions that differ in whether multimodal resources were available: a speech-only condition (SO) and a multimodal condition (MM). Most of the data show tendencies for MM to be better than SO.


