{"title":"Context-sensitive help for multimodal dialogue","authors":"H. Hastie, Michael Johnston, Patrick Ehlen","doi":"10.1109/ICMI.2002.1166975","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1166975","url":null,"abstract":"Multimodal interfaces offer users unprecedented flexibility in choosing a style of interaction. However, users are frequently unaware of or forget shorter or more effective multimodal or pen-based commands. This paper describes a working help system that leverages the capabilities of a multimodal interface in order to provide targeted, unobtrusive, context-sensitive help. This multimodal help system guides the user to the most effective way to specify a request, providing transferable knowledge that can be used in future requests without repeatedly invoking the help system.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128073931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Talking heads: which matching between faces and synthetic voices?","authors":"Marc Mersiol, N. Chateau, V. Maffiolo","doi":"10.1109/ICMI.2002.1166971","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1166971","url":null,"abstract":"The integration of synthetic faces and text-to-speech voice synthesis (what we call \"talking heads\") allows new applications in the area of man-machine interfaces. In the near future, talking heads might be useful communicative interface agents. But before making an extensive use of talking heads, several issues have to be checked according to their acceptability by users. An important issue is to make sure that the used synthetic voices match their faces. The scope of this paper is to study the coherence that might exist between synthetic voices and faces. Twenty-four subjects rated the coherence of all the combinations between ten faces and six voices. The main results of this paper show that not all associations between faces and voices are relevant and that some associations are better rated than others according to qualitative criteria.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128135886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data driven design of an ANN/HMM system for on-line unconstrained handwritten character recognition","authors":"Haifeng Li, T. Artières, P. Gallinari","doi":"10.1109/ICMI.2002.1166984","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1166984","url":null,"abstract":"This paper is dedicated to a data driven design method for a hybrid ANN/HMM based handwriting recognition system. On one hand, a data driven designed neural modelling of handwriting primitives is proposed. ANNs are firstly used as state models in a HMM primitive divider that associates each signal frame with an ANN by minimizing the accumulated prediction error. Then, the neural modelling is realized by training each network on its own frame set. Organizing these two steps in an EM algorithm, precise primitive models are obtained. On the other hand, a data driven systematic method is proposed for the HMM topology inference task. All possible prototypes of a pattern class are firstly merged into several clusters by a tabu search aided clustering algorithm. Then a multiple parallel-path HMM is constructed for the pattern class. Experiments prove an 8% recognition improvement with a saving of 50% of system resources, compared to an intuitively designed referential ANN/HMM system.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133516149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-modal embodied agents scripting","authors":"Y. Arafa, A. Mamdani","doi":"10.1109/ICMI.2002.1167038","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1167038","url":null,"abstract":"Embodied agents present ongoing challenging agenda for research in multi-modal user interfaces and human-computer-interaction. Such agent metaphors will only be widely applicable to online applications when there is a standardised way to map underlying engines with the visual presentation of the agents. This paper delineates the functions and specifications of a mark-up language for scripting the animation of virtual characters. The language is called: Character Mark-up Language (CML) and is an XML-based character attribute definition and animation scripting language designed to aid in the rapid incorporation of lifelike characters/agents into online applications or virtual reality worlds. This multi-modal scripting language is designed to be easily understandable by human animators and easily generated by a software process such as software agents. CML is constructed based jointly on motion and multi-modal capabilities of virtual life-like figures. The paper further illustrates the constructs of the language and describes a real-time execution architecture that demonstrates the use of such a language as a 4G language to easily utilise and integrate MPEG-4 media objects in online interfaces and virtual environments.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114442807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The NESPOLE! multimodal interface for cross-lingual communication $experience and lessons learned","authors":"Loredana Taddei, E. Costantini, A. Lavie","doi":"10.1109/ICMI.2002.1166997","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1166997","url":null,"abstract":"We describe the design, evolution, and development of the user interface components of the NESPOLE! speech-to-speech translation system. The NESPOLE! system was designed for users with medium-to-low levels of computer literacy and Web expertise. The user interface was designed to effectively combine Web browsing, real-time sharing of graphical information and multi-modal annotations using a shared whiteboard, and real-time multilingual speech communication, all within an e-commerce scenario. Data collected in sessions with naive users in several stages in the process of system development formed the basis for improving the effectiveness and usability of the system. We describe this development process, the resulting interface components and the lessons learned.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117157605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Parallel computing-based architecture for mixed-initiative spoken dialogue","authors":"Ryuta Taguma, T. Moriyama, K. Iwano, S. Furui","doi":"10.1109/ICMI.2002.1166968","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1166968","url":null,"abstract":"This paper describes a new method of implementing mixed-initiative spoken dialogue systems based on parallel computing architecture. In a mixed-initiative dialogue, the user as well as the system needs to be capable of controlling the dialogue sequence. In our implementation, various language models corresponding to different dialogue contents, such as requests for information or replies to the system, are built and multiple recognizers using these language models are driven under a parallel computing architecture. The dialogue content of the user is automatically detected based on likelihood scores given by the recognizers, and the content is used to build the dialogue. A transitional probability from one dialogue state uttering a kind of content to another state uttering a different content is incorporated into the likelihood score. A flexible dialogue structure that gives users the initiative to control the dialogue is implemented by this architecture. Real-time dialogue systems for retrieving information about restaurants and food stores are built and evaluated in terms of dialogue content identification rate and keyword accuracy. The proposed architecture has the advantage that the dialogue system can be easily modified without remaking the whole language model.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124688203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Do multimodal signals need to come from the same place? Crossmodal attentional links between proximal and distal surfaces","authors":"R. Gray, H. Tan, J. Young","doi":"10.1109/ICMI.2002.1167035","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1167035","url":null,"abstract":"Previous research has shown that the use of multimodal signals can lead to faster and more accurate responses compared to purely unimodal displays. However, in most cases response facilitation only occurs when the signals are presented in roughly the same spatial location. This would suggest a severe restriction on interface designers: to use multimodal displays effectively all signals must be presented from the same location on the display. We previously reported evidence that the use of haptic cues may provide a solution to this problem as haptic cues presented to a user's back can be used to redirect visual attention to locations on a screen in front of the user (Tan et al., 2001). In the present experiment we used a visual change detection task to investigate whether (i) this type of visual-haptic interaction is robust at low cue validity rates and (ii) similar effects occur for auditory cues. Valid haptic cues resulted in significantly faster change detection times even when they accurately indicated the location of the change on only 20% of the trials. Auditory cues had a much smaller effect on detection times at the high validity rate (80%) than haptic cues and did not significantly improve performance at the 20% validity rate. These results suggest that the use haptic attentional cues may be particularly effective in environments in which information cannot be presented in the same spatial location.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127275694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The added value of multimodality in the NESPOLE! speech-to-speech translation system: an experimental study","authors":"E. Costantini, F. Pianesi, Susanne Burger","doi":"10.1109/ICMI.2002.1166999","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1166999","url":null,"abstract":"Multimodal interfaces, which combine two or more input modes (speech, pen, touch...), are expected to be more efficient, natural and usable than single-input interfaces. However, the advantage of multimodal input has only been ascertained in highly controlled experimental conditions (S.L. Oviatt, 1997; 1999); in particular, we lack data about what happens with \"real\" human-human, multilingual communication systems. We discuss the results of an experiment aiming to evaluate the added value of multimodality in a \"true\" speech-to-speech translation system, the NESPOLE! system, which provides for multilingual and multimodal communication in the tourism domain, allowing users to interact through the Internet sharing maps, Web-pages and pen-based gestures. We compared two experimental conditions differing as to whether multimodal resources were available: a speech-only condition (SO), and a multimodal condition (MM). Most of the data show tendencies for MM to be better than SO.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120947922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A real-time framework for natural multimodal interaction with large screen displays","authors":"N. Krahnstoever, S. Kettebekov, M. Yeasin, Rajeev Sharma","doi":"10.1109/ICMI.2002.1167020","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1167020","url":null,"abstract":"This paper presents a framework for designing a natural multimodal human computer interaction (HCI) system. The core of the proposed framework is a principled method for combining information derived from audio and visual cues. To achieve natural interaction, both audio and visual modalities are fused along with feedback through a large screen display. Careful design along with due considerations of possible aspects of a systems interaction cycle and integration has resulted in a successful system. The performance of the proposed framework has been validated through the development of several prototype systems as well as commercial applications for the retail and entertainment industry. To assess the impact of these multimodal systems (MMS), informal studies have been conducted. It was found that the system performed according to its specifications in 95% of the cases and that users showed ad-hoc proficiency, indicating natural acceptance of such systems.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127099182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Achieving real-time lip-synch via SVM-based phoneme classification and lip shape refinement","authors":"Taeyoon Kim, Yongsung Kang, Hanseok Ko","doi":"10.1109/ICMI.2002.1167010","DOIUrl":"https://doi.org/10.1109/ICMI.2002.1167010","url":null,"abstract":"In this paper, we develop a real time lip-synch system that activates a 2D avatar's lip motion in synch with incoming speech utterance. To realize \"real time\" operation of the system, we contain the processing time by invoking a merge and split procedure performing coarse-to-fine phoneme classification. At each stage of phoneme classification, we apply a support vector machine (SVM) to constrain the computational load while attaining desirable accuracy. Coarse-to-fine phoneme classification is accomplished via 2 stages of feature extraction, where each speech frame is acoustically analyzed first for 3 classes of lip opening using MFCC as the feature and then a further refined classification for detailed lip shape using formant information. We implemented the system with 2D lip animation that shows the effectiveness of the proposed 2-stage procedure accomplishing the real-time lip-synch task.","PeriodicalId":208377,"journal":{"name":"Proceedings. Fourth IEEE International Conference on Multimodal Interfaces","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126606948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}