ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891966
M. Voit, R. Stiefelhagen
{"title":"3D user-perspective, voxel-based estimation of visual focus of attention in dynamic meeting scenarios","authors":"M. Voit, R. Stiefelhagen","doi":"10.1145/1891903.1891966","DOIUrl":"https://doi.org/10.1145/1891903.1891966","url":null,"abstract":"In this paper we present a new framework for the online estimation of people's visual focus of attention from their head poses in dynamic meeting scenarios. We describe a voxel based approach to reconstruct the scene composition from an observer's perspective, in order to integrate occlusion handling and visibility verification. The observer's perspective is thereby simulated with live head pose tracking over four far-field views from the room's upper corners. We integrate motion and speech activity as further scene observations in a Bayesian Surprise framework to model prior attractors of attention within the situation's context. As evaluations on a dedicated dataset with 10 meeting videos show, this allows us to predict a meeting participant's focus of attention correctly in up to 72.2% of all frames.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125536154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891944
Stefanie Tellex, T. Kollar, George Shaw, N. Roy, D. Roy
{"title":"Grounding spatial language for video search","authors":"Stefanie Tellex, T. Kollar, George Shaw, N. Roy, D. Roy","doi":"10.1145/1891903.1891944","DOIUrl":"https://doi.org/10.1145/1891903.1891944","url":null,"abstract":"The ability to find a video clip that matches a natural language description of an event would enable intuitive search of large databases of surveillance video. We present a mechanism for connecting a spatial language query to a video clip corresponding to the query. The system can retrieve video clips matching millions of potential queries that describe complex events in video such as \"people walking from the hallway door, around the island, to the kitchen sink.\" By breaking down the query into a sequence of independent structured clauses and modeling the meaning of each component of the structure separately, we are able to improve on previous approaches to video retrieval by finding clips that match much longer and more complex queries using a rich set of spatial relations such as \"down\" and \"past.\" We present a rigorous analysis of the system's performance, based on a large corpus of task-constrained language collected from fourteen subjects. Using this corpus, we show that the system effectively retrieves clips that match natural language descriptions: 58.3% were ranked in the top two of ten in a retrieval task. Furthermore, we show that spatial relations play an important role in the system's performance.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131165400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891929
Koji Kamei, K. Shinozawa, Tetsushi Ikeda, A. Utsumi, T. Miyashita, N. Hagita
{"title":"Recommendation from robots in a real-world retail shop","authors":"Koji Kamei, K. Shinozawa, Tetsushi Ikeda, A. Utsumi, T. Miyashita, N. Hagita","doi":"10.1145/1891903.1891929","DOIUrl":"https://doi.org/10.1145/1891903.1891929","url":null,"abstract":"By applying network robot technologies, recommendation methods from E-Commerce are incorporated in a retail shop in the real world. We constructed an experimental shop environment where communication robots recommend specific items to the customers according to their purchasing behavior as observed by networked sensors. A recommendation scenario is implemented with three robots and investigated through an experiment. The results indicate that the participants stayed longer in front of the shelves when the communication robots tried to interact with them and were influenced to carry out similar purchasing behaviors as those observed earlier. Other results suggest that the probability of customers' zone transition can be used to anticipate their purchasing behavior.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116381431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891948
Myunghee Lee, G. Kim
{"title":"Empathetic video experience through timely multimodal interaction","authors":"Myunghee Lee, G. Kim","doi":"10.1145/1891903.1891948","DOIUrl":"https://doi.org/10.1145/1891903.1891948","url":null,"abstract":"In this paper, we describe a video playing system, named \"Empatheater,\" that is controlled by multimodal interaction. As the video is played, the user must interact and emulate predefined video \"events\" through multimodal guidance and whole body interaction (e.g. following the main character's motion or gestures). Without the timely interaction, the video stops. The system shows guidance information as how to properly react and continue the video playing. The purpose of such a system is to provide indirect experience (of the given video content) by eliciting the user to mimic and empathize with the main character. The user is given the illusion (suspended disbelief) of playing an active role in the unraveling video content. We discuss various features of the newly proposed interactive medium. In addition, we report on the results of the pilot study that was carried out to evaluate its user experience compared to passive video viewing and keyboard based video control.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128594165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891957
Masahiro Tada, H. Noma, K. Renge
{"title":"Evidence-based automated traffic hazard zone mapping using wearable sensors","authors":"Masahiro Tada, H. Noma, K. Renge","doi":"10.1145/1891903.1891957","DOIUrl":"https://doi.org/10.1145/1891903.1891957","url":null,"abstract":"Recently, underestimating traffic condition risk is considered one of the biggest reasons for traffic accidents. In this paper, we proposed evidence-based automatic hazard zone mapping method using wearable sensors. Here, we measure driver's behavior using three-axis gyro sensors. Analyzing the measured motion data, proposed method can label characteristic motion that is observed at hazard zone. We gathered motion data sets form two types of driver, i.e., an instructor of driving school and an ordinary driver, then, tried to generate traffic hazard zone map focused on difference of the motions. Through the experiment in public road, we confirmed our method allows to extract hazard zone.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128421379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891918
Luis Rodríguez, I. García-Varea, Alejandro Revuelta-Martínez, E. Vidal
{"title":"A multimodal interactive text generation system","authors":"Luis Rodríguez, I. García-Varea, Alejandro Revuelta-Martínez, E. Vidal","doi":"10.1145/1891903.1891918","DOIUrl":"https://doi.org/10.1145/1891903.1891918","url":null,"abstract":"We present an interactive text generation system aimed at providing assistance for text typing in different environments. This system works by predicting what the user is going to type based on the text he or she typed previously. A multimodal interface is included, intended to facilitate the text generation in constrained environments. The prototype is designed following a modular client-server architecture to provide a high flexibility.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114635072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cognitive skills learning: pen input patterns in computer-based athlete training","authors":"Natalie Ruiz, Qian Qian Feng, R. Taib, Tara Handke, Fang Chen","doi":"10.1145/1891903.1891955","DOIUrl":"https://doi.org/10.1145/1891903.1891955","url":null,"abstract":"In this paper, we describe a longitudinal user study with athletes using a cognitive training tool, equipped with an interactive pen interface, and think-aloud protocols. The aim is to verify whether cognitive load can be inferred directly from changes in geometric and temporal features of the pen trajectories. We compare trajectories across cognitive load levels and overall Pre and Post training tests. The results show trajectory durations and lengths decrease while speeds increase, all significantly, as cognitive load increases. These changes are attributed to mechanisms for dealing with high cognitive load in working memory, with minimal rehearsal. With more expertise, trajectory durations further decrease and speeds further increase, which is attributed in part to cognitive skill acquisition and to schema development, both in extraneous and intrinsic networks, between Pre and Post tests. As such, these pen trajectory features offer insight into implicit communicative changes related to load fluctuations.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131607583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891937
Peng-Wen Chen, Snehal Kumar Chennuru, S. Buthpitiya, Y. Zhang
{"title":"A language-based approach to indexing heterogeneous multimedia lifelog","authors":"Peng-Wen Chen, Snehal Kumar Chennuru, S. Buthpitiya, Y. Zhang","doi":"10.1145/1891903.1891937","DOIUrl":"https://doi.org/10.1145/1891903.1891937","url":null,"abstract":"Lifelog systems, inspired by Vannevar Bush's concept of \"MEMory EXtenders\" (MEMEX), are capable of storing a person's lifetime experience as a multimedia database. Despite such systems' huge potential for improving people's everyday life, there are major challenges that need to be addressed to make such systems practical. One of them is how to index the inherently large and heterogeneous lifelog data so that a person can efficiently retrieve the log segments that are of interest. In this paper, we present a novel approach to indexing lifelogs using activity language. By quantizing the heterogeneous high dimensional sensory data into text representation, we are able to apply statistical natural language processing techniques to index, recognize, segment, cluster, retrieve, and infer high-level semantic meanings of the collected lifelogs. Based on this indexing approach, our lifelog system supports easy retrieval of log segments representing past similar activities and generation of salient summaries serving as overviews of segments.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117179243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891950
Juan Cheng, Xiang Chen, Zhiyuan Lu, Kongqiao Wang, M. Shen
{"title":"Key-press gestures recognition and interaction based on SEMG signals","authors":"Juan Cheng, Xiang Chen, Zhiyuan Lu, Kongqiao Wang, M. Shen","doi":"10.1145/1891903.1891950","DOIUrl":"https://doi.org/10.1145/1891903.1891950","url":null,"abstract":"This article conducted research on the pattern recognition of keypress finger gestures based on surface electromyographic (SEMG) signals and the feasibility of key -press gestures for interaction application. Two sort of recognition experiments were designed firstly to explore the feasibility and repeatability of the SEMG -based classification of 1 6 key-press finger gestures relating to right hand and 4 control gestures, and the key -press gestures were defined referring to the standard PC key board. Based on the experimental results, 10 quite well recognized key -press gestures were selected as numeric input keys of a simulated phone, and the 4 control gestures were mapped to 4 control keys. Then two types of use tests, namely volume setting and SMS sending were conducted to survey the gesture-base interaction performance and user's attitude to this technique, and the test results showed that users could accept this novel input strategy with fresh experience.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"250 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124751540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICMI-MLMI '10 | Pub Date: 2010-11-08 | DOI: 10.1145/1891903.1891915
Nikolaus Bee, J. Wagner, E. André, Thurid Vogt, Fred Charles, D. Pizzi, M. Cavazza
{"title":"Discovering eye gaze behavior during human-agent conversation in an interactive storytelling application","authors":"Nikolaus Bee, J. Wagner, E. André, Thurid Vogt, Fred Charles, D. Pizzi, M. Cavazza","doi":"10.1145/1891903.1891915","DOIUrl":"https://doi.org/10.1145/1891903.1891915","url":null,"abstract":"In this paper, we investigate the user's eye gaze behavior during the conversation with an interactive storytelling application. We present an interactive eye gaze model for embodied conversational agents in order to improve the experience of users participating in Interactive Storytelling. The underlying narrative in which the approach was tested is based on a classical XIXth century psychological novel: Madame Bovary, by Flaubert. At various stages of the narrative, the user can address the main character or respond to her using free-style spoken natural language input, impersonating her lover. An eye tracker was connected to enable the interactive gaze model to respond to user's current gaze (i.e. looking into the virtual character's eyes or not). We conducted a study with 19 students where we compared our interactive eye gaze model with a non-interactive eye gaze model that was informed by studies of human gaze behaviors, but had no information on where the user was looking. The interactive model achieved a higher score for user ratings than the non-interactive model. In addition we analyzed the users' gaze behavior during the conversation with the virtual character.","PeriodicalId":181145,"journal":{"name":"ICMI-MLMI '10","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122054781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}