Infusing perceptual expertise and domain knowledge into a human-centered image retrieval system: a prototype application

Proceedings of the Symposium on Eye Tracking Research and Applications Pub Date : 2014-03-26 DOI:10.1145/2578153.2578196

Xuan Guo, Rui Li, Cecilia Ovesdotter Alm, Qi Yu, J. Pelz, P. Shi, Anne R. Haake

{"title":"Infusing perceptual expertise and domain knowledge into a human-centered image retrieval system: a prototype application","authors":"Xuan Guo, Rui Li, Cecilia Ovesdotter Alm, Qi Yu, J. Pelz, P. Shi, Anne R. Haake","doi":"10.1145/2578153.2578196","DOIUrl":null,"url":null,"abstract":"Traditional content-based image retrieval techniques, which primarily rely on image content at the pixel level, are not effective in accessing images at the semantic level. Defining approaches to incorporate experts' perceptual and conceptual capabilities of image understanding in their domain of expertise into the retrieval processes promises to help bridge this semantic gap. Towards accomplishing this, we design and implement a novel multimodal interactive system for image retrieval. To incorporate human expertise, the system stores expert-derived information extracted from two human sensor modalities that intuitively relate to image search, eye movements and verbal descriptions, both generated by medical experts. Experimental evaluation of the system shows that by transferring experts' perceptual expertise and domain knowledge into image-based computational procedures, our system can take advantage of the different human-centered modalities' respective strengths and improve the retrieval performance over just using image-based features.","PeriodicalId":142459,"journal":{"name":"Proceedings of the Symposium on Eye Tracking Research and Applications","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Symposium on Eye Tracking Research and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2578153.2578196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

Traditional content-based image retrieval techniques, which primarily rely on image content at the pixel level, are not effective in accessing images at the semantic level. Defining approaches to incorporate experts' perceptual and conceptual capabilities of image understanding in their domain of expertise into the retrieval processes promises to help bridge this semantic gap. Towards accomplishing this, we design and implement a novel multimodal interactive system for image retrieval. To incorporate human expertise, the system stores expert-derived information extracted from two human sensor modalities that intuitively relate to image search, eye movements and verbal descriptions, both generated by medical experts. Experimental evaluation of the system shows that by transferring experts' perceptual expertise and domain knowledge into image-based computational procedures, our system can take advantage of the different human-centered modalities' respective strengths and improve the retrieval performance over just using image-based features.

查看原文本刊更多论文

将感性专业知识和领域知识注入以人为中心的图像检索系统:一个原型应用

传统的基于内容的图像检索技术主要依赖于像素级的图像内容，在语义级的图像检索中效果不佳。定义将专家在其专业领域的图像理解的感知和概念能力纳入检索过程的方法，有望帮助弥合这种语义差距。为了实现这一目标，我们设计并实现了一个新的多模态图像检索交互系统。为了结合人类的专业知识，该系统存储了从两种人类传感器模式中提取的专家衍生信息，这些信息直观地与图像搜索、眼球运动和语言描述相关，都是由医学专家生成的。实验结果表明，通过将专家的感知专业知识和领域知识转化为基于图像的计算过程，我们的系统可以利用不同的以人为中心的模式各自的优势，比仅使用基于图像的特征提高检索性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Symposium on Eye Tracking Research and Applications

自引率

0.00%

发文量