大规模多媒体检索中的人机合作研究

Kimiaki Shirahama, M. Grzegorzek, B. Indurkhya
{"title":"大规模多媒体检索中的人机合作研究","authors":"Kimiaki Shirahama, M. Grzegorzek, B. Indurkhya","doi":"10.7771/1932-6246.1173","DOIUrl":null,"url":null,"abstract":"Large-Scale Multimedia Retrieval (LSMR) is the task to fast analyze a large amount of multimedia data like images or videos and accurately find the ones relevant to a certain semantic meaning. Although LSMR has been investigated for more than two decades in the fields of multimedia processing and computer vision, a more interdisciplinary approach is necessary to develop an LSMR system that is really meaningful for humans. To this end, this paper aims to stimulate attention to the LSMR problem from diverse research fields. By explaining basic terminologies in LSMR, we first survey several representative methods in chronological order. This reveals that due to prioritizing the generality and scalability for large-scale data, recent methods interpret semantic meanings with a completely different mechanism from humans, though such humanlike mechanisms were used in classical heuristic-based methods. Based on this, we discuss human-machine cooperation , which incorporates knowledge about human interpretation into LSMR without sacrificing the generality and scalability. In particular, we present three approaches to human-machine cooperation ( cognitive , ontological , and adaptive ), which are attributed to cognitive science, ontology engineering, and metacognition, respectively. We hope that this paper will create a bridge to enable researchers in different fields to communicate about the LSMR problem and lead to a ground-break-ing next generation of LSMR systems.","PeriodicalId":90070,"journal":{"name":"The journal of problem solving","volume":"30 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Human-Machine Cooperation in Large-Scale Multimedia Retrieval: A Survey\",\"authors\":\"Kimiaki Shirahama, M. Grzegorzek, B. Indurkhya\",\"doi\":\"10.7771/1932-6246.1173\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Large-Scale Multimedia Retrieval (LSMR) is the task to fast analyze a large amount of multimedia data like images or videos and accurately find the ones relevant to a certain semantic meaning. Although LSMR has been investigated for more than two decades in the fields of multimedia processing and computer vision, a more interdisciplinary approach is necessary to develop an LSMR system that is really meaningful for humans. To this end, this paper aims to stimulate attention to the LSMR problem from diverse research fields. By explaining basic terminologies in LSMR, we first survey several representative methods in chronological order. This reveals that due to prioritizing the generality and scalability for large-scale data, recent methods interpret semantic meanings with a completely different mechanism from humans, though such humanlike mechanisms were used in classical heuristic-based methods. Based on this, we discuss human-machine cooperation , which incorporates knowledge about human interpretation into LSMR without sacrificing the generality and scalability. In particular, we present three approaches to human-machine cooperation ( cognitive , ontological , and adaptive ), which are attributed to cognitive science, ontology engineering, and metacognition, respectively. We hope that this paper will create a bridge to enable researchers in different fields to communicate about the LSMR problem and lead to a ground-break-ing next generation of LSMR systems.\",\"PeriodicalId\":90070,\"journal\":{\"name\":\"The journal of problem solving\",\"volume\":\"30 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The journal of problem solving\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7771/1932-6246.1173\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of problem solving","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7771/1932-6246.1173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

大规模多媒体检索(large - scale Multimedia Retrieval, LSMR)是对图像或视频等大量多媒体数据进行快速分析,并准确地找到与某一语义相关的数据的一种任务。尽管LSMR已经在多媒体处理和计算机视觉领域进行了二十多年的研究,但要开发一个对人类真正有意义的LSMR系统,还需要一个更加跨学科的方法。为此,本文旨在激发不同研究领域对LSMR问题的关注。通过解释LSMR中的基本术语,我们首先按时间顺序调查了几种有代表性的方法。这表明,由于优先考虑大规模数据的通用性和可扩展性,尽管在经典的基于启发式的方法中使用了类似人类的机制,但最近的方法使用了与人类完全不同的机制来解释语义。在此基础上,我们讨论了人机协作,在不牺牲通用性和可扩展性的前提下,将人工翻译知识融入LSMR。特别地,我们提出了三种人机合作的方法(认知、本体论和自适应),它们分别归因于认知科学、本体工程和元认知。我们希望这篇论文能建立一个桥梁,使不同领域的研究人员就LSMR问题进行交流,并导致突破性的下一代LSMR系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Human-Machine Cooperation in Large-Scale Multimedia Retrieval: A Survey
Large-Scale Multimedia Retrieval (LSMR) is the task to fast analyze a large amount of multimedia data like images or videos and accurately find the ones relevant to a certain semantic meaning. Although LSMR has been investigated for more than two decades in the fields of multimedia processing and computer vision, a more interdisciplinary approach is necessary to develop an LSMR system that is really meaningful for humans. To this end, this paper aims to stimulate attention to the LSMR problem from diverse research fields. By explaining basic terminologies in LSMR, we first survey several representative methods in chronological order. This reveals that due to prioritizing the generality and scalability for large-scale data, recent methods interpret semantic meanings with a completely different mechanism from humans, though such humanlike mechanisms were used in classical heuristic-based methods. Based on this, we discuss human-machine cooperation , which incorporates knowledge about human interpretation into LSMR without sacrificing the generality and scalability. In particular, we present three approaches to human-machine cooperation ( cognitive , ontological , and adaptive ), which are attributed to cognitive science, ontology engineering, and metacognition, respectively. We hope that this paper will create a bridge to enable researchers in different fields to communicate about the LSMR problem and lead to a ground-break-ing next generation of LSMR systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信