图像+数据库≠图像数据库

R. Mohr
{"title":"图像+数据库≠图像数据库","authors":"R. Mohr","doi":"10.1145/1039470.1039472","DOIUrl":null,"url":null,"abstract":"For decades, computer vision researchers have been trying to extract high level information from images. While the semantics of images is still unreachable from the signal in most real cases, users would like to express requests to image data bases using high-level queries. This gap between the user needs and the image processing capabilities will limit the use of image databases in the near to mid term future.Meaningful applications, however, are already possible using existing scientific technology, for instance using query-by-example. The scalability of such applications stresses the need for: new indexing methods able to handle approximate measures from the image signal; approximate search methods that are efficient in high dimensional spaces; and robust search methods able to handle many partially erroneous data (outliers).The tutorial will illustrate some limited answers to these open problems using invariant features, robust statistics and probabilistic matching. It will then focus on the long term goal of high level semantics extraction from images. This problem is as yet poorly defined: the semantics of an image is user dependent and nobody knows how to express it in a formal way. Some limited answers exist, however, and the tutorial will illustrate how learning mechanisms provide impressive initial results. Moreover learning can be linked to relevance feedback and therefore allows performing better user dependent search.","PeriodicalId":346313,"journal":{"name":"Computer Vision meets Databases","volume":"21 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image + database ≠ image database\",\"authors\":\"R. Mohr\",\"doi\":\"10.1145/1039470.1039472\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For decades, computer vision researchers have been trying to extract high level information from images. While the semantics of images is still unreachable from the signal in most real cases, users would like to express requests to image data bases using high-level queries. This gap between the user needs and the image processing capabilities will limit the use of image databases in the near to mid term future.Meaningful applications, however, are already possible using existing scientific technology, for instance using query-by-example. The scalability of such applications stresses the need for: new indexing methods able to handle approximate measures from the image signal; approximate search methods that are efficient in high dimensional spaces; and robust search methods able to handle many partially erroneous data (outliers).The tutorial will illustrate some limited answers to these open problems using invariant features, robust statistics and probabilistic matching. It will then focus on the long term goal of high level semantics extraction from images. This problem is as yet poorly defined: the semantics of an image is user dependent and nobody knows how to express it in a formal way. Some limited answers exist, however, and the tutorial will illustrate how learning mechanisms provide impressive initial results. Moreover learning can be linked to relevance feedback and therefore allows performing better user dependent search.\",\"PeriodicalId\":346313,\"journal\":{\"name\":\"Computer Vision meets Databases\",\"volume\":\"21 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Vision meets Databases\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1039470.1039472\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision meets Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1039470.1039472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

几十年来,计算机视觉研究人员一直试图从图像中提取高级信息。虽然在大多数实际情况下,图像的语义仍然无法从信号中获得,但用户希望使用高级查询来表达对图像数据库的请求。用户需求和图像处理能力之间的这种差距将在近期和中期限制图像数据库的使用。然而,使用现有的科学技术,例如使用逐例查询,已经可以实现有意义的应用。这种应用程序的可扩展性强调需要:新的索引方法能够处理来自图像信号的近似度量;在高维空间中高效的近似搜索方法;强大的搜索方法能够处理许多部分错误的数据(异常值)。本教程将使用不变特征、鲁棒统计和概率匹配来说明这些开放问题的一些有限答案。然后,它将专注于从图像中提取高级语义的长期目标。这个问题还没有得到很好的定义:图像的语义依赖于用户,没有人知道如何以正式的方式表达它。然而,存在一些有限的答案,本教程将说明学习机制如何提供令人印象深刻的初步结果。此外,学习可以与相关反馈相关联,因此可以执行更好的用户依赖搜索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Image + database ≠ image database
For decades, computer vision researchers have been trying to extract high level information from images. While the semantics of images is still unreachable from the signal in most real cases, users would like to express requests to image data bases using high-level queries. This gap between the user needs and the image processing capabilities will limit the use of image databases in the near to mid term future.Meaningful applications, however, are already possible using existing scientific technology, for instance using query-by-example. The scalability of such applications stresses the need for: new indexing methods able to handle approximate measures from the image signal; approximate search methods that are efficient in high dimensional spaces; and robust search methods able to handle many partially erroneous data (outliers).The tutorial will illustrate some limited answers to these open problems using invariant features, robust statistics and probabilistic matching. It will then focus on the long term goal of high level semantics extraction from images. This problem is as yet poorly defined: the semantics of an image is user dependent and nobody knows how to express it in a formal way. Some limited answers exist, however, and the tutorial will illustrate how learning mechanisms provide impressive initial results. Moreover learning can be linked to relevance feedback and therefore allows performing better user dependent search.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信