Virtually augmenting hundreds of real pictures: An approach based on learning, retrieval, and tracking

Julien Pilet, H. Saito
{"title":"Virtually augmenting hundreds of real pictures: An approach based on learning, retrieval, and tracking","authors":"Julien Pilet, H. Saito","doi":"10.1109/VR.2010.5444811","DOIUrl":null,"url":null,"abstract":"Tracking is a major issue of virtual and augmented reality applications. Single object tracking on monocular video streams is fairly well understood. However, when it comes to multiple objects, existing methods lack scalability and can recognize only a limited number of objects. Thanks to recent progress in feature matching, state-of-the-art image retrieval techniques can deal with millions of images. However, these methods do not focus on real-time video processing and can not track retrieved objects. In this paper, we present a method that combines the speed and accuracy of tracking with the scalability of image retrieval. At the heart of our approach is a bi-layer clustering process that allows our system to index and retrieve objects based on tracks of features, thereby effectively summarizing the information available on multiple video frames. As a result, our system is able to track in real-time multiple objects, recognized with low delay from a database of more than 300 entries.","PeriodicalId":151060,"journal":{"name":"2010 IEEE Virtual Reality Conference (VR)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Virtual Reality Conference (VR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VR.2010.5444811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 35

Abstract

Tracking is a major issue of virtual and augmented reality applications. Single object tracking on monocular video streams is fairly well understood. However, when it comes to multiple objects, existing methods lack scalability and can recognize only a limited number of objects. Thanks to recent progress in feature matching, state-of-the-art image retrieval techniques can deal with millions of images. However, these methods do not focus on real-time video processing and can not track retrieved objects. In this paper, we present a method that combines the speed and accuracy of tracking with the scalability of image retrieval. At the heart of our approach is a bi-layer clustering process that allows our system to index and retrieve objects based on tracks of features, thereby effectively summarizing the information available on multiple video frames. As a result, our system is able to track in real-time multiple objects, recognized with low delay from a database of more than 300 entries.
虚拟增强数百张真实图片:一种基于学习、检索和跟踪的方法
跟踪是虚拟现实和增强现实应用中的一个主要问题。单目视频流上的单目标跟踪是相当容易理解的。然而,当涉及到多个对象时,现有的方法缺乏可伸缩性,只能识别有限数量的对象。由于特征匹配的最新进展,最先进的图像检索技术可以处理数以百万计的图像。然而,这些方法不关注实时视频处理,不能跟踪检索到的对象。在本文中,我们提出了一种将跟踪的速度和准确性与图像检索的可扩展性相结合的方法。我们方法的核心是一个双层聚类过程,它允许我们的系统根据特征的轨迹索引和检索对象,从而有效地总结多个视频帧上可用的信息。因此,我们的系统能够实时跟踪多个对象,以低延迟从超过300个条目的数据库中识别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信