带宽受限分布式摄像机网络中的多视点目标识别

2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC) Pub Date : 2009-10-20 DOI:10.1109/ICDSC.2009.5289410

A. Yang, Subhransu Maji, C. M. Christoudias, Trevor Darrell, Jitendra Malik, S. Sastry

{"title":"带宽受限分布式摄像机网络中的多视点目标识别","authors":"A. Yang, Subhransu Maji, C. M. Christoudias, Trevor Darrell, Jitendra Malik, S. Sastry","doi":"10.1109/ICDSC.2009.5289410","DOIUrl":null,"url":null,"abstract":"In this paper, we study the classical problem of object recognition in low-power, low-bandwidth distributed camera networks. The ability to perform robust object recognition is crucial for applications such as visual surveillance to track and identify objects of interest, and compensate visual nuisances such as occlusion and pose variation between multiple camera views. We propose an effective framework to perform distributed object recognition using a network of smart cameras and a computer as the base station. Due to the limited bandwidth between the cameras and the computer, the method utilizes the available computational power on the smart sensors to locally extract and compress SIFT-type image features to represent individual camera views. In particular, we show that between a network of cameras, high-dimensional SIFT histograms share a joint sparse pattern corresponding to a set of common features in 3-D. Such joint sparse patterns can be explicitly exploited to accurately encode the distributed signal via random projection, which is unsupervised and independent to the sensor modality. On the base station, we study multiple decoding schemes to simultaneously recover the multiple-view object features based on the distributed compressive sensing theory. The system has been implemented on the Berkeley CITRIC smart camera platform. The efficacy of the algorithm is validated through extensive simulation and experiments.","PeriodicalId":324810,"journal":{"name":"2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC)","volume":"591 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"42","resultStr":"{\"title\":\"Multiple-view object recognition in band-limited distributed camera networks\",\"authors\":\"A. Yang, Subhransu Maji, C. M. Christoudias, Trevor Darrell, Jitendra Malik, S. Sastry\",\"doi\":\"10.1109/ICDSC.2009.5289410\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we study the classical problem of object recognition in low-power, low-bandwidth distributed camera networks. The ability to perform robust object recognition is crucial for applications such as visual surveillance to track and identify objects of interest, and compensate visual nuisances such as occlusion and pose variation between multiple camera views. We propose an effective framework to perform distributed object recognition using a network of smart cameras and a computer as the base station. Due to the limited bandwidth between the cameras and the computer, the method utilizes the available computational power on the smart sensors to locally extract and compress SIFT-type image features to represent individual camera views. In particular, we show that between a network of cameras, high-dimensional SIFT histograms share a joint sparse pattern corresponding to a set of common features in 3-D. Such joint sparse patterns can be explicitly exploited to accurately encode the distributed signal via random projection, which is unsupervised and independent to the sensor modality. On the base station, we study multiple decoding schemes to simultaneously recover the multiple-view object features based on the distributed compressive sensing theory. The system has been implemented on the Berkeley CITRIC smart camera platform. The efficacy of the algorithm is validated through extensive simulation and experiments.\",\"PeriodicalId\":324810,\"journal\":{\"name\":\"2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC)\",\"volume\":\"591 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"42\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSC.2009.5289410\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSC.2009.5289410","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 42

摘要

本文研究了低功耗、低带宽分布式摄像机网络中目标识别的经典问题。执行强大的对象识别能力对于诸如跟踪和识别感兴趣的对象的视觉监视等应用至关重要，并补偿视觉干扰，如遮挡和多个摄像机视图之间的姿势变化。我们提出了一个有效的框架来执行分布式目标识别使用智能相机网络和计算机作为基站。由于相机和计算机之间的带宽有限，该方法利用智能传感器上可用的计算能力，局部提取和压缩sift类型的图像特征，以表示单个相机视图。特别是，我们表明，在相机网络之间，高维SIFT直方图共享一个联合稀疏模式，对应于3d中的一组共同特征。这种联合稀疏模式可以明确地利用随机投影对分布式信号进行精确编码，这种随机投影是无监督的，与传感器模态无关。在基站上，我们研究了基于分布式压缩感知理论的多种解码方案，以同时恢复多视点目标特征。该系统已在伯克利CITRIC智能摄像头平台上实现。通过大量的仿真和实验验证了该算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multiple-view object recognition in band-limited distributed camera networks

In this paper, we study the classical problem of object recognition in low-power, low-bandwidth distributed camera networks. The ability to perform robust object recognition is crucial for applications such as visual surveillance to track and identify objects of interest, and compensate visual nuisances such as occlusion and pose variation between multiple camera views. We propose an effective framework to perform distributed object recognition using a network of smart cameras and a computer as the base station. Due to the limited bandwidth between the cameras and the computer, the method utilizes the available computational power on the smart sensors to locally extract and compress SIFT-type image features to represent individual camera views. In particular, we show that between a network of cameras, high-dimensional SIFT histograms share a joint sparse pattern corresponding to a set of common features in 3-D. Such joint sparse patterns can be explicitly exploited to accurately encode the distributed signal via random projection, which is unsupervised and independent to the sensor modality. On the base station, we study multiple decoding schemes to simultaneously recover the multiple-view object features based on the distributed compressive sensing theory. The system has been implemented on the Berkeley CITRIC smart camera platform. The efficacy of the algorithm is validated through extensive simulation and experiments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC)

自引率

0.00%

发文量