通过局部补丁和上下文信息进行视觉跟踪

2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-11 DOI:10.1109/ICALIP.2016.7846526

Hua Bao, Zonghai Chen

{"title":"通过局部补丁和上下文信息进行视觉跟踪","authors":"Hua Bao, Zonghai Chen","doi":"10.1109/ICALIP.2016.7846526","DOIUrl":null,"url":null,"abstract":"In this paper, a new visual tracking approach via the local patches and the contextual information of the target is presented. In the tracking procedure, the target object is decomposed into a set of patches of equal size and each patch is represented by using intensity and gradient histograms. Then, the likelihood of local patches is defined as the weighted sum of reliability and stability indices, which are applied to evaluate the patches' robustness. Furthermore, the target is represented by using double bounding boxes corresponding to the foreground and background, respectively, which are encoded by HSV color histograms. As this, the drifts can be effectively suppressed by using the contextual information. In the tracking process, the object position is estimated by maximizing the likelihood of the target under the Bayesian framework. The experimental results demonstrate that the proposed approach performs much better than the existing state-of-the-art methods do in terms of efficiency, accuracy and robustness.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Visual tracking via local patches and contextual information\",\"authors\":\"Hua Bao, Zonghai Chen\",\"doi\":\"10.1109/ICALIP.2016.7846526\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a new visual tracking approach via the local patches and the contextual information of the target is presented. In the tracking procedure, the target object is decomposed into a set of patches of equal size and each patch is represented by using intensity and gradient histograms. Then, the likelihood of local patches is defined as the weighted sum of reliability and stability indices, which are applied to evaluate the patches' robustness. Furthermore, the target is represented by using double bounding boxes corresponding to the foreground and background, respectively, which are encoded by HSV color histograms. As this, the drifts can be effectively suppressed by using the contextual information. In the tracking process, the object position is estimated by maximizing the likelihood of the target under the Bayesian framework. The experimental results demonstrate that the proposed approach performs much better than the existing state-of-the-art methods do in terms of efficiency, accuracy and robustness.\",\"PeriodicalId\":184170,\"journal\":{\"name\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICALIP.2016.7846526\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICALIP.2016.7846526","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种利用局部斑块和目标的上下文信息进行视觉跟踪的新方法。在跟踪过程中，将目标物体分解成一组大小相等的小块，每个小块用强度直方图和梯度直方图表示。然后，将局部补丁的似然度定义为可靠性指标和稳定性指标的加权和，用于评估局部补丁的鲁棒性;利用前景和背景分别对应的双边界框表示目标，用HSV颜色直方图编码。因此，可以通过使用上下文信息有效地抑制漂移。在跟踪过程中，在贝叶斯框架下，通过最大化目标的似然来估计目标的位置。实验结果表明，该方法在效率、精度和鲁棒性方面都优于现有的先进方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Visual tracking via local patches and contextual information

In this paper, a new visual tracking approach via the local patches and the contextual information of the target is presented. In the tracking procedure, the target object is decomposed into a set of patches of equal size and each patch is represented by using intensity and gradient histograms. Then, the likelihood of local patches is defined as the weighted sum of reliability and stability indices, which are applied to evaluate the patches' robustness. Furthermore, the target is represented by using double bounding boxes corresponding to the foreground and background, respectively, which are encoded by HSV color histograms. As this, the drifts can be effectively suppressed by using the contextual information. In the tracking process, the object position is estimated by maximizing the likelihood of the target under the Bayesian framework. The experimental results demonstrate that the proposed approach performs much better than the existing state-of-the-art methods do in terms of efficiency, accuracy and robustness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 International Conference on Audio, Language and Image Processing (ICALIP)

自引率

0.00%

发文量