{"title":"通过局部补丁和上下文信息进行视觉跟踪","authors":"Hua Bao, Zonghai Chen","doi":"10.1109/ICALIP.2016.7846526","DOIUrl":null,"url":null,"abstract":"In this paper, a new visual tracking approach via the local patches and the contextual information of the target is presented. In the tracking procedure, the target object is decomposed into a set of patches of equal size and each patch is represented by using intensity and gradient histograms. Then, the likelihood of local patches is defined as the weighted sum of reliability and stability indices, which are applied to evaluate the patches' robustness. Furthermore, the target is represented by using double bounding boxes corresponding to the foreground and background, respectively, which are encoded by HSV color histograms. As this, the drifts can be effectively suppressed by using the contextual information. In the tracking process, the object position is estimated by maximizing the likelihood of the target under the Bayesian framework. The experimental results demonstrate that the proposed approach performs much better than the existing state-of-the-art methods do in terms of efficiency, accuracy and robustness.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Visual tracking via local patches and contextual information\",\"authors\":\"Hua Bao, Zonghai Chen\",\"doi\":\"10.1109/ICALIP.2016.7846526\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a new visual tracking approach via the local patches and the contextual information of the target is presented. In the tracking procedure, the target object is decomposed into a set of patches of equal size and each patch is represented by using intensity and gradient histograms. Then, the likelihood of local patches is defined as the weighted sum of reliability and stability indices, which are applied to evaluate the patches' robustness. Furthermore, the target is represented by using double bounding boxes corresponding to the foreground and background, respectively, which are encoded by HSV color histograms. As this, the drifts can be effectively suppressed by using the contextual information. In the tracking process, the object position is estimated by maximizing the likelihood of the target under the Bayesian framework. The experimental results demonstrate that the proposed approach performs much better than the existing state-of-the-art methods do in terms of efficiency, accuracy and robustness.\",\"PeriodicalId\":184170,\"journal\":{\"name\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICALIP.2016.7846526\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICALIP.2016.7846526","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Visual tracking via local patches and contextual information
In this paper, a new visual tracking approach via the local patches and the contextual information of the target is presented. In the tracking procedure, the target object is decomposed into a set of patches of equal size and each patch is represented by using intensity and gradient histograms. Then, the likelihood of local patches is defined as the weighted sum of reliability and stability indices, which are applied to evaluate the patches' robustness. Furthermore, the target is represented by using double bounding boxes corresponding to the foreground and background, respectively, which are encoded by HSV color histograms. As this, the drifts can be effectively suppressed by using the contextual information. In the tracking process, the object position is estimated by maximizing the likelihood of the target under the Bayesian framework. The experimental results demonstrate that the proposed approach performs much better than the existing state-of-the-art methods do in terms of efficiency, accuracy and robustness.