Effective Part Localization in Latent-SVM Training
Yaodong Chen, Renfa Li
2014 22nd International Conference on Pattern Recognition, 2014-12-08
DOI: 10.1109/ICPR.2014.732 (https://doi.org/10.1109/ICPR.2014.732)
Abstract
Deformable part models achieve remarkable detection performance across a variety of object categories. During training, these models rely on energy-based methods and heuristic initialization to search for and localize parts, which is equivalent to learning local object features. Because supervision is weak, however, the learned part detectors contain considerable noise and are not reliable enough to classify the object. This paper investigates the part localization problem and extends the latent-SVM by incorporating local consistency of image features. The objective is to adaptively select part sub-windows that overlap semantically meaningful components as much as possible, which leads to more reliable learning of the part detectors in a weakly supervised setting. The main idea of our method is to estimate part-specific color/texture models and the edge distribution within each training example, and then to perform foreground segmentation for part localization. Experimental results show an overall improvement of about 3% mAP over the latent-SVM on non-rigid objects.
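The goal of selecting part sub-windows that overlap foreground components as much as possible can be illustrated with a small sketch. This is not the authors' algorithm: it assumes a binary foreground mask is already available (for example, from some segmentation step) and that the part window has a fixed size. The function name `best_part_window` and its parameters are hypothetical. It uses an integral image to find the window covering the most foreground pixels.

```python
import numpy as np

def best_part_window(fg_mask, win_h, win_w):
    """Return (row, col, coverage) of the win_h x win_w window
    that covers the largest number of foreground pixels.

    fg_mask is assumed to be a 2-D array with 1 for foreground
    and 0 for background (a hypothetical segmentation output).
    """
    mask = np.asarray(fg_mask, dtype=np.float64)
    height, width = mask.shape
    if win_h > height or win_w > width:
        raise ValueError("window is larger than the mask")

    # Integral image with a zero border, so that every window sum
    # can be computed from four table lookups.
    integral = np.zeros((height + 1, width + 1))
    integral[1:, 1:] = mask.cumsum(axis=0).cumsum(axis=1)

    # sums[r, c] is the foreground count of mask[r:r+win_h, c:c+win_w].
    sums = (integral[win_h:, win_w:] - integral[:-win_h, win_w:]
            - integral[win_h:, :-win_w] + integral[:-win_h, :-win_w])

    row, col = np.unravel_index(np.argmax(sums), sums.shape)
    coverage = sums[row, col] / (win_h * win_w)
    return int(row), int(col), float(coverage)

# Toy example: a 3x3 foreground blob inside a 6x8 background.
toy_mask = np.zeros((6, 8))
toy_mask[2:5, 3:6] = 1
print(best_part_window(toy_mask, 3, 3))  # (2, 3, 1.0)
```

In a full training pipeline, a coverage term of this kind would presumably be combined with the part filter response and deformation cost rather than used alone; the sketch isolates only the overlap criterion.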