{"title":"建立用于三维目标姿态估计的低维流形","authors":"R. Kouskouridas, A. Gasteratos","doi":"10.1109/IST.2012.6295483","DOIUrl":null,"url":null,"abstract":"We propose a novel solution to the problem of 3D object pose estimation problem that is based on an efficient representation and feature extraction technique. We build a part-based architecture that takes into account both appearance-based characteristics of targets along with their geometrical attributes. This bunch-based structure encompasses an image feature extraction procedure accompanied by a clustering scheme over the abstracted key-points. In a follow-up step, these clusters are considered to establish representative manifolds capable of distinguishing similar poses of different objects into the corresponding classes. We form low dimensional manifolds by incorporating sophisticated operations over the members (clusters) of the extracted part-based architecture. An accurate estimation of the pose of a target is provided by a neural network-based solution that entails a novel input-output space targeting method. The performance of our method is comparatively studied against other related works that provide solution to the 3D object pose estimation and that are based on a) manifold modeling, b) object part-based representation and c) conventional dimensionality reduction frameworks. Experimental results justify our theoretical claims and provide evidence of low generalization error when estimating the 3D pose of objects, with the best performance achieved when employing the Radial Basis Functions kernel.","PeriodicalId":213330,"journal":{"name":"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings","volume":"412 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Establishing low dimensional manifolds for 3D object pose estimation\",\"authors\":\"R. Kouskouridas, A. Gasteratos\",\"doi\":\"10.1109/IST.2012.6295483\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a novel solution to the problem of 3D object pose estimation problem that is based on an efficient representation and feature extraction technique. We build a part-based architecture that takes into account both appearance-based characteristics of targets along with their geometrical attributes. This bunch-based structure encompasses an image feature extraction procedure accompanied by a clustering scheme over the abstracted key-points. In a follow-up step, these clusters are considered to establish representative manifolds capable of distinguishing similar poses of different objects into the corresponding classes. We form low dimensional manifolds by incorporating sophisticated operations over the members (clusters) of the extracted part-based architecture. An accurate estimation of the pose of a target is provided by a neural network-based solution that entails a novel input-output space targeting method. The performance of our method is comparatively studied against other related works that provide solution to the 3D object pose estimation and that are based on a) manifold modeling, b) object part-based representation and c) conventional dimensionality reduction frameworks. Experimental results justify our theoretical claims and provide evidence of low generalization error when estimating the 3D pose of objects, with the best performance achieved when employing the Radial Basis Functions kernel.\",\"PeriodicalId\":213330,\"journal\":{\"name\":\"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings\",\"volume\":\"412 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IST.2012.6295483\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IST.2012.6295483","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Establishing low dimensional manifolds for 3D object pose estimation
We propose a novel solution to the problem of 3D object pose estimation problem that is based on an efficient representation and feature extraction technique. We build a part-based architecture that takes into account both appearance-based characteristics of targets along with their geometrical attributes. This bunch-based structure encompasses an image feature extraction procedure accompanied by a clustering scheme over the abstracted key-points. In a follow-up step, these clusters are considered to establish representative manifolds capable of distinguishing similar poses of different objects into the corresponding classes. We form low dimensional manifolds by incorporating sophisticated operations over the members (clusters) of the extracted part-based architecture. An accurate estimation of the pose of a target is provided by a neural network-based solution that entails a novel input-output space targeting method. The performance of our method is comparatively studied against other related works that provide solution to the 3D object pose estimation and that are based on a) manifold modeling, b) object part-based representation and c) conventional dimensionality reduction frameworks. Experimental results justify our theoretical claims and provide evidence of low generalization error when estimating the 3D pose of objects, with the best performance achieved when employing the Radial Basis Functions kernel.