Thomas Pöllabauer, Jiayin Li, Volker Knauthe, Sarah Berkei, Arjan Kuijper
{"title":"用于 6DoF 物体姿态估计的端到端概率几何引导回归技术","authors":"Thomas Pöllabauer, Jiayin Li, Volker Knauthe, Sarah Berkei, Arjan Kuijper","doi":"arxiv-2409.11819","DOIUrl":null,"url":null,"abstract":"6D object pose estimation is the problem of identifying the position and\norientation of an object relative to a chosen coordinate system, which is a\ncore technology for modern XR applications. State-of-the-art 6D object pose\nestimators directly predict an object pose given an object observation. Due to\nthe ill-posed nature of the pose estimation problem, where multiple different\nposes can correspond to a single observation, generating additional plausible\nestimates per observation can be valuable. To address this, we reformulate the\nstate-of-the-art algorithm GDRNPP and introduce EPRO-GDR (End-to-End\nProbabilistic Geometry-Guided Regression). Instead of predicting a single pose\nper detection, we estimate a probability density distribution of the pose.\nUsing the evaluation procedure defined by the BOP (Benchmark for 6D Object Pose\nEstimation) Challenge, we test our approach on four of its core datasets and\ndemonstrate superior quantitative results for EPRO-GDR on LM-O, YCB-V, and\nITODD. Our probabilistic solution shows that predicting a pose distribution\ninstead of a single pose can improve state-of-the-art single-view pose\nestimation while providing the additional benefit of being able to sample\nmultiple meaningful pose candidates.","PeriodicalId":501130,"journal":{"name":"arXiv - CS - Computer Vision and Pattern Recognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation\",\"authors\":\"Thomas Pöllabauer, Jiayin Li, Volker Knauthe, Sarah Berkei, Arjan Kuijper\",\"doi\":\"arxiv-2409.11819\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"6D object pose estimation is the problem of identifying the position and\\norientation of an object relative to a chosen coordinate system, which is a\\ncore technology for modern XR applications. State-of-the-art 6D object pose\\nestimators directly predict an object pose given an object observation. Due to\\nthe ill-posed nature of the pose estimation problem, where multiple different\\nposes can correspond to a single observation, generating additional plausible\\nestimates per observation can be valuable. To address this, we reformulate the\\nstate-of-the-art algorithm GDRNPP and introduce EPRO-GDR (End-to-End\\nProbabilistic Geometry-Guided Regression). Instead of predicting a single pose\\nper detection, we estimate a probability density distribution of the pose.\\nUsing the evaluation procedure defined by the BOP (Benchmark for 6D Object Pose\\nEstimation) Challenge, we test our approach on four of its core datasets and\\ndemonstrate superior quantitative results for EPRO-GDR on LM-O, YCB-V, and\\nITODD. Our probabilistic solution shows that predicting a pose distribution\\ninstead of a single pose can improve state-of-the-art single-view pose\\nestimation while providing the additional benefit of being able to sample\\nmultiple meaningful pose candidates.\",\"PeriodicalId\":501130,\"journal\":{\"name\":\"arXiv - CS - Computer Vision and Pattern Recognition\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Computer Vision and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.11819\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation
6D object pose estimation is the problem of identifying the position and
orientation of an object relative to a chosen coordinate system, which is a
core technology for modern XR applications. State-of-the-art 6D object pose
estimators directly predict an object pose given an object observation. Due to
the ill-posed nature of the pose estimation problem, where multiple different
poses can correspond to a single observation, generating additional plausible
estimates per observation can be valuable. To address this, we reformulate the
state-of-the-art algorithm GDRNPP and introduce EPRO-GDR (End-to-End
Probabilistic Geometry-Guided Regression). Instead of predicting a single pose
per detection, we estimate a probability density distribution of the pose.
Using the evaluation procedure defined by the BOP (Benchmark for 6D Object Pose
Estimation) Challenge, we test our approach on four of its core datasets and
demonstrate superior quantitative results for EPRO-GDR on LM-O, YCB-V, and
ITODD. Our probabilistic solution shows that predicting a pose distribution
instead of a single pose can improve state-of-the-art single-view pose
estimation while providing the additional benefit of being able to sample
multiple meaningful pose candidates.