低阶非刚性重构的闭型解

2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2015-11-01 DOI:10.1109/DICTA.2015.7371247

Jack Valmadre, S. Sridharan, S. Denman, C. Fookes, S. Lucey

{"title":"低阶非刚性重构的闭型解","authors":"Jack Valmadre, S. Sridharan, S. Denman, C. Fookes, S. Lucey","doi":"10.1109/DICTA.2015.7371247","DOIUrl":null,"url":null,"abstract":"Recovering the motion of a non-rigid body from a set of monocular images permits the analysis of dynamic scenes in uncontrolled environments. However, the extension of factorisation algorithms for rigid structure from motion to the low-rank non- rigid case has proved challenging. This stems from the comparatively hard problem of finding a linear ``corrective transform'' which recovers the projection and structure matrices from an ambiguous factorisation. We elucidate that this greater difficulty is due to the need to find multiple solutions to a non-trivial problem, casting a number of previous approaches as alleviating this issue by either a) introducing constraints on the basis, making the problems non- identical, or b) incorporating heuristics to encourage a diverse set of solutions, making the problems inter-dependent. While it has previously been recognised that finding a single solution to this problem is sufficient to estimate cameras, we show that it is possible to bootstrap this partial solution to find the complete transform in closed-form. However, we acknowledge that our method minimises an algebraic error and is thus inherently sensitive to deviation from the low-rank model. We compare our closed-form solution for non-rigid structure with known cameras to the closed-form solution of Dai et al.~\\cite{Dai2012}, which we find to produce only coplanar reconstructions. We therefore make the recommendation that 3D reconstruction error always be measured relative to a trivial reconstruction such as a planar one.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Closed-Form Solutions for Low-Rank Non-Rigid Reconstruction\",\"authors\":\"Jack Valmadre, S. Sridharan, S. Denman, C. Fookes, S. Lucey\",\"doi\":\"10.1109/DICTA.2015.7371247\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recovering the motion of a non-rigid body from a set of monocular images permits the analysis of dynamic scenes in uncontrolled environments. However, the extension of factorisation algorithms for rigid structure from motion to the low-rank non- rigid case has proved challenging. This stems from the comparatively hard problem of finding a linear ``corrective transform'' which recovers the projection and structure matrices from an ambiguous factorisation. We elucidate that this greater difficulty is due to the need to find multiple solutions to a non-trivial problem, casting a number of previous approaches as alleviating this issue by either a) introducing constraints on the basis, making the problems non- identical, or b) incorporating heuristics to encourage a diverse set of solutions, making the problems inter-dependent. While it has previously been recognised that finding a single solution to this problem is sufficient to estimate cameras, we show that it is possible to bootstrap this partial solution to find the complete transform in closed-form. However, we acknowledge that our method minimises an algebraic error and is thus inherently sensitive to deviation from the low-rank model. We compare our closed-form solution for non-rigid structure with known cameras to the closed-form solution of Dai et al.~\\\\cite{Dai2012}, which we find to produce only coplanar reconstructions. We therefore make the recommendation that 3D reconstruction error always be measured relative to a trivial reconstruction such as a planar one.\",\"PeriodicalId\":214897,\"journal\":{\"name\":\"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2015.7371247\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2015.7371247","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

从一组单目图像中恢复非刚体的运动允许在不受控制的环境中分析动态场景。然而，将刚性结构的分解算法从运动扩展到低阶非刚性情况是具有挑战性的。这源于一个相对困难的问题，即找到一个线性的“校正变换”，它可以从一个模糊的分解中恢复投影和结构矩阵。我们阐明，这种更大的困难是由于需要为一个非平凡的问题找到多个解决方案，通过a)在基础上引入约束，使问题不相同，或b)结合启发式来鼓励不同的解决方案，使问题相互依赖，从而将许多先前的方法用于缓解这个问题。虽然以前已经认识到找到这个问题的单一解决方案足以估计相机，但我们表明有可能引导这个部分解决方案以找到封闭形式的完整变换。然而，我们承认我们的方法最小化了代数误差，因此对低秩模型的偏差本质上是敏感的。我们将我们的具有已知摄像机的非刚性结构的封闭形式解与Dai等人\cite{Dai2012}的封闭形式解进行了比较，我们发现后者只产生共面重建。因此，我们建议，三维重建误差总是相对于一个平凡的重建，如平面重建测量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Closed-Form Solutions for Low-Rank Non-Rigid Reconstruction

Recovering the motion of a non-rigid body from a set of monocular images permits the analysis of dynamic scenes in uncontrolled environments. However, the extension of factorisation algorithms for rigid structure from motion to the low-rank non- rigid case has proved challenging. This stems from the comparatively hard problem of finding a linear ``corrective transform'' which recovers the projection and structure matrices from an ambiguous factorisation. We elucidate that this greater difficulty is due to the need to find multiple solutions to a non-trivial problem, casting a number of previous approaches as alleviating this issue by either a) introducing constraints on the basis, making the problems non- identical, or b) incorporating heuristics to encourage a diverse set of solutions, making the problems inter-dependent. While it has previously been recognised that finding a single solution to this problem is sufficient to estimate cameras, we show that it is possible to bootstrap this partial solution to find the complete transform in closed-form. However, we acknowledge that our method minimises an algebraic error and is thus inherently sensitive to deviation from the low-rank model. We compare our closed-form solution for non-rigid structure with known cameras to the closed-form solution of Dai et al.~\cite{Dai2012}, which we find to produce only coplanar reconstructions. We therefore make the recommendation that 3D reconstruction error always be measured relative to a trivial reconstruction such as a planar one.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

自引率

0.00%

发文量