单视图延绳钓图像的无监督严重变形网格重建(DMR)

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-01-23 DOI:10.1109/ICMEW56448.2022.9859312

J. Mei, Jingxiang Yu, S. Romain, Craig S. Rose, Kelsey Magrane, Graeme LeeSon, Jenq-Neng Hwang

{"title":"单视图延绳钓图像的无监督严重变形网格重建(DMR)","authors":"J. Mei, Jingxiang Yu, S. Romain, Craig S. Rose, Kelsey Magrane, Graeme LeeSon, Jenq-Neng Hwang","doi":"10.1109/ICMEW56448.2022.9859312","DOIUrl":null,"url":null,"abstract":"Much progress has been made in the supervised learning of 3D reconstruction of rigid objects from multi-view images or a video. However, it is more challenging to reconstruct severely deformed objects from a single-view RGB image in an unsupervised manner. Training-based methods, such as specific category-level training, have been shown to successfully reconstruct rigid objects and slightly deformed objects like birds from a single-view image. However, they cannot effectively handle severely deformed objects and neither can be applied to some downstream tasks in the real world due to the inconsistent semantic meaning of vertices, which are crucial in defining the adopted 3D templates of objects to be reconstructed. In this work, we introduce a template-based method to infer 3D shapes from a single-view image and apply the reconstructed mesh to a downstream task, i.e., absolute length measurement. Without using 3D ground truth, our method faithfully reconstructs 3D meshes and achieves state-of-the-art accuracy in a length measurement task on a severely deformed fish dataset.","PeriodicalId":106759,"journal":{"name":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Unsupervised Severely Deformed Mesh Reconstruction (DMR) From A Single-View Image for Longline Fishing\",\"authors\":\"J. Mei, Jingxiang Yu, S. Romain, Craig S. Rose, Kelsey Magrane, Graeme LeeSon, Jenq-Neng Hwang\",\"doi\":\"10.1109/ICMEW56448.2022.9859312\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Much progress has been made in the supervised learning of 3D reconstruction of rigid objects from multi-view images or a video. However, it is more challenging to reconstruct severely deformed objects from a single-view RGB image in an unsupervised manner. Training-based methods, such as specific category-level training, have been shown to successfully reconstruct rigid objects and slightly deformed objects like birds from a single-view image. However, they cannot effectively handle severely deformed objects and neither can be applied to some downstream tasks in the real world due to the inconsistent semantic meaning of vertices, which are crucial in defining the adopted 3D templates of objects to be reconstructed. In this work, we introduce a template-based method to infer 3D shapes from a single-view image and apply the reconstructed mesh to a downstream task, i.e., absolute length measurement. Without using 3D ground truth, our method faithfully reconstructs 3D meshes and achieves state-of-the-art accuracy in a length measurement task on a severely deformed fish dataset.\",\"PeriodicalId\":106759,\"journal\":{\"name\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMEW56448.2022.9859312\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW56448.2022.9859312","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

基于多视图图像或视频的刚体三维重建的监督学习已经取得了很大的进展。然而，从单视图RGB图像中以无监督的方式重建严重变形的物体更具挑战性。基于训练的方法，如特定类别级别的训练，已经被证明可以成功地从单视图图像中重建刚性物体和轻微变形的物体，如鸟类。然而，由于顶点的语义不一致，它们不能有效地处理严重变形的物体，也不能应用于现实世界中的一些下游任务，而这对于定义要重构的物体所采用的3D模板至关重要。在这项工作中，我们引入了一种基于模板的方法，从单视图图像中推断3D形状，并将重建的网格应用于下游任务，即绝对长度测量。在不使用三维地面真实值的情况下，我们的方法忠实地重建了三维网格，并在严重变形的鱼类数据集上实现了最先进的长度测量任务精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Unsupervised Severely Deformed Mesh Reconstruction (DMR) From A Single-View Image for Longline Fishing

Much progress has been made in the supervised learning of 3D reconstruction of rigid objects from multi-view images or a video. However, it is more challenging to reconstruct severely deformed objects from a single-view RGB image in an unsupervised manner. Training-based methods, such as specific category-level training, have been shown to successfully reconstruct rigid objects and slightly deformed objects like birds from a single-view image. However, they cannot effectively handle severely deformed objects and neither can be applied to some downstream tasks in the real world due to the inconsistent semantic meaning of vertices, which are crucial in defining the adopted 3D templates of objects to be reconstructed. In this work, we introduce a template-based method to infer 3D shapes from a single-view image and apply the reconstructed mesh to a downstream task, i.e., absolute length measurement. Without using 3D ground truth, our method faithfully reconstructs 3D meshes and achieves state-of-the-art accuracy in a length measurement task on a severely deformed fish dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

自引率

0.00%

发文量