基于全向相机的无监督单目深度估计用于野生葡萄浆果的三维重建。

IF 2.6 3区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

PLoS ONE Pub Date : 2025-02-03 eCollection Date: 2025-01-01 DOI:10.1371/journal.pone.0317359

Yasuto Tamura, Yuzuko Utsumi, Yuka Miwa, Masakazu Iwamura, Koichi Kise

{"title":"基于全向相机的无监督单目深度估计用于野生葡萄浆果的三维重建。","authors":"Yasuto Tamura, Yuzuko Utsumi, Yuka Miwa, Masakazu Iwamura, Koichi Kise","doi":"10.1371/journal.pone.0317359","DOIUrl":null,"url":null,"abstract":"Japanese table grapes are quite expensive because their production is highly labor-intensive. In particular, grape berry pruning is a labor-intensive task performed to produce grapes with desirable characteristics. Because it is considered difficult to master, it is desirable to assist new entrants by using information technology to show the recommended berries to cut. In this research, we aim to build a system that identifies which grape berries should be removed during the pruning process. To realize this, the 3D positions of individual grape berries need to be estimated. Our environmental restriction is that bunches hang from trellises at a height of about 1.6 meters in the grape orchards outside. It is hard to use depth sensors in such circumstances, and using an omnidirectional camera with a wide field of view is desired for the convenience of shooting videos. Obtaining 3D information of grape berries from videos is challenging because they have textureless surfaces, highly symmetric shapes, and crowded arrangements. For these reasons, it is hard to use conventional 3D reconstruction methods, which rely on matching local unique features. To satisfy the practical constraints of this task, we extend a deep learning-based unsupervised monocular depth estimation method to an omnidirectional camera and propose using it. Our experiments demonstrate the effectiveness of the proposed method for estimating the 3D positions of grape berries in the wild.","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 2","pages":"e0317359"},"PeriodicalIF":2.6000,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11790092/pdf/","citationCount":"0","resultStr":"{\"title\":\"Unsupervised monocular depth estimation with omnidirectional camera for 3D reconstruction of grape berries in the wild.\",\"authors\":\"Yasuto Tamura, Yuzuko Utsumi, Yuka Miwa, Masakazu Iwamura, Koichi Kise\",\"doi\":\"10.1371/journal.pone.0317359\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Japanese table grapes are quite expensive because their production is highly labor-intensive. In particular, grape berry pruning is a labor-intensive task performed to produce grapes with desirable characteristics. Because it is considered difficult to master, it is desirable to assist new entrants by using information technology to show the recommended berries to cut. In this research, we aim to build a system that identifies which grape berries should be removed during the pruning process. To realize this, the 3D positions of individual grape berries need to be estimated. Our environmental restriction is that bunches hang from trellises at a height of about 1.6 meters in the grape orchards outside. It is hard to use depth sensors in such circumstances, and using an omnidirectional camera with a wide field of view is desired for the convenience of shooting videos. Obtaining 3D information of grape berries from videos is challenging because they have textureless surfaces, highly symmetric shapes, and crowded arrangements. For these reasons, it is hard to use conventional 3D reconstruction methods, which rely on matching local unique features. To satisfy the practical constraints of this task, we extend a deep learning-based unsupervised monocular depth estimation method to an omnidirectional camera and propose using it. Our experiments demonstrate the effectiveness of the proposed method for estimating the 3D positions of grape berries in the wild.\",\"PeriodicalId\":20189,\"journal\":{\"name\":\"PLoS ONE\",\"volume\":\"20 2\",\"pages\":\"e0317359\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-02-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11790092/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PLoS ONE\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1371/journal.pone.0317359\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0317359","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

日本鲜食葡萄非常昂贵，因为它们的生产是高度劳动密集型的。特别是，葡萄浆果修剪是一项劳动密集型的任务，以生产具有理想特性的葡萄。因为它被认为很难掌握，所以有必要通过使用信息技术来帮助新进入者展示推荐的浆果。在这项研究中，我们的目标是建立一个系统，以确定哪些葡萄浆果应该在修剪过程中去除。为了实现这一点，需要估计单个葡萄浆果的三维位置。我们的环境限制是，在外面的葡萄果园里，一串串的葡萄挂在大约1.6米高的棚架上。在这种情况下很难使用深度传感器，为了方便拍摄视频，需要使用宽视场的全向相机。从视频中获得葡萄浆果的3D信息是具有挑战性的，因为它们具有无纹理的表面，高度对称的形状和拥挤的排列。由于这些原因，传统的三维重建方法很难使用，这些方法依赖于匹配局部特征。为了满足该任务的实际约束，我们将基于深度学习的无监督单目深度估计方法扩展到全向相机，并提出了使用该方法的建议。我们的实验证明了所提出的方法在估计野生葡萄浆果的三维位置方面的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Unsupervised monocular depth estimation with omnidirectional camera for 3D reconstruction of grape berries in the wild.

查看原文本刊更多论文

Unsupervised monocular depth estimation with omnidirectional camera for 3D reconstruction of grape berries in the wild.

Japanese table grapes are quite expensive because their production is highly labor-intensive. In particular, grape berry pruning is a labor-intensive task performed to produce grapes with desirable characteristics. Because it is considered difficult to master, it is desirable to assist new entrants by using information technology to show the recommended berries to cut. In this research, we aim to build a system that identifies which grape berries should be removed during the pruning process. To realize this, the 3D positions of individual grape berries need to be estimated. Our environmental restriction is that bunches hang from trellises at a height of about 1.6 meters in the grape orchards outside. It is hard to use depth sensors in such circumstances, and using an omnidirectional camera with a wide field of view is desired for the convenience of shooting videos. Obtaining 3D information of grape berries from videos is challenging because they have textureless surfaces, highly symmetric shapes, and crowded arrangements. For these reasons, it is hard to use conventional 3D reconstruction methods, which rely on matching local unique features. To satisfy the practical constraints of this task, we extend a deep learning-based unsupervised monocular depth estimation method to an omnidirectional camera and propose using it. Our experiments demonstrate the effectiveness of the proposed method for estimating the 3D positions of grape berries in the wild.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

PLoS ONE 生物-生物学

CiteScore

6.20

自引率

5.40%

发文量

14242

审稿时长

3.7 months

期刊介绍： PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides: * Open-access—freely accessible online, authors retain copyright * Fast publication times * Peer review by expert, practicing researchers * Post-publication tools to indicate quality and impact * Community-based dialogue on articles * Worldwide media coverage