Hongxing Peng , Shangkun Guo , Xiangjun Zou , Hongjun Wang , Juntao Xiong , Qijun Liang
{"title":"UAV - nerf:基于无人机图像神经辐射场的果园三维重建与果树语义分割","authors":"Hongxing Peng , Shangkun Guo , Xiangjun Zou , Hongjun Wang , Juntao Xiong , Qijun Liang","doi":"10.1016/j.compag.2025.110631","DOIUrl":null,"url":null,"abstract":"<div><div>In precision agriculture, accurate 3D reconstruction of orchard environments is essential for crop health monitoring and automating agricultural tasks. This paper introduces UAVO-NeRF, a novel method using Unmanned Aerial Vehicles (UAVs) for high-fidelity 3D reconstruction and semantic segmentation of orchard scenes. To address inefficiencies in large-scale outdoor environments, we employ a nonlinear scene parameterization that compresses the unbounded scene into a cubic space, enabling denser sampling of distant points. We implement multi-resolution hash encoding to capture both global context and local details, significantly enhancing reconstruction speed and quality. To handle lighting variability, we incorporate appearance embeddings that adaptively encode lighting conditions, increasing the model’s robustness under diverse illumination. Our network’s output layer includes a 3D semantic segmentation module that distinguishes fruit trees from background elements, using a cross-entropy loss function to measure the difference between predicted and actual semantic labels. Depth prediction accuracy is improved using depth maps generated by a pre-trained monocular depth estimation model, refined through a composite loss function that combines reconstruction, depth, semantic, visibility, and interlevel losses to minimize artifacts and enhance geometric representation. Experimental results demonstrate that UAVO-NeRF achieves a Peak Signal-to-Noise Ratio (PSNR) of 23.82, outperforming state-of-the-art models like Instant-NGP and Mip-NeRF 360 across metrics such as PSNR, Structural Similarity Index Measure (SSIM), and Learned Perceptual Image Patch Similarity (LPIPS). Additionally, UAVO-NeRF achieves a mean Intersection over Union (mIoU) of 0.891 for fruit tree semantic segmentation from novel viewpoints, exceeding traditional 2D models by over 5%. This approach offers a robust technological solution for digital twin applications in agriculture.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"237 ","pages":"Article 110631"},"PeriodicalIF":8.9000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UAVO-NeRF: 3D reconstruction of orchards and semantic segmentation of fruit trees based on neural radiance field in UAV images\",\"authors\":\"Hongxing Peng , Shangkun Guo , Xiangjun Zou , Hongjun Wang , Juntao Xiong , Qijun Liang\",\"doi\":\"10.1016/j.compag.2025.110631\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In precision agriculture, accurate 3D reconstruction of orchard environments is essential for crop health monitoring and automating agricultural tasks. This paper introduces UAVO-NeRF, a novel method using Unmanned Aerial Vehicles (UAVs) for high-fidelity 3D reconstruction and semantic segmentation of orchard scenes. To address inefficiencies in large-scale outdoor environments, we employ a nonlinear scene parameterization that compresses the unbounded scene into a cubic space, enabling denser sampling of distant points. We implement multi-resolution hash encoding to capture both global context and local details, significantly enhancing reconstruction speed and quality. To handle lighting variability, we incorporate appearance embeddings that adaptively encode lighting conditions, increasing the model’s robustness under diverse illumination. Our network’s output layer includes a 3D semantic segmentation module that distinguishes fruit trees from background elements, using a cross-entropy loss function to measure the difference between predicted and actual semantic labels. Depth prediction accuracy is improved using depth maps generated by a pre-trained monocular depth estimation model, refined through a composite loss function that combines reconstruction, depth, semantic, visibility, and interlevel losses to minimize artifacts and enhance geometric representation. Experimental results demonstrate that UAVO-NeRF achieves a Peak Signal-to-Noise Ratio (PSNR) of 23.82, outperforming state-of-the-art models like Instant-NGP and Mip-NeRF 360 across metrics such as PSNR, Structural Similarity Index Measure (SSIM), and Learned Perceptual Image Patch Similarity (LPIPS). Additionally, UAVO-NeRF achieves a mean Intersection over Union (mIoU) of 0.891 for fruit tree semantic segmentation from novel viewpoints, exceeding traditional 2D models by over 5%. This approach offers a robust technological solution for digital twin applications in agriculture.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"237 \",\"pages\":\"Article 110631\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-06-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925007379\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925007379","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
UAVO-NeRF: 3D reconstruction of orchards and semantic segmentation of fruit trees based on neural radiance field in UAV images
In precision agriculture, accurate 3D reconstruction of orchard environments is essential for crop health monitoring and automating agricultural tasks. This paper introduces UAVO-NeRF, a novel method using Unmanned Aerial Vehicles (UAVs) for high-fidelity 3D reconstruction and semantic segmentation of orchard scenes. To address inefficiencies in large-scale outdoor environments, we employ a nonlinear scene parameterization that compresses the unbounded scene into a cubic space, enabling denser sampling of distant points. We implement multi-resolution hash encoding to capture both global context and local details, significantly enhancing reconstruction speed and quality. To handle lighting variability, we incorporate appearance embeddings that adaptively encode lighting conditions, increasing the model’s robustness under diverse illumination. Our network’s output layer includes a 3D semantic segmentation module that distinguishes fruit trees from background elements, using a cross-entropy loss function to measure the difference between predicted and actual semantic labels. Depth prediction accuracy is improved using depth maps generated by a pre-trained monocular depth estimation model, refined through a composite loss function that combines reconstruction, depth, semantic, visibility, and interlevel losses to minimize artifacts and enhance geometric representation. Experimental results demonstrate that UAVO-NeRF achieves a Peak Signal-to-Noise Ratio (PSNR) of 23.82, outperforming state-of-the-art models like Instant-NGP and Mip-NeRF 360 across metrics such as PSNR, Structural Similarity Index Measure (SSIM), and Learned Perceptual Image Patch Similarity (LPIPS). Additionally, UAVO-NeRF achieves a mean Intersection over Union (mIoU) of 0.891 for fruit tree semantic segmentation from novel viewpoints, exceeding traditional 2D models by over 5%. This approach offers a robust technological solution for digital twin applications in agriculture.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.