Penggang Wang , Wei Luo , Jiandong Liu , Yongxu Zhou , Xuqing Li , Shipeng Zhao , Guoqing Zhang , Yongxiang Zhao
{"title":"Real-time semantic SLAM-based 3D reconstruction robot for greenhouse vegetables","authors":"Penggang Wang , Wei Luo , Jiandong Liu , Yongxu Zhou , Xuqing Li , Shipeng Zhao , Guoqing Zhang , Yongxiang Zhao","doi":"10.1016/j.compag.2025.110582","DOIUrl":null,"url":null,"abstract":"<div><div>Eggplant harvesting involves processes such as planting, monitoring, and spraying. However, in large eggplant plantations, relying solely on manual labor to complete these tasks not only requires extensive time and high capital costs but also results in extremely low efficiency. By combining unmanned vehicles and visual simultaneous localization and mapping (SLAM) for the three-dimensional (3D) reconstruction of an eggplant orchard we can intuitively obtain information regarding the orchard while providing a foundation for automating the aforementioned tasks. This study proposes a 3D semantic mapping and navigation solution for eggplant orchard based on visual SLAM. The baseline model of this system uses ORB-SLAM2, with improvements made by incorporating SCTNet-B, a semantic segmentation network, to enrich the representation of the point cloud map. In addition, in the tracking thread, a direct method is employed for feature extraction and matching, reducing computational load and enhancing real-time processing speed. Moreover, the point cloud map is converted into an OctoMap to reduce storage consumption. The system is also equipped with the A* algorithm and EGO-Planner algorithm for autonomous navigation of the unmanned vehicle. We implement the proposed system on an unmanned vehicle for field testing and evaluation. The experimental results demonstrate that the SCTNet-B achieves a mean intersection over union of 88.74% for semantic segmentation on a custom eggplant image dataset, with a frame rate of 28.36 FPS. After adopting the direct method, the system’s front-end tracking thread reduces the average processing time per frame by 55.68%. Using an OctoMap for storage of the semantic point cloud map results in an average reduction in memory consumption of 95.20%. The proposed system, integrated with the unmanned vehicle, meets the requirements for real-time semantic mapping and storage in large-scale eggplant orchard.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"237 ","pages":"Article 110582"},"PeriodicalIF":7.7000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016816992500688X","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Eggplant harvesting involves processes such as planting, monitoring, and spraying. However, in large eggplant plantations, relying solely on manual labor to complete these tasks not only requires extensive time and high capital costs but also results in extremely low efficiency. By combining unmanned vehicles and visual simultaneous localization and mapping (SLAM) for the three-dimensional (3D) reconstruction of an eggplant orchard we can intuitively obtain information regarding the orchard while providing a foundation for automating the aforementioned tasks. This study proposes a 3D semantic mapping and navigation solution for eggplant orchard based on visual SLAM. The baseline model of this system uses ORB-SLAM2, with improvements made by incorporating SCTNet-B, a semantic segmentation network, to enrich the representation of the point cloud map. In addition, in the tracking thread, a direct method is employed for feature extraction and matching, reducing computational load and enhancing real-time processing speed. Moreover, the point cloud map is converted into an OctoMap to reduce storage consumption. The system is also equipped with the A* algorithm and EGO-Planner algorithm for autonomous navigation of the unmanned vehicle. We implement the proposed system on an unmanned vehicle for field testing and evaluation. The experimental results demonstrate that the SCTNet-B achieves a mean intersection over union of 88.74% for semantic segmentation on a custom eggplant image dataset, with a frame rate of 28.36 FPS. After adopting the direct method, the system’s front-end tracking thread reduces the average processing time per frame by 55.68%. Using an OctoMap for storage of the semantic point cloud map results in an average reduction in memory consumption of 95.20%. The proposed system, integrated with the unmanned vehicle, meets the requirements for real-time semantic mapping and storage in large-scale eggplant orchard.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.