Qifei Tian , Huichun Zhang , Liming Bian , Lei Zhou , Zhuhao Shen , Yufeng Ge
{"title":"基于多模态无人机图像零射分割的杨树幼苗生物量田间表型评价","authors":"Qifei Tian , Huichun Zhang , Liming Bian , Lei Zhou , Zhuhao Shen , Yufeng Ge","doi":"10.1016/j.compag.2025.110462","DOIUrl":null,"url":null,"abstract":"<div><div>Poplar trees are widely cultivated for their ecological and economic benefits. Studying the phenotypes of poplar seedlings can enable the selection of optimal cultivation methods to enhance yield and quality. UAV-based low-altitude remote sensing with optical sensors captures images and spectral data for such studies. However, deep learning in UAV plant phenotyping faces the challenge of requiring substantial time and effort to label image samples for model training. This paper aims to assess the efficiency of using Grounding DINO-SAM2 for zero-shot instance segmentation of individual poplar seedlings across multiple genotypes. An automatic program calculates image features from RGB and multispectral mask areas, including canopy projection, color, texture, and spectral reflectance, which are then used to establish a biomass estimation model based on two years of data. The study obtained the following results: (1) The Grounding DINO-SAM2 model was used to implement zero-labelled sample instance segmentation of 400 image data. After modifying the sample with incorrect target recognition quantity in less than 15 min, the total model took only 0.5 h, with a precision of 0.943, which greatly saved time and computing cost compared with mainstream fully-supervised segmentation models. (2) A poplar seedling biomass estimation model based on multimodal image features was established. After comparing and optimizing single-sensor and multi-sensor combined with different modelling algorithms, it was found that the CNN test set accuracy (R<sup>2</sup>) reached 0.823. This research provides a lightweight, cost-effective approach for plant image segmentation and feature extraction, promoting advances in intelligent management and monitoring for agriculture and forestry.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"236 ","pages":"Article 110462"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Field-based phenotyping for poplar seedlings biomass evaluation based on zero-shot segmentation with multimodal UAV images\",\"authors\":\"Qifei Tian , Huichun Zhang , Liming Bian , Lei Zhou , Zhuhao Shen , Yufeng Ge\",\"doi\":\"10.1016/j.compag.2025.110462\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Poplar trees are widely cultivated for their ecological and economic benefits. Studying the phenotypes of poplar seedlings can enable the selection of optimal cultivation methods to enhance yield and quality. UAV-based low-altitude remote sensing with optical sensors captures images and spectral data for such studies. However, deep learning in UAV plant phenotyping faces the challenge of requiring substantial time and effort to label image samples for model training. This paper aims to assess the efficiency of using Grounding DINO-SAM2 for zero-shot instance segmentation of individual poplar seedlings across multiple genotypes. An automatic program calculates image features from RGB and multispectral mask areas, including canopy projection, color, texture, and spectral reflectance, which are then used to establish a biomass estimation model based on two years of data. The study obtained the following results: (1) The Grounding DINO-SAM2 model was used to implement zero-labelled sample instance segmentation of 400 image data. After modifying the sample with incorrect target recognition quantity in less than 15 min, the total model took only 0.5 h, with a precision of 0.943, which greatly saved time and computing cost compared with mainstream fully-supervised segmentation models. (2) A poplar seedling biomass estimation model based on multimodal image features was established. After comparing and optimizing single-sensor and multi-sensor combined with different modelling algorithms, it was found that the CNN test set accuracy (R<sup>2</sup>) reached 0.823. This research provides a lightweight, cost-effective approach for plant image segmentation and feature extraction, promoting advances in intelligent management and monitoring for agriculture and forestry.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"236 \",\"pages\":\"Article 110462\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S016816992500568X\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016816992500568X","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
Field-based phenotyping for poplar seedlings biomass evaluation based on zero-shot segmentation with multimodal UAV images
Poplar trees are widely cultivated for their ecological and economic benefits. Studying the phenotypes of poplar seedlings can enable the selection of optimal cultivation methods to enhance yield and quality. UAV-based low-altitude remote sensing with optical sensors captures images and spectral data for such studies. However, deep learning in UAV plant phenotyping faces the challenge of requiring substantial time and effort to label image samples for model training. This paper aims to assess the efficiency of using Grounding DINO-SAM2 for zero-shot instance segmentation of individual poplar seedlings across multiple genotypes. An automatic program calculates image features from RGB and multispectral mask areas, including canopy projection, color, texture, and spectral reflectance, which are then used to establish a biomass estimation model based on two years of data. The study obtained the following results: (1) The Grounding DINO-SAM2 model was used to implement zero-labelled sample instance segmentation of 400 image data. After modifying the sample with incorrect target recognition quantity in less than 15 min, the total model took only 0.5 h, with a precision of 0.943, which greatly saved time and computing cost compared with mainstream fully-supervised segmentation models. (2) A poplar seedling biomass estimation model based on multimodal image features was established. After comparing and optimizing single-sensor and multi-sensor combined with different modelling algorithms, it was found that the CNN test set accuracy (R2) reached 0.823. This research provides a lightweight, cost-effective approach for plant image segmentation and feature extraction, promoting advances in intelligent management and monitoring for agriculture and forestry.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.