Junshu Wang , Yang Guo , Xinjie Tan , Yubin Lan , Yuxing Han
{"title":"复杂背景下纹理一致性损失和反向注意机制增强绿番石榴分割","authors":"Junshu Wang , Yang Guo , Xinjie Tan , Yubin Lan , Yuxing Han","doi":"10.1016/j.compag.2025.110308","DOIUrl":null,"url":null,"abstract":"<div><div>Green crops like guava, unlike the majority that can be distinctly separated from the background by color contrast, often share color characteristics with the surrounding leaves, making the accuracy of crop detection and further pixel-level prediction decrease in complex natural environment. Therefore, this work took the texture and boundaries of crops as the focus, proposed a Gabor based texture consistency loss and a reverse attention module (RAM). Meanwhile, both a receptive field module (RFM) and a mutual fusion decoder (MFD) were proposed to enhance the utilization of semantic information. Finally, a stepwise prediction refinement method with the deep prediction map as prior information was designed in the model framework, realizing a further enhancement of the inference ability. In the ablation experiments, this work verified the effectiveness of the proposed improvements step by step using Classification Evaluation Metrics and provided the visualization of the reverse attention. In the comparative experiments, this model demonstrated its advantages in contrast to state-of-the-arts such as U-Net, SETR, and SegFormer. The Acc and IoU reached 0.9954 and 0.9420, exceeding those of SegFormer by 0.0087 and 0.0119 respectively, demonstrating its application potential for agricultural robot visual systems. Moreover, to further demonstrate the inference capability of the proposed model, we conducted validation on two open-source building extraction datasets, WHU and MBD, which have similar task difficulties, and achieved significant results.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"235 ","pages":"Article 110308"},"PeriodicalIF":7.7000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing green guava segmentation with texture consistency loss and reverse attention mechanism under complex background\",\"authors\":\"Junshu Wang , Yang Guo , Xinjie Tan , Yubin Lan , Yuxing Han\",\"doi\":\"10.1016/j.compag.2025.110308\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Green crops like guava, unlike the majority that can be distinctly separated from the background by color contrast, often share color characteristics with the surrounding leaves, making the accuracy of crop detection and further pixel-level prediction decrease in complex natural environment. Therefore, this work took the texture and boundaries of crops as the focus, proposed a Gabor based texture consistency loss and a reverse attention module (RAM). Meanwhile, both a receptive field module (RFM) and a mutual fusion decoder (MFD) were proposed to enhance the utilization of semantic information. Finally, a stepwise prediction refinement method with the deep prediction map as prior information was designed in the model framework, realizing a further enhancement of the inference ability. In the ablation experiments, this work verified the effectiveness of the proposed improvements step by step using Classification Evaluation Metrics and provided the visualization of the reverse attention. In the comparative experiments, this model demonstrated its advantages in contrast to state-of-the-arts such as U-Net, SETR, and SegFormer. The Acc and IoU reached 0.9954 and 0.9420, exceeding those of SegFormer by 0.0087 and 0.0119 respectively, demonstrating its application potential for agricultural robot visual systems. Moreover, to further demonstrate the inference capability of the proposed model, we conducted validation on two open-source building extraction datasets, WHU and MBD, which have similar task difficulties, and achieved significant results.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"235 \",\"pages\":\"Article 110308\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925004144\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925004144","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
Enhancing green guava segmentation with texture consistency loss and reverse attention mechanism under complex background
Green crops like guava, unlike the majority that can be distinctly separated from the background by color contrast, often share color characteristics with the surrounding leaves, making the accuracy of crop detection and further pixel-level prediction decrease in complex natural environment. Therefore, this work took the texture and boundaries of crops as the focus, proposed a Gabor based texture consistency loss and a reverse attention module (RAM). Meanwhile, both a receptive field module (RFM) and a mutual fusion decoder (MFD) were proposed to enhance the utilization of semantic information. Finally, a stepwise prediction refinement method with the deep prediction map as prior information was designed in the model framework, realizing a further enhancement of the inference ability. In the ablation experiments, this work verified the effectiveness of the proposed improvements step by step using Classification Evaluation Metrics and provided the visualization of the reverse attention. In the comparative experiments, this model demonstrated its advantages in contrast to state-of-the-arts such as U-Net, SETR, and SegFormer. The Acc and IoU reached 0.9954 and 0.9420, exceeding those of SegFormer by 0.0087 and 0.0119 respectively, demonstrating its application potential for agricultural robot visual systems. Moreover, to further demonstrate the inference capability of the proposed model, we conducted validation on two open-source building extraction datasets, WHU and MBD, which have similar task difficulties, and achieved significant results.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.