{"title":"Multi-scale feature fusion-based semantic segmentation network for agricultural remote sensing images","authors":"Guoxun Zheng, Zhengang Jiang, Xiaoxian Zhang, Donghui Jiang","doi":"10.1186/s40538-025-00833-8","DOIUrl":null,"url":null,"abstract":"<div><p>With the widespread application of high-resolution low-altitude remote sensing technology in agricultural monitoring, fine semantic segmentation of crop plots has become a hot research topic. However, due to the high similarity of land features in large-scale agricultural scenes and the presence of complex situations such as blurred land boundaries, achieving accurate semantic segmentation still faces significant challenges. In response to the above issues, this study proposes a hybrid architecture that combines Convolutional Neural Networks and Transformers, aiming to improve the segmentation accuracy of crop plots in complex scenes, especially in handling areas with fuzzy boundaries and similar features. This method innovatively constructs a global local attention mechanism (GPM-Attention), which generates adaptive attention regions through multi-scale convolution operations, significantly enhancing the model's ability to capture global contextual information. This mechanism not only effectively improves the overall segmentation performance, but also significantly reduces computational redundancy and model complexity by optimizing the computation path. In addition, this study constructed a lightweight edge enhancement module (EEI) as an encoder, which not only expands the local receptive field but also enhances the recognition ability of fine-grained features, effectively solving the problem of crop plot edge blurring. To further optimize the feature fusion effect, this study designed a Feature Adaptive Fusion Module (FAM), which efficiently integrates the multi-level features generated by CNN and Transformer encoders, significantly reducing the semantic information loss of small target features. The experimental results demonstrate that the proposed method achieves a significant performance improvement on the publicly available barley remote sensing dataset, attaining a mean Intersection over Union (mIoU) of 80.39%, which represents an 11.33% increase over state-of-the-art approaches. In addition, the method achieves a 14.2% improvement in F1-score, further confirming its effectiveness. Compared to existing techniques, this study presents a more favorable trade-off among segmentation accuracy, computational efficiency, and model complexity, thereby offering reliable technical support for the practical deployment of low-altitude remote sensing imagery in agricultural monitoring applications.</p><h3>Graphical Abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":512,"journal":{"name":"Chemical and Biological Technologies in Agriculture","volume":"12 1","pages":""},"PeriodicalIF":5.2000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://chembioagro.springeropen.com/counter/pdf/10.1186/s40538-025-00833-8","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical and Biological Technologies in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://link.springer.com/article/10.1186/s40538-025-00833-8","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
With the widespread application of high-resolution low-altitude remote sensing technology in agricultural monitoring, fine semantic segmentation of crop plots has become a hot research topic. However, due to the high similarity of land features in large-scale agricultural scenes and the presence of complex situations such as blurred land boundaries, achieving accurate semantic segmentation still faces significant challenges. In response to the above issues, this study proposes a hybrid architecture that combines Convolutional Neural Networks and Transformers, aiming to improve the segmentation accuracy of crop plots in complex scenes, especially in handling areas with fuzzy boundaries and similar features. This method innovatively constructs a global local attention mechanism (GPM-Attention), which generates adaptive attention regions through multi-scale convolution operations, significantly enhancing the model's ability to capture global contextual information. This mechanism not only effectively improves the overall segmentation performance, but also significantly reduces computational redundancy and model complexity by optimizing the computation path. In addition, this study constructed a lightweight edge enhancement module (EEI) as an encoder, which not only expands the local receptive field but also enhances the recognition ability of fine-grained features, effectively solving the problem of crop plot edge blurring. To further optimize the feature fusion effect, this study designed a Feature Adaptive Fusion Module (FAM), which efficiently integrates the multi-level features generated by CNN and Transformer encoders, significantly reducing the semantic information loss of small target features. The experimental results demonstrate that the proposed method achieves a significant performance improvement on the publicly available barley remote sensing dataset, attaining a mean Intersection over Union (mIoU) of 80.39%, which represents an 11.33% increase over state-of-the-art approaches. In addition, the method achieves a 14.2% improvement in F1-score, further confirming its effectiveness. Compared to existing techniques, this study presents a more favorable trade-off among segmentation accuracy, computational efficiency, and model complexity, thereby offering reliable technical support for the practical deployment of low-altitude remote sensing imagery in agricultural monitoring applications.
期刊介绍:
Chemical and Biological Technologies in Agriculture is an international, interdisciplinary, peer-reviewed forum for the advancement and application to all fields of agriculture of modern chemical, biochemical and molecular technologies. The scope of this journal includes chemical and biochemical processes aimed to increase sustainable agricultural and food production, the evaluation of quality and origin of raw primary products and their transformation into foods and chemicals, as well as environmental monitoring and remediation. Of special interest are the effects of chemical and biochemical technologies, also at the nano and supramolecular scale, on the relationships between soil, plants, microorganisms and their environment, with the help of modern bioinformatics. Another special focus is the use of modern bioorganic and biological chemistry to develop new technologies for plant nutrition and bio-stimulation, advancement of biorefineries from biomasses, safe and traceable food products, carbon storage in soil and plants and restoration of contaminated soils to agriculture.
This journal presents the first opportunity to bring together researchers from a wide number of disciplines within the agricultural chemical and biological sciences, from both industry and academia. The principle aim of Chemical and Biological Technologies in Agriculture is to allow the exchange of the most advanced chemical and biochemical knowledge to develop technologies which address one of the most pressing challenges of our times - sustaining a growing world population.
Chemical and Biological Technologies in Agriculture publishes original research articles, short letters and invited reviews. Articles from scientists in industry, academia as well as private research institutes, non-governmental and environmental organizations are encouraged.