Multi-scale feature fusion-based semantic segmentation network for agricultural remote sensing images

IF 5.2 2区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Guoxun Zheng, Zhengang Jiang, Xiaoxian Zhang, Donghui Jiang
{"title":"Multi-scale feature fusion-based semantic segmentation network for agricultural remote sensing images","authors":"Guoxun Zheng,&nbsp;Zhengang Jiang,&nbsp;Xiaoxian Zhang,&nbsp;Donghui Jiang","doi":"10.1186/s40538-025-00833-8","DOIUrl":null,"url":null,"abstract":"<div><p>With the widespread application of high-resolution low-altitude remote sensing technology in agricultural monitoring, fine semantic segmentation of crop plots has become a hot research topic. However, due to the high similarity of land features in large-scale agricultural scenes and the presence of complex situations such as blurred land boundaries, achieving accurate semantic segmentation still faces significant challenges. In response to the above issues, this study proposes a hybrid architecture that combines Convolutional Neural Networks and Transformers, aiming to improve the segmentation accuracy of crop plots in complex scenes, especially in handling areas with fuzzy boundaries and similar features. This method innovatively constructs a global local attention mechanism (GPM-Attention), which generates adaptive attention regions through multi-scale convolution operations, significantly enhancing the model's ability to capture global contextual information. This mechanism not only effectively improves the overall segmentation performance, but also significantly reduces computational redundancy and model complexity by optimizing the computation path. In addition, this study constructed a lightweight edge enhancement module (EEI) as an encoder, which not only expands the local receptive field but also enhances the recognition ability of fine-grained features, effectively solving the problem of crop plot edge blurring. To further optimize the feature fusion effect, this study designed a Feature Adaptive Fusion Module (FAM), which efficiently integrates the multi-level features generated by CNN and Transformer encoders, significantly reducing the semantic information loss of small target features. The experimental results demonstrate that the proposed method achieves a significant performance improvement on the publicly available barley remote sensing dataset, attaining a mean Intersection over Union (mIoU) of 80.39%, which represents an 11.33% increase over state-of-the-art approaches. In addition, the method achieves a 14.2% improvement in F1-score, further confirming its effectiveness. Compared to existing techniques, this study presents a more favorable trade-off among segmentation accuracy, computational efficiency, and model complexity, thereby offering reliable technical support for the practical deployment of low-altitude remote sensing imagery in agricultural monitoring applications.</p><h3>Graphical Abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":512,"journal":{"name":"Chemical and Biological Technologies in Agriculture","volume":"12 1","pages":""},"PeriodicalIF":5.2000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://chembioagro.springeropen.com/counter/pdf/10.1186/s40538-025-00833-8","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical and Biological Technologies in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://link.springer.com/article/10.1186/s40538-025-00833-8","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

With the widespread application of high-resolution low-altitude remote sensing technology in agricultural monitoring, fine semantic segmentation of crop plots has become a hot research topic. However, due to the high similarity of land features in large-scale agricultural scenes and the presence of complex situations such as blurred land boundaries, achieving accurate semantic segmentation still faces significant challenges. In response to the above issues, this study proposes a hybrid architecture that combines Convolutional Neural Networks and Transformers, aiming to improve the segmentation accuracy of crop plots in complex scenes, especially in handling areas with fuzzy boundaries and similar features. This method innovatively constructs a global local attention mechanism (GPM-Attention), which generates adaptive attention regions through multi-scale convolution operations, significantly enhancing the model's ability to capture global contextual information. This mechanism not only effectively improves the overall segmentation performance, but also significantly reduces computational redundancy and model complexity by optimizing the computation path. In addition, this study constructed a lightweight edge enhancement module (EEI) as an encoder, which not only expands the local receptive field but also enhances the recognition ability of fine-grained features, effectively solving the problem of crop plot edge blurring. To further optimize the feature fusion effect, this study designed a Feature Adaptive Fusion Module (FAM), which efficiently integrates the multi-level features generated by CNN and Transformer encoders, significantly reducing the semantic information loss of small target features. The experimental results demonstrate that the proposed method achieves a significant performance improvement on the publicly available barley remote sensing dataset, attaining a mean Intersection over Union (mIoU) of 80.39%, which represents an 11.33% increase over state-of-the-art approaches. In addition, the method achieves a 14.2% improvement in F1-score, further confirming its effectiveness. Compared to existing techniques, this study presents a more favorable trade-off among segmentation accuracy, computational efficiency, and model complexity, thereby offering reliable technical support for the practical deployment of low-altitude remote sensing imagery in agricultural monitoring applications.

Graphical Abstract

基于多尺度特征融合的农业遥感图像语义分割网络
随着高分辨率低空遥感技术在农业监测中的广泛应用,农作物地块的精细语义分割已成为一个研究热点。然而,由于大规模农业场景中土地特征的高度相似性以及土地边界模糊等复杂情况的存在,实现准确的语义分割仍然面临着重大挑战。针对上述问题,本研究提出了一种将卷积神经网络与transformer相结合的混合架构,旨在提高复杂场景下,特别是处理边界模糊、特征相似区域的作物地块分割精度。该方法创新性地构建了全局局部注意机制(global local attention mechanism, GPM-Attention),通过多尺度卷积运算生成自适应注意区域,显著增强了模型捕获全局上下文信息的能力。该机制不仅有效提高了整体分割性能,而且通过优化计算路径,显著降低了计算冗余和模型复杂度。此外,本研究构建了轻量级边缘增强模块(EEI)作为编码器,既扩展了局部接受野,又增强了对细粒度特征的识别能力,有效解决了作物地块边缘模糊问题。为了进一步优化特征融合效果,本研究设计了feature Adaptive fusion Module (FAM),将CNN和Transformer编码器生成的多层次特征高效集成,显著降低了小目标特征的语义信息损失。实验结果表明,该方法在公开可用的大麦遥感数据集上取得了显着的性能改进,实现了80.39%的平均交叉交叉(mIoU),比现有方法提高了11.33%。此外,该方法的f1评分提高了14.2%,进一步证实了其有效性。与现有技术相比,本研究在分割精度、计算效率和模型复杂性之间取得了更好的平衡,为低空遥感影像在农业监测应用中的实际部署提供了可靠的技术支持。图形抽象
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Chemical and Biological Technologies in Agriculture
Chemical and Biological Technologies in Agriculture Biochemistry, Genetics and Molecular Biology-Biotechnology
CiteScore
6.80
自引率
3.00%
发文量
83
审稿时长
15 weeks
期刊介绍: Chemical and Biological Technologies in Agriculture is an international, interdisciplinary, peer-reviewed forum for the advancement and application to all fields of agriculture of modern chemical, biochemical and molecular technologies. The scope of this journal includes chemical and biochemical processes aimed to increase sustainable agricultural and food production, the evaluation of quality and origin of raw primary products and their transformation into foods and chemicals, as well as environmental monitoring and remediation. Of special interest are the effects of chemical and biochemical technologies, also at the nano and supramolecular scale, on the relationships between soil, plants, microorganisms and their environment, with the help of modern bioinformatics. Another special focus is the use of modern bioorganic and biological chemistry to develop new technologies for plant nutrition and bio-stimulation, advancement of biorefineries from biomasses, safe and traceable food products, carbon storage in soil and plants and restoration of contaminated soils to agriculture. This journal presents the first opportunity to bring together researchers from a wide number of disciplines within the agricultural chemical and biological sciences, from both industry and academia. The principle aim of Chemical and Biological Technologies in Agriculture is to allow the exchange of the most advanced chemical and biochemical knowledge to develop technologies which address one of the most pressing challenges of our times - sustaining a growing world population. Chemical and Biological Technologies in Agriculture publishes original research articles, short letters and invited reviews. Articles from scientists in industry, academia as well as private research institutes, non-governmental and environmental organizations are encouraged.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信