{"title":"U-ONet:基于倍频卷积和 U 型深度神经网络协调注意力的遥感图像语义标注","authors":"Qiongqiong Hu, Feiting Wang, Yuechao Wu, Ying Li","doi":"10.1049/ell2.70014","DOIUrl":null,"url":null,"abstract":"<p>Semantic labelling of remote sensing images is crucial for various remote sensing applications. However, the dense distribution of man-made and natural objects with similar colours and geographic proximity poses challenges for achieving consistent and accurate labelling results. To address this issue, a novel deep learning model incorporating an octave convolutional neural networks (CNNs) within an end-to-end U-shaped architecture is presented. The approach differs from conventional CNNs in that it employs octave convolutions instead of standard convolutions. This strategy serves to minimize low-frequency information redundancy while maintaining segmentation accuracy. Furthermore, coordination attention is introduced in the encoder module to enhance the network's ability to extract useful features, focusing on spatial and channel dependencies within the feature maps. This attention mechanism enables the network to better capture channel, direction, and location information. In conclusion, the U-shaped network is engineered with a completely symmetric structure that employs skip connections to merge low-resolution information, used for object class recognition, with high-resolution information to enable precise localization. This configuration ultimately improves segmentation accuracy. Experimental results on two public datasets demonstrate that our U-ONet achieves state-of-the-art performance, making it a compelling choice for remote sensing image semantic labelling applications.</p>","PeriodicalId":11556,"journal":{"name":"Electronics Letters","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/ell2.70014","citationCount":"0","resultStr":"{\"title\":\"U-ONet: Remote sensing image semantic labelling based on octave convolution and coordination attention in U-shape deep neural network\",\"authors\":\"Qiongqiong Hu, Feiting Wang, Yuechao Wu, Ying Li\",\"doi\":\"10.1049/ell2.70014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Semantic labelling of remote sensing images is crucial for various remote sensing applications. However, the dense distribution of man-made and natural objects with similar colours and geographic proximity poses challenges for achieving consistent and accurate labelling results. To address this issue, a novel deep learning model incorporating an octave convolutional neural networks (CNNs) within an end-to-end U-shaped architecture is presented. The approach differs from conventional CNNs in that it employs octave convolutions instead of standard convolutions. This strategy serves to minimize low-frequency information redundancy while maintaining segmentation accuracy. Furthermore, coordination attention is introduced in the encoder module to enhance the network's ability to extract useful features, focusing on spatial and channel dependencies within the feature maps. This attention mechanism enables the network to better capture channel, direction, and location information. In conclusion, the U-shaped network is engineered with a completely symmetric structure that employs skip connections to merge low-resolution information, used for object class recognition, with high-resolution information to enable precise localization. This configuration ultimately improves segmentation accuracy. Experimental results on two public datasets demonstrate that our U-ONet achieves state-of-the-art performance, making it a compelling choice for remote sensing image semantic labelling applications.</p>\",\"PeriodicalId\":11556,\"journal\":{\"name\":\"Electronics Letters\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2024-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/ell2.70014\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Electronics Letters\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/ell2.70014\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronics Letters","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ell2.70014","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
遥感图像的语义标注对各种遥感应用至关重要。然而,具有相似颜色和地理邻近性的人造和自然物体的密集分布给实现一致、准确的标注结果带来了挑战。为解决这一问题,本文提出了一种新型深度学习模型,在端到端 U 型架构中集成了八度卷积神经网络(CNN)。这种方法与传统的 CNN 不同,它采用了倍频卷积而不是标准卷积。这种策略可在保持分割准确性的同时,最大限度地减少低频信息冗余。此外,在编码器模块中引入了协调注意力,以增强网络提取有用特征的能力,重点关注特征图中的空间和通道依赖关系。这种注意力机制使网络能够更好地捕捉信道、方向和位置信息。总之,U 型网络采用完全对称的结构,利用跳转连接将用于物体类别识别的低分辨率信息与高分辨率信息合并,从而实现精确定位。这种配置最终提高了分割精度。在两个公共数据集上的实验结果表明,我们的 U-ONet 实现了最先进的性能,使其成为遥感图像语义标注应用的一个令人信服的选择。
U-ONet: Remote sensing image semantic labelling based on octave convolution and coordination attention in U-shape deep neural network
Semantic labelling of remote sensing images is crucial for various remote sensing applications. However, the dense distribution of man-made and natural objects with similar colours and geographic proximity poses challenges for achieving consistent and accurate labelling results. To address this issue, a novel deep learning model incorporating an octave convolutional neural networks (CNNs) within an end-to-end U-shaped architecture is presented. The approach differs from conventional CNNs in that it employs octave convolutions instead of standard convolutions. This strategy serves to minimize low-frequency information redundancy while maintaining segmentation accuracy. Furthermore, coordination attention is introduced in the encoder module to enhance the network's ability to extract useful features, focusing on spatial and channel dependencies within the feature maps. This attention mechanism enables the network to better capture channel, direction, and location information. In conclusion, the U-shaped network is engineered with a completely symmetric structure that employs skip connections to merge low-resolution information, used for object class recognition, with high-resolution information to enable precise localization. This configuration ultimately improves segmentation accuracy. Experimental results on two public datasets demonstrate that our U-ONet achieves state-of-the-art performance, making it a compelling choice for remote sensing image semantic labelling applications.
期刊介绍:
Electronics Letters is an internationally renowned peer-reviewed rapid-communication journal that publishes short original research papers every two weeks. Its broad and interdisciplinary scope covers the latest developments in all electronic engineering related fields including communication, biomedical, optical and device technologies. Electronics Letters also provides further insight into some of the latest developments through special features and interviews.
Scope
As a journal at the forefront of its field, Electronics Letters publishes papers covering all themes of electronic and electrical engineering. The major themes of the journal are listed below.
Antennas and Propagation
Biomedical and Bioinspired Technologies, Signal Processing and Applications
Control Engineering
Electromagnetism: Theory, Materials and Devices
Electronic Circuits and Systems
Image, Video and Vision Processing and Applications
Information, Computing and Communications
Instrumentation and Measurement
Microwave Technology
Optical Communications
Photonics and Opto-Electronics
Power Electronics, Energy and Sustainability
Radar, Sonar and Navigation
Semiconductor Technology
Signal Processing
MIMO