Mingzhu Tai , Zhenqiu Shu , Songze Tang , Zhengtao Yu
{"title":"Spatial-spectral multi-order gated aggregation network with bidirectional interactive fusion for hyperspectral image classification","authors":"Mingzhu Tai , Zhenqiu Shu , Songze Tang , Zhengtao Yu","doi":"10.1016/j.neunet.2025.108152","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, convolutional neural networks (CNNs) have made significant strides in hyperspectral image classification (HSIC) tasks by contextualizing the convolutional kernels as global as possible. However, as the kernel sizes increase, encoding multi-order feature interactions becomes less efficient. Furthermore, self-attention mechanisms and convolutional operations can only handle global and local features independently, resulting in overly complex or simplified interactions. To overcome these limitations, in this work, we propose a novel HSIC framework called the Spatial-Spectral Multi-order Gated Aggregation Network with Bidirectional Interaction Fusion (SS-MoGAN). The proposed SS-MoGAN method integrates simple yet powerful convolutions and gated aggregations into a compact module, facilitating efficient feature extraction and adaptive contextual processing. Specifically, the spatial aggregation (SpaAg) and spectral aggregation (SpeAg) blocks guide the model to explicitly capture the interactions between low- and high-order features within the spatial and spectral dimensions. The bidirectional interaction fusion (BIF) blocks further integrate structural information through a bidirectional cross-attention mechanism, enhancing the representation of fine-grained details. Extensive experiments on three hyperspectral benchmark datasets demonstrate that the proposed SS-MoGAN method outperforms other state-of-the-art methods in HSIC applications. The source code for this work is available at <span><span>https://github.com/szq0816/SS-MoGAN_HSIC</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"194 ","pages":"Article 108152"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025010329","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, convolutional neural networks (CNNs) have made significant strides in hyperspectral image classification (HSIC) tasks by contextualizing the convolutional kernels as global as possible. However, as the kernel sizes increase, encoding multi-order feature interactions becomes less efficient. Furthermore, self-attention mechanisms and convolutional operations can only handle global and local features independently, resulting in overly complex or simplified interactions. To overcome these limitations, in this work, we propose a novel HSIC framework called the Spatial-Spectral Multi-order Gated Aggregation Network with Bidirectional Interaction Fusion (SS-MoGAN). The proposed SS-MoGAN method integrates simple yet powerful convolutions and gated aggregations into a compact module, facilitating efficient feature extraction and adaptive contextual processing. Specifically, the spatial aggregation (SpaAg) and spectral aggregation (SpeAg) blocks guide the model to explicitly capture the interactions between low- and high-order features within the spatial and spectral dimensions. The bidirectional interaction fusion (BIF) blocks further integrate structural information through a bidirectional cross-attention mechanism, enhancing the representation of fine-grained details. Extensive experiments on three hyperspectral benchmark datasets demonstrate that the proposed SS-MoGAN method outperforms other state-of-the-art methods in HSIC applications. The source code for this work is available at https://github.com/szq0816/SS-MoGAN_HSIC.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.