Jiyong Zhang , Zeyan Jin , Yiqian Xia , Xihua Yuan , Yifei Wang , Ning Li , Yinhui Yu , Deguang Li
{"title":"SS-KAN: Self-supervised Kolmogorov-Arnold networks for limited data remote sensing semantic segmentation","authors":"Jiyong Zhang , Zeyan Jin , Yiqian Xia , Xihua Yuan , Yifei Wang , Ning Li , Yinhui Yu , Deguang Li","doi":"10.1016/j.neunet.2025.107881","DOIUrl":null,"url":null,"abstract":"<div><div>Self-supervised learning has emerged as a powerful approach for remote sensing image segmentation. However, its effectiveness significantly diminishes in remote sensing scenarios with extreme label scarcity (no more than 5 % of samples annotated). This limitation arises from two primary challenges: (i) insufficient exploitation of hierarchical representations in unlabeled data and (ii) irreversible information loss that occurs when adapting features from pretraining to downstream tasks. To tackle these challenges, this study proposes SS-KAN, an enhanced self-supervised learning framework based on the Kolmogorov-Arnold Network (KAN). The framework includes two key innovations: First, it features a depthwise KAN module, which combines depthwise separable convolutions with dilation rates and learnable activation functions within a multi-scale branch structure. This design enables the creation of context-aware feature representations from unlabeled images. Second, the framework develops a dual-branch adaptation strategy during the fine-tuning phase. This strategy utilizes a bifurcated structure that maintains spatial semantics through standard convolution while incorporating KAN-driven decomposable nonlinearity into residual identity mappings. As a result, it effectively enhances the hierarchical feature representation and reduces feature degradation during domain transfer, particularly in scenarios with limited data labels. Extensive experiments conducted on three benchmarks demonstrate that SS-KAN outperforms other state-of-the-art methods, even when trained with only 1 % labeled data. Ablation studies further confirm the importance of both the depthwise KAN and dual-branch adaptation modules. Our findings suggest that integrating KAN adaptive nonlinearity through learnable activation functions with depthwise convolutional operations and identity mappings opens new possibilities for data-efficient remote sensing image segmentation. Our code will be available from <span><span>https://github.com/zhangjy2008327/SSKAN</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"192 ","pages":"Article 107881"},"PeriodicalIF":6.3000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025007610","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Self-supervised learning has emerged as a powerful approach for remote sensing image segmentation. However, its effectiveness significantly diminishes in remote sensing scenarios with extreme label scarcity (no more than 5 % of samples annotated). This limitation arises from two primary challenges: (i) insufficient exploitation of hierarchical representations in unlabeled data and (ii) irreversible information loss that occurs when adapting features from pretraining to downstream tasks. To tackle these challenges, this study proposes SS-KAN, an enhanced self-supervised learning framework based on the Kolmogorov-Arnold Network (KAN). The framework includes two key innovations: First, it features a depthwise KAN module, which combines depthwise separable convolutions with dilation rates and learnable activation functions within a multi-scale branch structure. This design enables the creation of context-aware feature representations from unlabeled images. Second, the framework develops a dual-branch adaptation strategy during the fine-tuning phase. This strategy utilizes a bifurcated structure that maintains spatial semantics through standard convolution while incorporating KAN-driven decomposable nonlinearity into residual identity mappings. As a result, it effectively enhances the hierarchical feature representation and reduces feature degradation during domain transfer, particularly in scenarios with limited data labels. Extensive experiments conducted on three benchmarks demonstrate that SS-KAN outperforms other state-of-the-art methods, even when trained with only 1 % labeled data. Ablation studies further confirm the importance of both the depthwise KAN and dual-branch adaptation modules. Our findings suggest that integrating KAN adaptive nonlinearity through learnable activation functions with depthwise convolutional operations and identity mappings opens new possibilities for data-efficient remote sensing image segmentation. Our code will be available from https://github.com/zhangjy2008327/SSKAN.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.