Deguang Li , Zeyan Jin , Chengyue Guan , Liubing Ji , Yudong Zhang , Zhaozhao Xu , Jiyong Zhang
{"title":"KACNet:利用Kolmogorov-Arnold网络增强CNN特征表示,用于医学图像分割和分类","authors":"Deguang Li , Zeyan Jin , Chengyue Guan , Liubing Ji , Yudong Zhang , Zhaozhao Xu , Jiyong Zhang","doi":"10.1016/j.ins.2025.122760","DOIUrl":null,"url":null,"abstract":"<div><div>Convolutional neural networks (CNNs) often face challenges in generalizing across datasets due to edge-blurring limitations of fixed activation functions. To address this, this paper introduces a novel approach by integrating KANs into CNNs, forming a module called KANConv, to build an innovative encoder architecture. The encoder is designed to augment the CNN’s capability to effectively distinguish targets in medical image segmentation and classification tasks. Specifically, the CNN component leverages its nonlinear structure to extract hidden local features from the input image, while the KAN component captures global feature information by processing the local features via a sophisticated structure composed of B-spline functions. The proposed multi-stage architecture enables a comprehensive information fusion of local and global features, significantly improving feature representation quality. To validate our encoder’s effectiveness, we perform extensive experiments on: (1) five benchmark 2D medical image segmentation datasets, where our model achieves state-of-the-art performance; and (2) six prominent 3D medical classification datasets, consistently demonstrating superior results compared to existing methods.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"726 ","pages":"Article 122760"},"PeriodicalIF":6.8000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"KACNet: Enhancing CNN feature representation with Kolmogorov-Arnold networks for medical image segmentation and classification\",\"authors\":\"Deguang Li , Zeyan Jin , Chengyue Guan , Liubing Ji , Yudong Zhang , Zhaozhao Xu , Jiyong Zhang\",\"doi\":\"10.1016/j.ins.2025.122760\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Convolutional neural networks (CNNs) often face challenges in generalizing across datasets due to edge-blurring limitations of fixed activation functions. To address this, this paper introduces a novel approach by integrating KANs into CNNs, forming a module called KANConv, to build an innovative encoder architecture. The encoder is designed to augment the CNN’s capability to effectively distinguish targets in medical image segmentation and classification tasks. Specifically, the CNN component leverages its nonlinear structure to extract hidden local features from the input image, while the KAN component captures global feature information by processing the local features via a sophisticated structure composed of B-spline functions. The proposed multi-stage architecture enables a comprehensive information fusion of local and global features, significantly improving feature representation quality. To validate our encoder’s effectiveness, we perform extensive experiments on: (1) five benchmark 2D medical image segmentation datasets, where our model achieves state-of-the-art performance; and (2) six prominent 3D medical classification datasets, consistently demonstrating superior results compared to existing methods.</div></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":\"726 \",\"pages\":\"Article 122760\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0020025525008965\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525008965","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
KACNet: Enhancing CNN feature representation with Kolmogorov-Arnold networks for medical image segmentation and classification
Convolutional neural networks (CNNs) often face challenges in generalizing across datasets due to edge-blurring limitations of fixed activation functions. To address this, this paper introduces a novel approach by integrating KANs into CNNs, forming a module called KANConv, to build an innovative encoder architecture. The encoder is designed to augment the CNN’s capability to effectively distinguish targets in medical image segmentation and classification tasks. Specifically, the CNN component leverages its nonlinear structure to extract hidden local features from the input image, while the KAN component captures global feature information by processing the local features via a sophisticated structure composed of B-spline functions. The proposed multi-stage architecture enables a comprehensive information fusion of local and global features, significantly improving feature representation quality. To validate our encoder’s effectiveness, we perform extensive experiments on: (1) five benchmark 2D medical image segmentation datasets, where our model achieves state-of-the-art performance; and (2) six prominent 3D medical classification datasets, consistently demonstrating superior results compared to existing methods.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.