KACNet: Enhancing CNN feature representation with Kolmogorov-Arnold networks for medical image segmentation and classification

IF 6.8 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS
Deguang Li , Zeyan Jin , Chengyue Guan , Liubing Ji , Yudong Zhang , Zhaozhao Xu , Jiyong Zhang
{"title":"KACNet: Enhancing CNN feature representation with Kolmogorov-Arnold networks for medical image segmentation and classification","authors":"Deguang Li ,&nbsp;Zeyan Jin ,&nbsp;Chengyue Guan ,&nbsp;Liubing Ji ,&nbsp;Yudong Zhang ,&nbsp;Zhaozhao Xu ,&nbsp;Jiyong Zhang","doi":"10.1016/j.ins.2025.122760","DOIUrl":null,"url":null,"abstract":"<div><div>Convolutional neural networks (CNNs) often face challenges in generalizing across datasets due to edge-blurring limitations of fixed activation functions. To address this, this paper introduces a novel approach by integrating KANs into CNNs, forming a module called KANConv, to build an innovative encoder architecture. The encoder is designed to augment the CNN’s capability to effectively distinguish targets in medical image segmentation and classification tasks. Specifically, the CNN component leverages its nonlinear structure to extract hidden local features from the input image, while the KAN component captures global feature information by processing the local features via a sophisticated structure composed of B-spline functions. The proposed multi-stage architecture enables a comprehensive information fusion of local and global features, significantly improving feature representation quality. To validate our encoder’s effectiveness, we perform extensive experiments on: (1) five benchmark 2D medical image segmentation datasets, where our model achieves state-of-the-art performance; and (2) six prominent 3D medical classification datasets, consistently demonstrating superior results compared to existing methods.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"726 ","pages":"Article 122760"},"PeriodicalIF":6.8000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525008965","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Convolutional neural networks (CNNs) often face challenges in generalizing across datasets due to edge-blurring limitations of fixed activation functions. To address this, this paper introduces a novel approach by integrating KANs into CNNs, forming a module called KANConv, to build an innovative encoder architecture. The encoder is designed to augment the CNN’s capability to effectively distinguish targets in medical image segmentation and classification tasks. Specifically, the CNN component leverages its nonlinear structure to extract hidden local features from the input image, while the KAN component captures global feature information by processing the local features via a sophisticated structure composed of B-spline functions. The proposed multi-stage architecture enables a comprehensive information fusion of local and global features, significantly improving feature representation quality. To validate our encoder’s effectiveness, we perform extensive experiments on: (1) five benchmark 2D medical image segmentation datasets, where our model achieves state-of-the-art performance; and (2) six prominent 3D medical classification datasets, consistently demonstrating superior results compared to existing methods.
KACNet:利用Kolmogorov-Arnold网络增强CNN特征表示,用于医学图像分割和分类
由于固定激活函数的边缘模糊限制,卷积神经网络(cnn)在跨数据集泛化方面经常面临挑战。为了解决这个问题,本文引入了一种新颖的方法,将KANs集成到cnn中,形成一个称为KANConv的模块,以构建创新的编码器架构。该编码器旨在增强CNN在医学图像分割和分类任务中有效区分目标的能力。具体来说,CNN组件利用其非线性结构从输入图像中提取隐藏的局部特征,而KAN组件通过由b样条函数组成的复杂结构处理局部特征来捕获全局特征信息。提出的多阶段结构实现了局部和全局特征的全面信息融合,显著提高了特征表示质量。为了验证编码器的有效性,我们在以下方面进行了广泛的实验:(1)五个基准二维医学图像分割数据集,其中我们的模型达到了最先进的性能;(2) 6个突出的三维医学分类数据集,与现有方法相比,结果始终优于现有方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Information Sciences
Information Sciences 工程技术-计算机:信息系统
CiteScore
14.00
自引率
17.30%
发文量
1322
审稿时长
10.4 months
期刊介绍: Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信