{"title":"MM-UKAN++: A Novel Kolmogorov–Arnold Network-Based U-Shaped Network for Ultrasound Image Segmentation","authors":"Boheng Zhang;Haorui Huang;Yi Shen;Mingjian Sun","doi":"10.1109/TUFFC.2025.3539262","DOIUrl":null,"url":null,"abstract":"Ultrasound (US) imaging is an important and commonly used medical imaging modality. Accurate and fast automatic segmentation of regions of interest (ROIs) in US images is essential for enhancing the efficiency of clinical and robot-assisted diagnosis. However, US images suffer from low contrast, fuzzy boundaries, and significant scale variations in ROIs. Existing convolutional neural network (CNN)-based and transformer-based methods struggle with model efficiency and explainability. To address these challenges, we introduce MM-UKAN++, a novel U-shaped network based on Kolmogorov-Arnold networks (KANs). MM-UKAN++ leverages multilevel KAN layers as the encoder and decoder within the U-network architecture and incorporates an innovative multidimensional attention mechanism to refine skip connections by weighting features from frequency-channel and spatial perspectives. In addition, the network effectively integrates multiscale information, fusing outputs from different scale decoders to generate precise segmentation predictions. MM-UKAN++ achieves higher segmentation accuracy with lower computational cost and outperforms other mainstream methods on several open-source datasets for US image segmentation tasks, including achieving 69.42% IoU, 81.30% Dice, and 3.31 mm HD in the BUSI dataset with 3.17 G floating point of operations (FLOPs) and 9.90 M parameters. The excellent performance on our automatic carotid artery US scanning and diagnostic system further proves the speed and accuracy of MM-UKAN++. Besides, the good performance in other medical image segmentation tasks reveals the promising applications of MM-UKAN++. The code is available on GitHub.","PeriodicalId":13322,"journal":{"name":"IEEE transactions on ultrasonics, ferroelectrics, and frequency control","volume":"72 4","pages":"498-514"},"PeriodicalIF":3.0000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on ultrasonics, ferroelectrics, and frequency control","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10873001/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
摘要
超声波(US)成像是一种重要且常用的医学成像模式。准确、快速地自动分割 US 图像中的感兴趣区(ROI)对于提高临床诊断和机器人辅助诊断的效率至关重要。然而,US 图像存在对比度低、边界模糊和 ROI 尺度变化大等问题。现有的基于卷积神经网络(CNN)和变形器的方法在模型效率和可解释性方面存在困难。为了应对这些挑战,我们引入了 MM-UKAN++,这是一种基于 Kolmogorov-Arnold 网络(KANs)的新型 U 型网络。MM-UKAN++ 利用多级 KAN 层作为 U 型网络架构中的编码器和解码器,并结合创新的多维关注机制,通过从频率通道和空间角度对特征进行加权来完善跳过连接。此外,该网络还能有效整合多尺度信息,融合不同尺度解码器的输出,生成精确的分割预测。MM-UKAN++ 以更低的计算成本实现了更高的分割精度,并在多个开源数据集的美国图像分割任务中表现优于其他主流方法,包括在 BUSI 数据集中以 3.17G Flops 和 9.90M Parameters 实现了 69.42% IoU、81.30% Dice 和 3.31mm HD。在我们的自动颈动脉 US 扫描和诊断系统上的出色表现进一步证明了 MM-UKAN++ 的速度和准确性。此外,MM-UKAN++ 在其他医学图像分割任务中的良好表现也揭示了其广阔的应用前景。代码可在 GitHub 上获取。
MM-UKAN++: A Novel Kolmogorov–Arnold Network-Based U-Shaped Network for Ultrasound Image Segmentation
Ultrasound (US) imaging is an important and commonly used medical imaging modality. Accurate and fast automatic segmentation of regions of interest (ROIs) in US images is essential for enhancing the efficiency of clinical and robot-assisted diagnosis. However, US images suffer from low contrast, fuzzy boundaries, and significant scale variations in ROIs. Existing convolutional neural network (CNN)-based and transformer-based methods struggle with model efficiency and explainability. To address these challenges, we introduce MM-UKAN++, a novel U-shaped network based on Kolmogorov-Arnold networks (KANs). MM-UKAN++ leverages multilevel KAN layers as the encoder and decoder within the U-network architecture and incorporates an innovative multidimensional attention mechanism to refine skip connections by weighting features from frequency-channel and spatial perspectives. In addition, the network effectively integrates multiscale information, fusing outputs from different scale decoders to generate precise segmentation predictions. MM-UKAN++ achieves higher segmentation accuracy with lower computational cost and outperforms other mainstream methods on several open-source datasets for US image segmentation tasks, including achieving 69.42% IoU, 81.30% Dice, and 3.31 mm HD in the BUSI dataset with 3.17 G floating point of operations (FLOPs) and 9.90 M parameters. The excellent performance on our automatic carotid artery US scanning and diagnostic system further proves the speed and accuracy of MM-UKAN++. Besides, the good performance in other medical image segmentation tasks reveals the promising applications of MM-UKAN++. The code is available on GitHub.
期刊介绍:
IEEE Transactions on Ultrasonics, Ferroelectrics and Frequency Control includes the theory, technology, materials, and applications relating to: (1) the generation, transmission, and detection of ultrasonic waves and related phenomena; (2) medical ultrasound, including hyperthermia, bioeffects, tissue characterization and imaging; (3) ferroelectric, piezoelectric, and piezomagnetic materials, including crystals, polycrystalline solids, films, polymers, and composites; (4) frequency control, timing and time distribution, including crystal oscillators and other means of classical frequency control, and atomic, molecular and laser frequency control standards. Areas of interest range from fundamental studies to the design and/or applications of devices and systems.