{"title":"Spectral U-Net: Enhancing Medical Image Segmentation via Spectral Decomposition","authors":"Yaopeng Peng, Milan Sonka, Danny Z. Chen","doi":"arxiv-2409.09216","DOIUrl":null,"url":null,"abstract":"This paper introduces Spectral U-Net, a novel deep learning network based on\nspectral decomposition, by exploiting Dual Tree Complex Wavelet Transform\n(DTCWT) for down-sampling and inverse Dual Tree Complex Wavelet Transform\n(iDTCWT) for up-sampling. We devise the corresponding Wave-Block and\niWave-Block, integrated into the U-Net architecture, aiming at mitigating\ninformation loss during down-sampling and enhancing detail reconstruction\nduring up-sampling. In the encoder, we first decompose the feature map into\nhigh and low-frequency components using DTCWT, enabling down-sampling while\nmitigating information loss. In the decoder, we utilize iDTCWT to reconstruct\nhigher-resolution feature maps from down-sampled features. Evaluations on the\nRetina Fluid, Brain Tumor, and Liver Tumor segmentation datasets with the\nnnU-Net framework demonstrate the superiority of the proposed Spectral U-Net.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09216","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces Spectral U-Net, a novel deep learning network based on
spectral decomposition, by exploiting Dual Tree Complex Wavelet Transform
(DTCWT) for down-sampling and inverse Dual Tree Complex Wavelet Transform
(iDTCWT) for up-sampling. We devise the corresponding Wave-Block and
iWave-Block, integrated into the U-Net architecture, aiming at mitigating
information loss during down-sampling and enhancing detail reconstruction
during up-sampling. In the encoder, we first decompose the feature map into
high and low-frequency components using DTCWT, enabling down-sampling while
mitigating information loss. In the decoder, we utilize iDTCWT to reconstruct
higher-resolution feature maps from down-sampled features. Evaluations on the
Retina Fluid, Brain Tumor, and Liver Tumor segmentation datasets with the
nnU-Net framework demonstrate the superiority of the proposed Spectral U-Net.