{"title":"光谱 U-网络:通过光谱分解增强医学图像分割功能","authors":"Yaopeng Peng, Milan Sonka, Danny Z. Chen","doi":"arxiv-2409.09216","DOIUrl":null,"url":null,"abstract":"This paper introduces Spectral U-Net, a novel deep learning network based on\nspectral decomposition, by exploiting Dual Tree Complex Wavelet Transform\n(DTCWT) for down-sampling and inverse Dual Tree Complex Wavelet Transform\n(iDTCWT) for up-sampling. We devise the corresponding Wave-Block and\niWave-Block, integrated into the U-Net architecture, aiming at mitigating\ninformation loss during down-sampling and enhancing detail reconstruction\nduring up-sampling. In the encoder, we first decompose the feature map into\nhigh and low-frequency components using DTCWT, enabling down-sampling while\nmitigating information loss. In the decoder, we utilize iDTCWT to reconstruct\nhigher-resolution feature maps from down-sampled features. Evaluations on the\nRetina Fluid, Brain Tumor, and Liver Tumor segmentation datasets with the\nnnU-Net framework demonstrate the superiority of the proposed Spectral U-Net.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":"6 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Spectral U-Net: Enhancing Medical Image Segmentation via Spectral Decomposition\",\"authors\":\"Yaopeng Peng, Milan Sonka, Danny Z. Chen\",\"doi\":\"arxiv-2409.09216\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper introduces Spectral U-Net, a novel deep learning network based on\\nspectral decomposition, by exploiting Dual Tree Complex Wavelet Transform\\n(DTCWT) for down-sampling and inverse Dual Tree Complex Wavelet Transform\\n(iDTCWT) for up-sampling. We devise the corresponding Wave-Block and\\niWave-Block, integrated into the U-Net architecture, aiming at mitigating\\ninformation loss during down-sampling and enhancing detail reconstruction\\nduring up-sampling. In the encoder, we first decompose the feature map into\\nhigh and low-frequency components using DTCWT, enabling down-sampling while\\nmitigating information loss. In the decoder, we utilize iDTCWT to reconstruct\\nhigher-resolution feature maps from down-sampled features. Evaluations on the\\nRetina Fluid, Brain Tumor, and Liver Tumor segmentation datasets with the\\nnnU-Net framework demonstrate the superiority of the proposed Spectral U-Net.\",\"PeriodicalId\":501289,\"journal\":{\"name\":\"arXiv - EE - Image and Video Processing\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Image and Video Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.09216\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09216","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Spectral U-Net: Enhancing Medical Image Segmentation via Spectral Decomposition
This paper introduces Spectral U-Net, a novel deep learning network based on
spectral decomposition, by exploiting Dual Tree Complex Wavelet Transform
(DTCWT) for down-sampling and inverse Dual Tree Complex Wavelet Transform
(iDTCWT) for up-sampling. We devise the corresponding Wave-Block and
iWave-Block, integrated into the U-Net architecture, aiming at mitigating
information loss during down-sampling and enhancing detail reconstruction
during up-sampling. In the encoder, we first decompose the feature map into
high and low-frequency components using DTCWT, enabling down-sampling while
mitigating information loss. In the decoder, we utilize iDTCWT to reconstruct
higher-resolution feature maps from down-sampled features. Evaluations on the
Retina Fluid, Brain Tumor, and Liver Tumor segmentation datasets with the
nnU-Net framework demonstrate the superiority of the proposed Spectral U-Net.