Nuo Chen;Yongquan Zhang;Chenchen Fan;Wei Zhao;Changmiao Wang;Hai Wang
{"title":"DiffusionClusNet: Deep Clustering-Driven Diffusion Models for Ultrasound Image Enhancement","authors":"Nuo Chen;Yongquan Zhang;Chenchen Fan;Wei Zhao;Changmiao Wang;Hai Wang","doi":"10.1109/TCE.2025.3540502","DOIUrl":null,"url":null,"abstract":"In modern medical diagnostics, high-quality ultrasound images are essential because they are cost-effective, non-invasive, and capable of providing dynamic recordings. Nevertheless, obtaining such high-quality images is challenging, especially in resource-limited areas, which negatively impacts diagnostic accuracy. To address these issues, we propose a novel method for enhancing ultrasound images using deep clustering-enhanced diffusion models. Our proposed method consists of two main components: an image enhancement pathway and an Auxiliary Classification Pathway (ACP), which are integrated through a Fusion of Image and Classification (FIC) module. The image enhancement pathway employs a structure that includes a Variational Autoencoder (VAE) encoder, a UNet denoising network, and a VAE decoder. This structure progressively reduces noise and generates high-quality images. Simultaneously, the ACP utilizes a convolutional neural network, a transformer encoder, and a clustering module to extract classification information, which supports the enhancement process. The FIC module uses a cross-attention mechanism to merge the image and classification features, thus enhancing the overall performance of image enhancement. To ensure the generated images retain their structural integrity, Structural Similarity (SSIM) loss is employed. Experiments conducted on multiple ultrasound datasets reveal that our method surpasses existing techniques in terms of peak signal-to-noise ratio and SSIM scores. Clinically, our approach significantly improves image contrast and structural detail, leading to more accurate diagnoses. This diffusion-based strategy for image enhancement and classification feature fusion introduces a fresh perspective on preserving structure and enhancing detail in medical image processing. Our Code is available at <uri>https://github.com/ichbincn/Ultrasound-Enhancement</uri>.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"71 1","pages":"1495-1503"},"PeriodicalIF":4.3000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10879338/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In modern medical diagnostics, high-quality ultrasound images are essential because they are cost-effective, non-invasive, and capable of providing dynamic recordings. Nevertheless, obtaining such high-quality images is challenging, especially in resource-limited areas, which negatively impacts diagnostic accuracy. To address these issues, we propose a novel method for enhancing ultrasound images using deep clustering-enhanced diffusion models. Our proposed method consists of two main components: an image enhancement pathway and an Auxiliary Classification Pathway (ACP), which are integrated through a Fusion of Image and Classification (FIC) module. The image enhancement pathway employs a structure that includes a Variational Autoencoder (VAE) encoder, a UNet denoising network, and a VAE decoder. This structure progressively reduces noise and generates high-quality images. Simultaneously, the ACP utilizes a convolutional neural network, a transformer encoder, and a clustering module to extract classification information, which supports the enhancement process. The FIC module uses a cross-attention mechanism to merge the image and classification features, thus enhancing the overall performance of image enhancement. To ensure the generated images retain their structural integrity, Structural Similarity (SSIM) loss is employed. Experiments conducted on multiple ultrasound datasets reveal that our method surpasses existing techniques in terms of peak signal-to-noise ratio and SSIM scores. Clinically, our approach significantly improves image contrast and structural detail, leading to more accurate diagnoses. This diffusion-based strategy for image enhancement and classification feature fusion introduces a fresh perspective on preserving structure and enhancing detail in medical image processing. Our Code is available at https://github.com/ichbincn/Ultrasound-Enhancement.
期刊介绍:
The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.