{"title":"Clip-Driven Universal Model for Multi-Material Decomposition in Dual-Energy CT","authors":"Xianghong Wang;Jiajun Xiang;Aihua Mao;Jiayi Xie;Peng Jin;Mingchao Ding;Yixuan Yuan;Yanye Lu;Lequan Yu;Hongmin Cai;Baiying Lei;Tianye Niu","doi":"10.1109/TCI.2025.3531707","DOIUrl":null,"url":null,"abstract":"Dual-energy computed tomography (DECT) offers quantitative insights and facilitates material decomposition, aiding in precise diagnosis and treatment planning. However, existing methods for material decomposition, often tailored to specific material types, need more generalizability and increase computational load with each additional material. We propose a CLIP-Driven Universal Model for adaptive Multi-Material Decomposition (MMD) to tackle this challenge. This model utilizes the semantic capabilities of text embeddings from Contrastive Language-Image Pre-training (CLIP), allowing a single network to manage structured feature embedding for multiple materials. A novel Siamese encoder and differential map fusion technique have also been integrated to enhance the decomposition accuracy while maintaining robustness across various conditions. Experiments on the simulated and physical patient studies have evidenced our model's superiority over traditional methods. Notably, it has significantly improved the Dice Similarity Coefficient—4.1%. These results underscore the potential of our network in clinical MMD applications, suggesting a promising avenue for enhancing DECT imaging analysis.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"349-361"},"PeriodicalIF":4.2000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Imaging","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10845834/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Dual-energy computed tomography (DECT) offers quantitative insights and facilitates material decomposition, aiding in precise diagnosis and treatment planning. However, existing methods for material decomposition, often tailored to specific material types, need more generalizability and increase computational load with each additional material. We propose a CLIP-Driven Universal Model for adaptive Multi-Material Decomposition (MMD) to tackle this challenge. This model utilizes the semantic capabilities of text embeddings from Contrastive Language-Image Pre-training (CLIP), allowing a single network to manage structured feature embedding for multiple materials. A novel Siamese encoder and differential map fusion technique have also been integrated to enhance the decomposition accuracy while maintaining robustness across various conditions. Experiments on the simulated and physical patient studies have evidenced our model's superiority over traditional methods. Notably, it has significantly improved the Dice Similarity Coefficient—4.1%. These results underscore the potential of our network in clinical MMD applications, suggesting a promising avenue for enhancing DECT imaging analysis.
期刊介绍:
The IEEE Transactions on Computational Imaging will publish articles where computation plays an integral role in the image formation process. Papers will cover all areas of computational imaging ranging from fundamental theoretical methods to the latest innovative computational imaging system designs. Topics of interest will include advanced algorithms and mathematical techniques, model-based data inversion, methods for image and signal recovery from sparse and incomplete data, techniques for non-traditional sensing of image data, methods for dynamic information acquisition and extraction from imaging sensors, software and hardware for efficient computation in imaging systems, and highly novel imaging system design.