Cross-Modal Learning via Adversarial Loss and Covariate Shift for Enhanced Liver Segmentation

Savas Ozkan; M. Alper Selver; Bora Baydar; Ali Emre Kavur; Cemre Candemir; Gozde Bozdagi Akar

IEEE Transactions on Emerging Topics in Computational Intelligence, vol. 8, no. 4, pp. 2723-2735, published 2024-03-08. DOI: 10.1109/TETCI.2024.3369868. https://ieeexplore.ieee.org/document/10463530/
Citations: 0
Abstract
Despite the widespread use of deep learning methods for semantic segmentation of single imaging modalities, their performance on multi-domain data still falls short. Yet the decision-making process in radiology is often guided by data from multiple sources, such as the pre-operative evaluation of living donors for liver transplantation. In such cases, the cross-modality performance of deep models becomes more important. Unfortunately, the domain dependency of existing techniques limits their clinical acceptability, confining their performance largely to individual domains. We formulate this issue as a multi-source domain adaptation problem, an emerging field driven mainly by the diverse pattern characteristics exhibited by cross-modality data. This paper presents a novel method that learns robust representations from unpaired cross-modal (CT-MR) data by encapsulating both distinct and shared patterns from multiple modalities. In our solution, the covariate shift property is maintained through structural modifications to the architecture, and an adversarial loss is adopted to boost the representation capacity. As a result, sparse yet rich representations are obtained. A further advantage of our model is that no modality information is needed at either the training or the inference phase. Tests on unpaired CT and MR liver data from the cross-modality task of the CHAOS grand challenge demonstrate that our approach achieves state-of-the-art results by a large margin, in both individual metrics and overall scores.
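The abstract does not spell out the architecture, but the core idea it describes (a shared representation trained with a segmentation objective plus an adversarial loss that suppresses modality cues, so no modality label is needed at inference) can be sketched. The following is a minimal, hypothetical PyTorch sketch of that general technique; the module names (Encoder, SegHead, ModalityDiscriminator), the gradient-reversal trick, and all shapes and hyperparameters are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch: adversarial modality-invariant features for liver
# segmentation on unpaired CT/MR slices. Not the paper's architecture.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GradReverse(torch.autograd.Function):
    """Identity in the forward pass; flips gradients in the backward pass
    so the encoder learns to *confuse* the modality discriminator."""
    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)
    @staticmethod
    def backward(ctx, grad):
        return -ctx.lam * grad, None

class Encoder(nn.Module):
    """Toy stand-in for a shared encoder over both modalities."""
    def __init__(self, ch=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU())
    def forward(self, x):
        return self.net(x)

class SegHead(nn.Module):
    """Per-pixel classifier on top of the shared features."""
    def __init__(self, ch=16, n_classes=2):
        super().__init__()
        self.out = nn.Conv2d(ch, n_classes, 1)
    def forward(self, f):
        return self.out(f)

class ModalityDiscriminator(nn.Module):
    """Predicts CT vs. MR from features; trained adversarially."""
    def __init__(self, ch=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(ch, ch, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(ch, 2))
    def forward(self, f, lam=1.0):
        return self.net(GradReverse.apply(f, lam))

# One joint training step on an unpaired, mixed-modality batch.
enc, seg, disc = Encoder(), SegHead(), ModalityDiscriminator()
opt = torch.optim.Adam(list(enc.parameters()) + list(seg.parameters())
                       + list(disc.parameters()), lr=1e-4)

img = torch.randn(4, 1, 64, 64)           # mixed CT/MR slices (dummy data)
mask = torch.randint(0, 2, (4, 64, 64))   # liver masks
modality = torch.tensor([0, 0, 1, 1])     # 0 = CT, 1 = MR (training only)

opt.zero_grad()
feats = enc(img)
seg_loss = F.cross_entropy(seg(feats), mask)
adv_loss = F.cross_entropy(disc(feats, lam=0.1), modality)
(seg_loss + adv_loss).backward()
opt.step()
```

Note that the modality label appears only in the adversarial term of the training step; at inference the encoder and segmentation head run on any input slice without knowing whether it is CT or MR, mirroring the modality-agnostic property claimed in the abstract.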
About the journal:
The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys.
TETCI is an electronic-only publication and publishes six issues per year.
Authors are encouraged to submit manuscripts on any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. Illustrative examples include glial cell networks, computational neuroscience, brain-computer interfaces, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, and computational intelligence for IoT and Smart-X technologies.