Self-supervised learning via inter-modal reconstruction and feature projection networks for label-efficient 3D-to-2D segmentation

Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention Pub Date : 2023-07-06 DOI:10.48550/arXiv.2307.03008

José Morano, Guilherme Aresta, D. Lachinov, Julia Mai, U. Schmidt-Erfurth, Hrvoje Bogunovi'c

{"title":"Self-supervised learning via inter-modal reconstruction and feature projection networks for label-efficient 3D-to-2D segmentation","authors":"José Morano, Guilherme Aresta, D. Lachinov, Julia Mai, U. Schmidt-Erfurth, Hrvoje Bogunovi'c","doi":"10.48550/arXiv.2307.03008","DOIUrl":null,"url":null,"abstract":"Deep learning has become a valuable tool for the automation of certain medical image segmentation tasks, significantly relieving the workload of medical specialists. Some of these tasks require segmentation to be performed on a subset of the input dimensions, the most common case being 3D-to-2D. However, the performance of existing methods is strongly conditioned by the amount of labeled data available, as there is currently no data efficient method, e.g. transfer learning, that has been validated on these tasks. In this work, we propose a novel convolutional neural network (CNN) and self-supervised learning (SSL) method for label-efficient 3D-to-2D segmentation. The CNN is composed of a 3D encoder and a 2D decoder connected by novel 3D-to-2D blocks. The SSL method consists of reconstructing image pairs of modalities with different dimensionality. The approach has been validated in two tasks with clinical relevance: the en-face segmentation of geographic atrophy and reticular pseudodrusen in optical coherence tomography. Results on different datasets demonstrate that the proposed CNN significantly improves the state of the art in scenarios with limited labeled data by up to 8% in Dice score. Moreover, the proposed SSL method allows further improvement of this performance by up to 23%, and we show that the SSL is beneficial regardless of the network architecture.","PeriodicalId":18289,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"117 1","pages":"589-599"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2307.03008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning has become a valuable tool for the automation of certain medical image segmentation tasks, significantly relieving the workload of medical specialists. Some of these tasks require segmentation to be performed on a subset of the input dimensions, the most common case being 3D-to-2D. However, the performance of existing methods is strongly conditioned by the amount of labeled data available, as there is currently no data efficient method, e.g. transfer learning, that has been validated on these tasks. In this work, we propose a novel convolutional neural network (CNN) and self-supervised learning (SSL) method for label-efficient 3D-to-2D segmentation. The CNN is composed of a 3D encoder and a 2D decoder connected by novel 3D-to-2D blocks. The SSL method consists of reconstructing image pairs of modalities with different dimensionality. The approach has been validated in two tasks with clinical relevance: the en-face segmentation of geographic atrophy and reticular pseudodrusen in optical coherence tomography. Results on different datasets demonstrate that the proposed CNN significantly improves the state of the art in scenarios with limited labeled data by up to 8% in Dice score. Moreover, the proposed SSL method allows further improvement of this performance by up to 23%, and we show that the SSL is beneficial regardless of the network architecture.

查看原文本刊更多论文

基于多模态重构和特征投影网络的自监督学习，用于标签高效的3d到2d分割

深度学习已成为某些医学图像分割任务自动化的重要工具，大大减轻了医学专家的工作量。其中一些任务需要在输入维度的子集上执行分割，最常见的情况是3d到2d。然而，现有方法的性能在很大程度上取决于可用标记数据的数量，因为目前还没有数据高效的方法，例如迁移学习，已经在这些任务上得到了验证。在这项工作中，我们提出了一种新颖的卷积神经网络(CNN)和自监督学习(SSL)方法，用于标签高效的3d到2d分割。CNN由一个3D编码器和一个2D解码器组成，通过新颖的3D到2D块连接。SSL方法包括重建不同维数的模态图像对。该方法已在两个具有临床意义的任务中得到验证:光学相干断层扫描中的地理萎缩和网状假性结节的正面分割。在不同数据集上的结果表明，在标记数据有限的情况下，所提出的CNN显着提高了Dice得分的8%。此外，所提出的SSL方法允许进一步提高该性能高达23%，并且我们表明SSL无论网络架构如何都是有益的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention

自引率

0.00%

发文量