Segmentation of 2D cardiac ultrasound with deep learning: simpler models for a simple task

2022 IEEE International Ultrasonics Symposium (IUS) Pub Date : 2022-10-10 DOI:10.1109/IUS54386.2022.9957618

Artem Chernyshov, Andreas Østvik, E. Smistad, L. Løvstakken

{"title":"Segmentation of 2D cardiac ultrasound with deep learning: simpler models for a simple task","authors":"Artem Chernyshov, Andreas Østvik, E. Smistad, L. Løvstakken","doi":"10.1109/IUS54386.2022.9957618","DOIUrl":null,"url":null,"abstract":"Low-complexity convolutional neural networks have been shown to be sufficient for segmentation of cardiac US images in A2C and A4C views. The performance of 24 varying-complexity implementations of U-Net and DeepLabV3+ (popular segmentation architectures) has been tested on cardiac US data (CAMUS data set) and street view data (Cityscapes data set). The inference speed of the models has also been measured before and after post-training optimization. The models systematically differed in their structural components: the number of layers and convolutional filters as well as the receptive field size. All models trained to maximize the Dice Coefficient. The Dice Coefficient was consistently high (0.86-0.90) on CA-MUS data and low (0.48-0.67) on Cityscapes data for all models. Each ten-fold reduction in the number of model parameters tended to reduce the score by ≈0.01 on CAMUS and by 0.03-0.05 on Cityscapes. Likewise, low-parameter models, especially the ones based on U-Net, had yielded predictions with higher (worse) Hausdorff Distance values. Increasing the receptive field size of the models partially mitigated this effect. Without post-training optimization, the inference speed mostly varied with the number of layers in the networks. The least complex U-Net model was 83% faster than the most complex one; for the DeepLab models the difference was 53%. With post-training optimization, any reduction in the number of parameters led to increased speed: up to more than 700% for both architecture types.","PeriodicalId":272387,"journal":{"name":"2022 IEEE International Ultrasonics Symposium (IUS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Ultrasonics Symposium (IUS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IUS54386.2022.9957618","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Low-complexity convolutional neural networks have been shown to be sufficient for segmentation of cardiac US images in A2C and A4C views. The performance of 24 varying-complexity implementations of U-Net and DeepLabV3+ (popular segmentation architectures) has been tested on cardiac US data (CAMUS data set) and street view data (Cityscapes data set). The inference speed of the models has also been measured before and after post-training optimization. The models systematically differed in their structural components: the number of layers and convolutional filters as well as the receptive field size. All models trained to maximize the Dice Coefficient. The Dice Coefficient was consistently high (0.86-0.90) on CA-MUS data and low (0.48-0.67) on Cityscapes data for all models. Each ten-fold reduction in the number of model parameters tended to reduce the score by ≈0.01 on CAMUS and by 0.03-0.05 on Cityscapes. Likewise, low-parameter models, especially the ones based on U-Net, had yielded predictions with higher (worse) Hausdorff Distance values. Increasing the receptive field size of the models partially mitigated this effect. Without post-training optimization, the inference speed mostly varied with the number of layers in the networks. The least complex U-Net model was 83% faster than the most complex one; for the DeepLab models the difference was 53%. With post-training optimization, any reduction in the number of parameters led to increased speed: up to more than 700% for both architecture types.

查看原文本刊更多论文

基于深度学习的二维心脏超声分割:简单任务的简单模型

低复杂度卷积神经网络已被证明足以分割A2C和A4C视图的心脏US图像。在心脏US数据(CAMUS数据集)和街景数据(cityscape数据集)上测试了24种不同复杂度的U-Net和DeepLabV3+(流行的分割架构)的性能。并对模型在训练后优化前后的推理速度进行了测量。这些模型在结构成分上有系统的不同:层数和卷积滤波器以及接受野的大小。所有模型的训练都是为了最大化骰子系数。在所有模型中，CA-MUS数据的Dice系数始终较高(0.86-0.90)，而cityscape数据的Dice系数始终较低(0.48-0.67)。模型参数数量每减少10倍，CAMUS评分降低≈0.01，城市景观评分降低0.03 ~ 0.05。同样，低参数模型，尤其是基于U-Net的模型，也产生了更高(更差)的豪斯多夫距离值的预测。增加模型的感受野大小部分地减轻了这种影响。在没有训练后优化的情况下，推理速度主要随网络层数的变化而变化。最简单的U-Net模型比最复杂的U-Net模型快83%;对于DeepLab模型，差异为53%。通过训练后优化，任何参数数量的减少都会导致速度的提高:对于两种架构类型来说，速度都超过700%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Ultrasonics Symposium (IUS)

自引率

0.00%

发文量