Evaluation of 3D GANs for Lung Tissue Modelling in Pulmonary CT

The journal of machine learning for biomedical imaging Pub Date : 2022-08-17 DOI:10.59275/j.melba.2022-9e4b

S. Ellis, O. M. Manzanera, V. Baltatzis, Ibrahim Nawaz, A. Nair, L. L. Folgoc, S. Desai, Ben Glocker, J. Schnabel

{"title":"Evaluation of 3D GANs for Lung Tissue Modelling in Pulmonary CT","authors":"S. Ellis, O. M. Manzanera, V. Baltatzis, Ibrahim Nawaz, A. Nair, L. L. Folgoc, S. Desai, Ben Glocker, J. Schnabel","doi":"10.59275/j.melba.2022-9e4b","DOIUrl":null,"url":null,"abstract":"Generative adversarial networks (GANs) are able to model accurately the distribution of complex, high-dimensional datasets, for example images. This characteristic makes high-quality GANs useful for unsupervised anomaly detection in medical imaging. However, differences in training datasets such as output image dimensionality and appearance of semantically meaningful features mean that GAN models from the natural image processing domain may not work 'out-of-the-box' for medical imaging applications, necessitating re-implementation and re-evaluation. In this work we adapt and evaluate three GAN models to the application of modelling 3D healthy image patches for pulmonary CT. To the best of our knowledge, this is the first time that such a detailed evaluation has been performed. The deep convolutional GAN (DCGAN), styleGAN and the bigGAN architectures were selected for investigation due to their ubiquity and high performance in natural image processing. We train different variants of these methods and assess their performance using the widely used Frechet Inception Distance (FID). In addition, the quality of the generated images was evaluated by a human observer study, the ability of the networks to model 3D domain-specific features was investigated, and the structure of the GAN latent spaces was analysed. Results show that the 3D styleGAN approaches produce realistic-looking images with meaningful 3D structure, but suffer from mode collapse which must be explicitly addressed during training to obtain diversity in the samples. Conversely, the 3D DCGAN models show a greater capacity for image variability, but at the cost of poor-quality images. The 3D bigGAN models provide an intermediate level of image quality, but most accurately model the distribution of selected semantically meaningful features. The results suggest that future development is required to realise a 3D GAN with sufficient representational capacity for patch-based lung CT anomaly detection and we offer recommendations for future areas of research, such as experimenting with other architectures and incorporation of position-encoding.","PeriodicalId":75083,"journal":{"name":"The journal of machine learning for biomedical imaging","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of machine learning for biomedical imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59275/j.melba.2022-9e4b","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Generative adversarial networks (GANs) are able to model accurately the distribution of complex, high-dimensional datasets, for example images. This characteristic makes high-quality GANs useful for unsupervised anomaly detection in medical imaging. However, differences in training datasets such as output image dimensionality and appearance of semantically meaningful features mean that GAN models from the natural image processing domain may not work 'out-of-the-box' for medical imaging applications, necessitating re-implementation and re-evaluation. In this work we adapt and evaluate three GAN models to the application of modelling 3D healthy image patches for pulmonary CT. To the best of our knowledge, this is the first time that such a detailed evaluation has been performed. The deep convolutional GAN (DCGAN), styleGAN and the bigGAN architectures were selected for investigation due to their ubiquity and high performance in natural image processing. We train different variants of these methods and assess their performance using the widely used Frechet Inception Distance (FID). In addition, the quality of the generated images was evaluated by a human observer study, the ability of the networks to model 3D domain-specific features was investigated, and the structure of the GAN latent spaces was analysed. Results show that the 3D styleGAN approaches produce realistic-looking images with meaningful 3D structure, but suffer from mode collapse which must be explicitly addressed during training to obtain diversity in the samples. Conversely, the 3D DCGAN models show a greater capacity for image variability, but at the cost of poor-quality images. The 3D bigGAN models provide an intermediate level of image quality, but most accurately model the distribution of selected semantically meaningful features. The results suggest that future development is required to realise a 3D GAN with sufficient representational capacity for patch-based lung CT anomaly detection and we offer recommendations for future areas of research, such as experimenting with other architectures and incorporation of position-encoding.

查看原文本刊更多论文

三维gan在肺部CT肺组织建模中的应用评价

生成对抗网络(GANs)能够准确地模拟复杂的高维数据集的分布，例如图像。这一特性使得高质量gan在医学成像中的无监督异常检测中非常有用。然而，训练数据集的差异，如输出图像维度和语义上有意义的特征的外观，意味着来自自然图像处理领域的GAN模型可能无法“开箱即用”地用于医学成像应用，需要重新实施和重新评估。在这项工作中，我们适应并评估了三种GAN模型在肺部CT三维健康图像斑块建模中的应用。据我们所知，这是第一次进行如此详细的评估。由于深度卷积GAN (DCGAN)、styleGAN和bigGAN架构在自然图像处理中的普遍性和高性能，我们选择了它们作为研究对象。我们训练了这些方法的不同变体，并使用广泛使用的Frechet Inception Distance (FID)来评估它们的性能。此外，通过人类观察者研究评估了生成图像的质量，研究了网络对3D特定领域特征的建模能力，并分析了GAN潜在空间的结构。结果表明，3D styleGAN方法产生具有有意义的3D结构的逼真图像，但必须在训练过程中明确解决模式崩溃问题，以获得样本的多样性。相反，3D DCGAN模型显示出更大的图像可变性能力，但代价是图像质量差。3D bigGAN模型提供了一个中等水平的图像质量，但最准确地模拟了选定的语义有意义的特征的分布。结果表明，未来的发展需要实现具有足够表征能力的3D GAN，用于基于补丁的肺部CT异常检测，我们为未来的研究领域提供了建议，例如实验其他架构和结合位置编码。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The journal of machine learning for biomedical imaging

自引率

0.00%

发文量