Pre-trained vs. Random Weights for Calculating Fréchet Inception Distance in Medical Imaging

Authors: Jamie A. O'Reilly, Fawad Asadi
Published in: 2021 13th Biomedical Engineering International Conference (BMEiCON), 2021-11-19
DOI: 10.1109/BMEiCON53485.2021.9745214
Citations: 1
Abstract
Fréchet Inception Distance (FID) is an evaluation metric for assessing the quality of images synthesized by generative models. Conventionally, this involves an Inception v3 convolutional neural network pre-trained to classify everyday color images from the ImageNet dataset. The final classification section of the network is omitted, leaving an efficient feature extractor that encodes each input image as a 2048-element vector. The difference or similarity between samples of images can then be quantified by measuring the distance between the distributions of their corresponding feature representations. Researchers have raised concerns about the utility of FID for evaluating unorthodox images (e.g., medical images) that are unlike those used for model training, suggesting that randomly initialized convolutional neural networks may be more appropriate. The aim of this study was to compare pre-trained and randomly initialized approaches for evaluating medical images. Robustness to synthetic image distortions (Gaussian noise, blurring, swirl, and impulse noise) and different image types (X-ray, CT, fundus, and everyday images) was assessed. Feature representations were projected into two-dimensional space and visualized using t-distributed stochastic neighbor embedding (t-SNE) and principal component analysis (PCA). Normalized FID between image classes was substantially larger and more consistent for the pre-trained model. Overall, this suggests that the pre-trained model is preferable to the randomly initialized model for evaluating medical images.
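The abstract describes FID as the distance between the distributions of feature vectors, each distribution modelled as a multivariate Gaussian. A minimal NumPy sketch of that distance is given below. This is not the authors' code: it assumes the 2048-element feature vectors have already been extracted (e.g., by a truncated Inception v3, pre-trained or randomly initialized), and it uses the eigenvalues of the covariance product to obtain the trace of the matrix square root rather than computing the square root explicitly.

```python
import numpy as np

def frechet_distance(feats_a: np.ndarray, feats_b: np.ndarray) -> float:
    """Fréchet distance between Gaussian fits to two feature samples,
    as used in FID:  ||mu_a - mu_b||^2 + Tr(C_a + C_b - 2 (C_a C_b)^{1/2}).

    feats_a, feats_b: arrays of shape (n_samples, n_features).
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.cov(feats_a, rowvar=False)
    cov_b = np.cov(feats_b, rowvar=False)

    # Tr((C_a C_b)^{1/2}) equals the sum of square roots of the
    # eigenvalues of C_a C_b, which are real and non-negative for
    # positive semi-definite covariances; clip guards against tiny
    # negative values from floating-point noise.
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * tr_sqrt)
```

On identical samples the distance is zero (up to floating-point error), and it grows as the two feature distributions separate, which is the behavior the study relies on when comparing pre-trained and random feature extractors.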