Semi-supervised learning with cross-localisation in shared GAN latent space for enhanced OCT data augmentation

{"title":"Semi-supervised learning with cross-localisation in shared GAN latent space for enhanced OCT data augmentation","authors":"","doi":"10.1109/DICTA56598.2022.10034570","DOIUrl":null,"url":null,"abstract":"Deep learning methods have demonstrated stateof-the-art performance for the segmentation of the retina and choroid in optical coherence tomography (OCT) images. These methods are automatic and fast, yielding high accuracy and precision, thus reducing the load of manual analysis. However, deep learning usually requires large amounts of diverse, labelled data for training which can be difficult or infeasible to obtain, especially for medical images. For example, privacy concerns and lack of confidentiality agreements are common and are an obstacle to the sharing of useful training data. Additionally, some data can be significantly more difficult to obtain in the first place such as that of rare pathologies. Even in cases where sufficient data is available, the cost and time to perform image labelling can be significant. In many cases, data augmentation is employed to enhance the size of the training set. Similarly, semisupervised learning (SSL) can be used to exploit potentially large amounts of unlabeled data which would otherwise be unused. Motivated by this, in this study, we propose an enhanced StyleGAN2-based data augmentation method for OCT images by employing SSL through a novel crosslocalisation technique. For OCT image patches, the proposed method significantly improved the classification accuracy over the previous GAN data augmentation approach which uses labelled data only. The technique works by automatically learning, mixing, and injecting unlabelled styles into the labelled data to further increase the diversity of the synthetic data. The proposed method can be trained using differing quantities of both labelled and unlabelled data simultaneously. The method is simple, effective, generalizable and can be easily applied and used to extend StyleGAN2. Hence, there is also significant potential for the proposed method to be applied to other domains and imaging modalities for data augmentation purposes where unlabelled data exists.","PeriodicalId":159377,"journal":{"name":"2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA56598.2022.10034570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Deep learning methods have demonstrated stateof-the-art performance for the segmentation of the retina and choroid in optical coherence tomography (OCT) images. These methods are automatic and fast, yielding high accuracy and precision, thus reducing the load of manual analysis. However, deep learning usually requires large amounts of diverse, labelled data for training which can be difficult or infeasible to obtain, especially for medical images. For example, privacy concerns and lack of confidentiality agreements are common and are an obstacle to the sharing of useful training data. Additionally, some data can be significantly more difficult to obtain in the first place such as that of rare pathologies. Even in cases where sufficient data is available, the cost and time to perform image labelling can be significant. In many cases, data augmentation is employed to enhance the size of the training set. Similarly, semisupervised learning (SSL) can be used to exploit potentially large amounts of unlabeled data which would otherwise be unused. Motivated by this, in this study, we propose an enhanced StyleGAN2-based data augmentation method for OCT images by employing SSL through a novel crosslocalisation technique. For OCT image patches, the proposed method significantly improved the classification accuracy over the previous GAN data augmentation approach which uses labelled data only. The technique works by automatically learning, mixing, and injecting unlabelled styles into the labelled data to further increase the diversity of the synthetic data. The proposed method can be trained using differing quantities of both labelled and unlabelled data simultaneously. The method is simple, effective, generalizable and can be easily applied and used to extend StyleGAN2. Hence, there is also significant potential for the proposed method to be applied to other domains and imaging modalities for data augmentation purposes where unlabelled data exists.
基于共享GAN潜在空间的交叉定位半监督学习增强OCT数据增强
深度学习方法已经证明了光学相干断层扫描(OCT)图像中视网膜和脉络膜分割的最先进性能。这些方法自动、快速,准确度和精密度高,从而减少了人工分析的工作量。然而,深度学习通常需要大量不同的标记数据进行训练,这些数据很难或不可获得,特别是对于医学图像。例如,隐私问题和缺乏保密协议是常见的,是分享有用训练数据的障碍。此外,一些数据可能更难以获得,例如罕见疾病的数据。即使在有足够数据可用的情况下,执行图像标记的成本和时间也可能很大。在许多情况下,数据增强是用来增强训练集的大小。类似地,半监督学习(SSL)可用于利用大量未标记的数据,否则这些数据将不会被使用。基于此,在本研究中,我们提出了一种基于stylegan2的增强OCT图像数据增强方法,该方法通过一种新的交叉定位技术使用SSL。对于OCT图像贴片,与之前仅使用标记数据的GAN数据增强方法相比,该方法显著提高了分类精度。该技术的工作原理是自动学习、混合和将未标记的样式注入已标记的数据中,以进一步增加合成数据的多样性。所提出的方法可以同时使用不同数量的标记和未标记数据进行训练。该方法简单、有效、通用性强,易于应用和扩展StyleGAN2。因此,所提出的方法也有很大的潜力应用于其他领域和成像模式,用于存在未标记数据的数据增强目的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信