Delaunay Triangulation Data Augmentation Guided by Visual Analytics for Deep Learning

2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) Pub Date : 2018-10-01 DOI:10.1109/SIBGRAPI.2018.00056

A. Peixinho, B. C. Benato, L. G. Nonato, A. Falcão

{"title":"Delaunay Triangulation Data Augmentation Guided by Visual Analytics for Deep Learning","authors":"A. Peixinho, B. C. Benato, L. G. Nonato, A. Falcão","doi":"10.1109/SIBGRAPI.2018.00056","DOIUrl":null,"url":null,"abstract":"It is well known that image classification problems can be effectively solved by Convolutional Neural Networks (CNNs). However, the number of supervised training examples from all categories must be high enough to avoid model overfitting. In this case, two key alternatives are usually presented (a) the generation of artificial examples, known as data augmentation, and (b) reusing a CNN previously trained over a large supervised training set from another image classification problem — a strategy known as transfer learning. Deep learning approaches have rarely exploited the superior ability of humans for cognitive tasks during the machine learning loop. We advocate that the expert intervention through visual analytics can improve machine learning. In this work, we demonstrate this claim by proposing a data augmentation framework based on Encoder-Decoder Neural Networks (EDNNs) and visual analytics for the design of more effective CNN-based image classifiers. An EDNN is initially trained such that its encoder extracts a feature vector from each training image. These samples are projected from the encoder feature space on to a 2D coordinate space. The expert includes points to the projection space and the feature vectors of the new samples are obtained on the original feature space by interpolation. The decoder generates artificial images from the feature vectors of the new samples and the augmented training set is used to improve the CNN-based classifier. We evaluate methods for the proposed framework and demonstrate its advantages using data from a real problem as case study — the diagnosis of helminth eggs in humans. We also show that transfer learning and data augmentation by affine transformations can further improve the results.","PeriodicalId":208985,"journal":{"name":"2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIBGRAPI.2018.00056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

It is well known that image classification problems can be effectively solved by Convolutional Neural Networks (CNNs). However, the number of supervised training examples from all categories must be high enough to avoid model overfitting. In this case, two key alternatives are usually presented (a) the generation of artificial examples, known as data augmentation, and (b) reusing a CNN previously trained over a large supervised training set from another image classification problem — a strategy known as transfer learning. Deep learning approaches have rarely exploited the superior ability of humans for cognitive tasks during the machine learning loop. We advocate that the expert intervention through visual analytics can improve machine learning. In this work, we demonstrate this claim by proposing a data augmentation framework based on Encoder-Decoder Neural Networks (EDNNs) and visual analytics for the design of more effective CNN-based image classifiers. An EDNN is initially trained such that its encoder extracts a feature vector from each training image. These samples are projected from the encoder feature space on to a 2D coordinate space. The expert includes points to the projection space and the feature vectors of the new samples are obtained on the original feature space by interpolation. The decoder generates artificial images from the feature vectors of the new samples and the augmented training set is used to improve the CNN-based classifier. We evaluate methods for the proposed framework and demonstrate its advantages using data from a real problem as case study — the diagnosis of helminth eggs in humans. We also show that transfer learning and data augmentation by affine transformations can further improve the results.

查看原文本刊更多论文

基于深度学习的视觉分析的Delaunay三角数据增强

众所周知，卷积神经网络(cnn)可以有效地解决图像分类问题。然而，来自所有类别的监督训练样例的数量必须足够高，以避免模型过拟合。在这种情况下，通常会提出两个关键的替代方案:(a)生成人工示例，称为数据增强;(b)重用以前在另一个图像分类问题的大型监督训练集上训练过的CNN——一种称为迁移学习的策略。在机器学习循环中，深度学习方法很少利用人类在认知任务上的优越能力。我们主张通过可视化分析的专家干预可以改善机器学习。在这项工作中，我们通过提出基于编码器-解码器神经网络(ednn)和视觉分析的数据增强框架来证明这一说法，用于设计更有效的基于cnn的图像分类器。对EDNN进行初始训练，使其编码器从每个训练图像中提取特征向量。这些样本从编码器特征空间投影到二维坐标空间。专家将点加入到投影空间中，在原始特征空间上插值得到新样本的特征向量。解码器从新样本的特征向量生成人工图像，并使用增强训练集来改进基于cnn的分类器。我们评估了所提出的框架的方法，并使用来自真实问题的数据作为案例研究-人类寄生虫卵的诊断来证明其优势。我们还表明，迁移学习和仿射变换的数据增强可以进一步改善结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

自引率

0.00%

发文量