Image Augmentation Techniques for Cascade Model Training

2018 Zooming Innovation in Consumer Technologies Conference (ZINC) Pub Date : 2018-05-01 DOI:10.1109/ZINC.2018.8448407

Diiana Vitas, Martina Tomic, Matko Burul

{"title":"Image Augmentation Techniques for Cascade Model Training","authors":"Diiana Vitas, Martina Tomic, Matko Burul","doi":"10.1109/ZINC.2018.8448407","DOIUrl":null,"url":null,"abstract":"Cascade classifiers are widely used in embedded systems for tasks of face detection or autonomous driving. To apply cascade classifiers for such advanced tasks, training has to be performed on a large set of samples. Image augmentation methods are introduced to artificially extend training set by creating many altered versions of the same image. This provides more samples to train on, but can also help to expose the classifier to a wider variety of colors and lighting to make a classifier more robust. Due to large number of augmentation methods, selection of appropriate methods depends of data set and classifiers purpose. In this paper the influence of different augmentation methods on traffic light classifier training is investigated. Testing is performed using basic augmentation methods which affect sample's color and luminance characteristics, such as brightness, contrast modifications, blurring and noise addition. Furthermore, horizontal flip as well as color jittering based on principal component analysis were selected as good method candidates while these methods do not significantly influence sample's properties. Each method is applied to a set of training data and the impact it had on the training process is analyzed. Results are presented in the form of rate change of true positives (TP), as well as the rate change of false positives (FP), obtained when comparing different augmentation methods with a non-augmented baseline model. Most of the tested methods reduced the FP count, however, they also resulted in a reduced TP count. The overall results indicate that the selection of augmentation methods heavily depends on the quality of the training data.","PeriodicalId":366195,"journal":{"name":"2018 Zooming Innovation in Consumer Technologies Conference (ZINC)","volume":"113 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Zooming Innovation in Consumer Technologies Conference (ZINC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ZINC.2018.8448407","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Cascade classifiers are widely used in embedded systems for tasks of face detection or autonomous driving. To apply cascade classifiers for such advanced tasks, training has to be performed on a large set of samples. Image augmentation methods are introduced to artificially extend training set by creating many altered versions of the same image. This provides more samples to train on, but can also help to expose the classifier to a wider variety of colors and lighting to make a classifier more robust. Due to large number of augmentation methods, selection of appropriate methods depends of data set and classifiers purpose. In this paper the influence of different augmentation methods on traffic light classifier training is investigated. Testing is performed using basic augmentation methods which affect sample's color and luminance characteristics, such as brightness, contrast modifications, blurring and noise addition. Furthermore, horizontal flip as well as color jittering based on principal component analysis were selected as good method candidates while these methods do not significantly influence sample's properties. Each method is applied to a set of training data and the impact it had on the training process is analyzed. Results are presented in the form of rate change of true positives (TP), as well as the rate change of false positives (FP), obtained when comparing different augmentation methods with a non-augmented baseline model. Most of the tested methods reduced the FP count, however, they also resulted in a reduced TP count. The overall results indicate that the selection of augmentation methods heavily depends on the quality of the training data.

查看原文本刊更多论文

级联模型训练的图像增强技术

级联分类器广泛应用于嵌入式系统中人脸检测或自动驾驶任务。为了将级联分类器应用于这种高级任务，必须对大量样本进行训练。引入了图像增强方法，通过创建同一图像的多个修改版本来人为地扩展训练集。这提供了更多的样本来训练，但也可以帮助分类器暴露在更广泛的颜色和光照下，使分类器更健壮。由于增强方法众多，选择合适的方法取决于数据集和分类器的目的。本文研究了不同增强方法对红绿灯分类器训练的影响。使用影响样品颜色和亮度特性的基本增强方法进行测试，如亮度、对比度修改、模糊和噪声添加。此外，选择基于主成分分析的水平翻转和颜色抖动作为较好的候选方法，这些方法对样品的性质没有显著影响。将每种方法应用于一组训练数据，并分析其对训练过程的影响。结果以真阳性率(TP)变化率和假阳性率(FP)变化率的形式呈现，这些变化率是在比较不同增强方法与非增强基线模型时获得的。大多数测试方法降低FP计数，然而，它们也导致TP计数降低。总体结果表明，增强方法的选择在很大程度上取决于训练数据的质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Zooming Innovation in Consumer Technologies Conference (ZINC)

自引率

0.00%

发文量