GAN Based Method for Labeled Image Augmentation in Autonomous Driving

Wenbo Yu, Yong Sun, Ruilin Zhou, Xingjian Liu
{"title":"GAN Based Method for Labeled Image Augmentation in Autonomous Driving","authors":"Wenbo Yu, Yong Sun, Ruilin Zhou, Xingjian Liu","doi":"10.1109/ICCVE45908.2019.8964902","DOIUrl":null,"url":null,"abstract":"Deep learning models in Autonomous Driving perception tasks commonly use supervised learning methods and thus highly depend on labeled data. Training with more labeled data tends to bring better results, which highlights the meaning of data augmentation. Currently there are two difficulties when doing data augmentation. Firstly, it is time consuming to manually label the collected raw data. The second issue is that the diversity of a dataset is limited by the collection environment and time. In this paper, we proposed to use the current state of the art Multimodal Unsupervised Image-to-Image Translation (MUNIT) to generate synthesized images from labeled data. One of the benefits is that the generated data are automated labeled since they share the same ground truth with the raw data. Then we used the augmentation dataset to do different tasks including drivable area detection and object detection to prove that the data could be used to improve the performance of convolution neural networks (CNNs). We also designed an auto labelling tool that people could do labelling with the help of the improved CNN. The whole process is like a close loop that finishes labelling tasks while making progresses by itself. Generally speaking, our approach introduces an auto labelling pipeline based on unsupervised image-to-image translation to increase the amount and diversity of labeled data.","PeriodicalId":384049,"journal":{"name":"2019 IEEE International Conference on Connected Vehicles and Expo (ICCVE)","volume":"238 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Connected Vehicles and Expo (ICCVE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCVE45908.2019.8964902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Deep learning models in Autonomous Driving perception tasks commonly use supervised learning methods and thus highly depend on labeled data. Training with more labeled data tends to bring better results, which highlights the meaning of data augmentation. Currently there are two difficulties when doing data augmentation. Firstly, it is time consuming to manually label the collected raw data. The second issue is that the diversity of a dataset is limited by the collection environment and time. In this paper, we proposed to use the current state of the art Multimodal Unsupervised Image-to-Image Translation (MUNIT) to generate synthesized images from labeled data. One of the benefits is that the generated data are automated labeled since they share the same ground truth with the raw data. Then we used the augmentation dataset to do different tasks including drivable area detection and object detection to prove that the data could be used to improve the performance of convolution neural networks (CNNs). We also designed an auto labelling tool that people could do labelling with the help of the improved CNN. The whole process is like a close loop that finishes labelling tasks while making progresses by itself. Generally speaking, our approach introduces an auto labelling pipeline based on unsupervised image-to-image translation to increase the amount and diversity of labeled data.
基于GAN的自动驾驶标记图像增强方法
自动驾驶感知任务中的深度学习模型通常使用监督学习方法,因此高度依赖于标记数据。标记数据越多,训练效果越好,这就凸显了数据增强的意义。目前在进行数据增强时有两个困难。首先,手工标记收集到的原始数据非常耗时。第二个问题是数据集的多样性受到收集环境和时间的限制。在本文中,我们提出使用当前最先进的多模态无监督图像到图像转换(MUNIT)从标记数据生成合成图像。其中一个好处是生成的数据是自动标记的,因为它们与原始数据共享相同的基础事实。然后,我们使用增强数据集进行不同的任务,包括可驾驶区域检测和目标检测,以证明该数据可以用于提高卷积神经网络(cnn)的性能。我们还设计了一个自动标注工具,人们可以在改进的CNN的帮助下进行标注。整个过程就像一个闭环,在完成标记任务的同时,自己也在进步。一般来说,我们的方法引入了一个基于无监督图像到图像转换的自动标记管道,以增加标记数据的数量和多样性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信