Training a convolutional neural network for multi-class object detection using solely virtual world data

2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) Pub Date : 2016-08-01 DOI:10.1109/AVSS.2016.7738056

Erik Bochinski, Volker Eiselein, T. Sikora

引用次数: 36

Abstract

Convolutional neural networks are a popular choice for current object detection and classification systems. Their performance improves constantly but for effective training, large, hand-labeled datasets are required. We address the problem of obtaining customized, yet large enough datasets for CNN training by synthesizing them in a virtual world, thus eliminating the need for tedious human interaction for ground truth creation. We developed a CNN-based multi-class detection system that was trained solely on virtual world data and achieves competitive results compared to state-of-the-art detection systems.

查看原文本刊更多论文

仅使用虚拟世界数据训练用于多类目标检测的卷积神经网络

卷积神经网络是当前目标检测和分类系统的热门选择。它们的性能不断提高，但为了进行有效的训练，需要大量手工标记的数据集。我们通过在虚拟世界中合成数据集，解决了为CNN训练获得定制的、足够大的数据集的问题，从而消除了在创建地面真值时繁琐的人类交互的需要。我们开发了一个基于cnn的多类检测系统，该系统仅在虚拟世界数据上进行训练，与最先进的检测系统相比，它取得了具有竞争力的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

自引率

0.00%

发文量