On learning deep domain-invariant features from 2D synthetic images for industrial visual inspection

Abdelrahman G. Abubakr, Igor Jovančević, Nour Islam Mokhtari, Hamdi Ben Abdallah, J. Orteu
DOI: 10.1117/12.2589040
Published in: International Conference on Quality Control by Artificial Vision
Publication date: 2021-07-16
Citations: 2

Abstract

Deep learning has led to major advances in computer vision. However, deep models require large amounts of manually annotated data, which are difficult to obtain, particularly in sensitive industries. Rendering Computer-Aided Design (CAD) models to generate synthetic training data is an attractive workaround. This paper focuses on using Deep Convolutional Neural Networks (DCNN) for automatic industrial inspection of mechanical assemblies, where training images are limited and hard to collect. The ultimate goal of this work is to obtain a DCNN classification model trained on synthetic renders and deploy it to verify the presence of target objects in never-before-seen real images captured by RGB cameras. Two approaches are adopted to close the domain gap between synthetic and real images. First, the Domain Randomization technique is applied to generate synthetic training data. Second, a novel approach is proposed to learn better feature representations by means of self-supervision: we use an Augmented Auto-Encoder (AAE) and achieve results competitive with our baseline model trained on real images. In addition, this approach outperforms the baseline when the problem is simplified to binary classification for each object individually.
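The core idea behind the two approaches can be illustrated with a toy sketch. An Augmented Auto-Encoder is trained to reconstruct the *clean* view from a randomly *augmented* view, so the latent code must discard nuisance factors (lighting, noise) that Domain Randomization injects into the synthetic renders. The sketch below is a minimal, self-contained illustration of that training objective only: random low-rank vectors stand in for rendered images, the per-sample gain and additive noise stand in for randomized lighting and sensor noise, and the linear encoder/decoder and all sizes are illustrative assumptions, not the authors' architecture.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 512, 16, 4                      # samples, input dim, latent dim

# "Clean renders": low-rank signal, so a rank-k code can represent them.
A = rng.normal(size=(k, d))               # ground-truth factors
X = rng.normal(size=(n, k)) @ A

# "Domain-randomized" augmented views: random per-sample gain (stand-in for
# lighting randomization) plus additive noise (stand-in for sensor noise).
gain = rng.uniform(0.7, 1.3, size=(n, 1))
X_aug = gain * X + 0.3 * rng.normal(size=(n, d))

# Linear encoder/decoder, trained by plain gradient descent on MSE.
W_enc = 0.1 * rng.normal(size=(d, k))
W_dec = 0.1 * rng.normal(size=(k, d))
lr = 0.01

def mse() -> float:
    """Reconstruction error of the CLEAN view from the augmented view."""
    return float(np.mean((X_aug @ W_enc @ W_dec - X) ** 2))

mse_init = mse()
for _ in range(2000):
    Z = X_aug @ W_enc                     # encode the augmented view
    err = Z @ W_dec - X                   # target is the clean view
    g_dec = Z.T @ err / n                 # MSE gradient w.r.t. decoder
    g_enc = X_aug.T @ (err @ W_dec.T) / n # MSE gradient w.r.t. encoder
    W_dec -= lr * g_dec
    W_enc -= lr * g_enc
mse_final = mse()

print(f"reconstruction MSE: {mse_init:.3f} -> {mse_final:.3f}")
```

Because the reconstruction target is always the un-augmented view, minimizing this loss forces the code `Z` to become invariant to the randomized gain and noise; in the paper, features learned this way feed the downstream presence/absence classifier on real images.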