Few-Shot Data Augmentation for Industrial Character Recognition

Hongchao Gao, Xiaoqian Huang, Bofeng Liu
{"title":"Few-Shot Data Augmentation for Industrial Character Recognition","authors":"Hongchao Gao, Xiaoqian Huang, Bofeng Liu","doi":"10.1145/3581807.3581841","DOIUrl":null,"url":null,"abstract":"The task of industrial character recognition is to extract character content on the surface of the workpiece in the industrial production process. Limited training data, incomplete available character categories and non-standardized character styles encountered in actual production have led to a significant reduction in the recognition performance of deep learning-based methods, such as scene text recognition and Optical Character Recognition (OCR). In this paper, we propose an augmentation strategy suitable for industrial character recognition based on the Generative Adversarial Network (GAN). The strategy consists of two modules, a character detection module and a synthetic data generation module. The results show that the augmentation strategy achieves best generation results. Recognition network utilizing the augmentation dataset generated by the strategy can achieve the best results on four types of industrial datasets.","PeriodicalId":292813,"journal":{"name":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3581807.3581841","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The task of industrial character recognition is to extract character content on the surface of the workpiece in the industrial production process. Limited training data, incomplete available character categories and non-standardized character styles encountered in actual production have led to a significant reduction in the recognition performance of deep learning-based methods, such as scene text recognition and Optical Character Recognition (OCR). In this paper, we propose an augmentation strategy suitable for industrial character recognition based on the Generative Adversarial Network (GAN). The strategy consists of two modules, a character detection module and a synthetic data generation module. The results show that the augmentation strategy achieves best generation results. Recognition network utilizing the augmentation dataset generated by the strategy can achieve the best results on four types of industrial datasets.
工业字符识别的少镜头数据增强
工业字符识别的任务是在工业生产过程中提取工件表面的字符内容。有限的训练数据、不完整的可用字符类别以及在实际生产中遇到的非标准化字符样式导致基于深度学习的方法(如场景文本识别和光学字符识别(OCR))的识别性能显著降低。在本文中,我们提出了一种基于生成对抗网络(GAN)的适合工业字符识别的增强策略。该策略包括两个模块:字符检测模块和综合数据生成模块。结果表明,该增强策略获得了最佳的生成效果。利用该策略生成的增强数据集的识别网络可以在四种类型的工业数据集上获得最佳结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信