Image augmentation by blocky artifact in Deep Convolutional Neural Network for handwritten digit recognition

2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR) Pub Date : 1900-01-01 DOI:10.1109/ICIVPR.2017.7890867

Md Shopon, Nabeel Mohammed, Md. Anowarul Abedin

引用次数: 22

Abstract

Deep Convolutional Neural Networks - also known as DCNN - are powerful models for different visual pattern classification problems. Many works in this field use image augmentation at the training phase to achieve better accuracy. This paper presents blocky artifact as an augmentation technique to increase the accuracy of DCNN for handwritten digit recognition, both English and Bangla digits, i.e., 0–9. This paper conducts a number of experiments on three different datasets: MNIST Dataset, CMATERDB 3.1.1 Dataset and Indian Statistical Institute (ISI) Dataset. For each dataset, DCNNs with the proposed augmentation technique give better results than those without such augmentation. Unsupervised pre-training with the blocky artifact achieves 99.56%, 99.83% and 99.35% accuracy respectively on MNIST, CMATERDDB and ISI datasets producing, in the process, so far the best accuracy rate for CMATERDB and ISI datasets.

查看原文本刊更多论文

基于块伪影的深度卷积神经网络手写体数字识别图像增强

深度卷积神经网络-也被称为DCNN -是不同视觉模式分类问题的强大模型。该领域的许多工作在训练阶段使用图像增强来达到更好的精度。本文提出了块伪影作为一种增强技术来提高DCNN的手写数字识别精度，包括英语和孟加拉语数字，即0-9。本文在三个不同的数据集上进行了大量实验:MNIST数据集、CMATERDB 3.1.1数据集和印度统计研究所(ISI)数据集。对于每个数据集，采用本文提出的增强技术的DCNNs比未采用这种增强技术的DCNNs得到更好的结果。基于块构件的无监督预训练在MNIST、CMATERDDB和ISI数据集上分别达到了99.56%、99.83%和99.35%的准确率，其中CMATERDB和ISI数据集的准确率是目前为止最好的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)

自引率

0.00%

发文量