Handwriting recognition on form document using convolutional neural network and support vector machines (CNN-SVM)

2017 5th International Conference on Information and Communication Technology (ICoIC7) Pub Date : 2017-05-17 DOI:10.1109/ICOICT.2017.8074699

Darmatasia, M. I. Fanany

{"title":"Handwriting recognition on form document using convolutional neural network and support vector machines (CNN-SVM)","authors":"Darmatasia, M. I. Fanany","doi":"10.1109/ICOICT.2017.8074699","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a workflow and a machine learning model for recognizing handwritten characters on form document. The learning model is based on Convolutional Neural Network (CNN) as a powerful feature extraction and Support Vector Machines (SVM) as a high-end classifier. The proposed method is more efficient than modifying the CNN with complex architecture. We evaluated some SVM and found that the linear SVM using L1 loss function and L2 regularization giving the best performance both of the accuracy rate and the computation time. Based on the experiment results using data from NIST SD 192nd edition both for training and testing, the proposed method which combines CNN and linear SVM using L1 loss function and L2 regularization achieved a recognition rate better than only CNN. The recognition rate achieved by the proposed method are 98.85% on numeral characters, 93.05% on uppercase characters, 86.21% on lowercase characters, and 91.37% on the merger of numeral and uppercase characters. While the original CNN achieves an accuracy rate of 98.30% on numeral characters, 92.33% on uppercase characters, 83.54% on lowercase characters, and 88.32% on the merger of numeral and uppercase characters. The proposed method was also validated by using ten folds cross-validation, and it shows that the proposed method still can improve the accuracy rate. The learning model was used to construct a handwriting recognition system to recognize a more challenging data on form document automatically. The pre-processing, segmentation and character recognition are integrated into one system. The output of the system is converted into an editable text. The system gives an accuracy rate of 83.37% on ten different test form document.","PeriodicalId":244500,"journal":{"name":"2017 5th International Conference on Information and Communication Technology (ICoIC7)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"53","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 5th International Conference on Information and Communication Technology (ICoIC7)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOICT.2017.8074699","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 53

Abstract

In this paper, we propose a workflow and a machine learning model for recognizing handwritten characters on form document. The learning model is based on Convolutional Neural Network (CNN) as a powerful feature extraction and Support Vector Machines (SVM) as a high-end classifier. The proposed method is more efficient than modifying the CNN with complex architecture. We evaluated some SVM and found that the linear SVM using L1 loss function and L2 regularization giving the best performance both of the accuracy rate and the computation time. Based on the experiment results using data from NIST SD 192nd edition both for training and testing, the proposed method which combines CNN and linear SVM using L1 loss function and L2 regularization achieved a recognition rate better than only CNN. The recognition rate achieved by the proposed method are 98.85% on numeral characters, 93.05% on uppercase characters, 86.21% on lowercase characters, and 91.37% on the merger of numeral and uppercase characters. While the original CNN achieves an accuracy rate of 98.30% on numeral characters, 92.33% on uppercase characters, 83.54% on lowercase characters, and 88.32% on the merger of numeral and uppercase characters. The proposed method was also validated by using ten folds cross-validation, and it shows that the proposed method still can improve the accuracy rate. The learning model was used to construct a handwriting recognition system to recognize a more challenging data on form document automatically. The pre-processing, segmentation and character recognition are integrated into one system. The output of the system is converted into an editable text. The system gives an accuracy rate of 83.37% on ten different test form document.

查看原文本刊更多论文

基于卷积神经网络和支持向量机的手写表单识别

在本文中，我们提出了一种识别表单文档中手写字符的工作流程和机器学习模型。该学习模型基于卷积神经网络(CNN)作为强大的特征提取和支持向量机(SVM)作为高端分类器。该方法比对结构复杂的CNN进行修改更有效。对几种支持向量机进行了评价，发现使用L1损失函数和L2正则化的线性支持向量机在准确率和计算时间上都有最好的表现。利用NIST SD 192版数据进行训练和测试的实验结果表明，采用L1损失函数和L2正则化方法将CNN与线性支持向量机相结合的方法取得了比单独使用CNN更好的识别率。该方法对数字字符的识别率为98.85%，对大写字符的识别率为93.05%，对小写字符的识别率为86.21%，对数字和大写字符合并的识别率为91.37%。而原始CNN对数字字符的准确率为98.30%，对大写字符的准确率为92.33%，对小写字符的准确率为83.54%，对数字和大写字符合并的准确率为88.32%。通过十倍交叉验证对所提方法进行了验证，结果表明所提方法仍能提高准确率。利用该学习模型构建了手写识别系统，实现了对表单文档中较难识别的数据的自动识别。将预处理、分割和字符识别集成到一个系统中。系统的输出被转换成可编辑的文本。该系统对10种不同的测试表单文档的准确率达到83.37%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 5th International Conference on Information and Communication Technology (ICoIC7)

自引率

0.00%

发文量