基于CRNN的OCR用于美国和英国手语手指拼写

2021 13th International Conference on Knowledge and Systems Engineering (KSE) Pub Date : 2021-11-10 DOI:10.1109/KSE53942.2021.9648593

Ni Htwe Aung, Honey Htun, Ye Kyaw Thu, Su Su Maung

{"title":"基于CRNN的OCR用于美国和英国手语手指拼写","authors":"Ni Htwe Aung, Honey Htun, Ye Kyaw Thu, Su Su Maung","doi":"10.1109/KSE53942.2021.9648593","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) technology is mostly used to convert image containing written text (typed, handwritten, printed or scanned) into machine-readable text data. This work explores the first investigation of American Sign Language (ASL) and British Sign Language (BSL) fingerspelling font images to the corresponding English text conversion system. The proposed system is implemented by the Convolutional Recurrent Neural Network (CRNN) model with three different feature extraction methods. We also investigated two types of hyper-parameters such as hidden size and number of iterations. The experimental results show that our system achieved significantly higher conversion quality on the open-test dataset for both ASL and BSL fingerspelling. Our proposed technique can also be used in deaf education, for example, to extract fingerspelling images from exam answer sheets.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"306 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"CRNN Based OCR for American and British Sign Language Fingerspelling\",\"authors\":\"Ni Htwe Aung, Honey Htun, Ye Kyaw Thu, Su Su Maung\",\"doi\":\"10.1109/KSE53942.2021.9648593\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optical Character Recognition (OCR) technology is mostly used to convert image containing written text (typed, handwritten, printed or scanned) into machine-readable text data. This work explores the first investigation of American Sign Language (ASL) and British Sign Language (BSL) fingerspelling font images to the corresponding English text conversion system. The proposed system is implemented by the Convolutional Recurrent Neural Network (CRNN) model with three different feature extraction methods. We also investigated two types of hyper-parameters such as hidden size and number of iterations. The experimental results show that our system achieved significantly higher conversion quality on the open-test dataset for both ASL and BSL fingerspelling. Our proposed technique can also be used in deaf education, for example, to extract fingerspelling images from exam answer sheets.\",\"PeriodicalId\":130986,\"journal\":{\"name\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"volume\":\"306 \",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KSE53942.2021.9648593\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

光学字符识别(OCR)技术主要用于将包含书面文本(打字、手写、打印或扫描)的图像转换为机器可读的文本数据。本研究首次探讨了美国手语(ASL)和英国手语(BSL)的手指拼写字体图像到相应的英语文本转换系统的研究。该系统采用卷积递归神经网络(CRNN)模型和三种不同的特征提取方法来实现。我们还研究了两种类型的超参数，如隐藏大小和迭代次数。实验结果表明，我们的系统在开放测试数据集上对ASL和BSL手指拼写都取得了明显更高的转换质量。我们提出的技术也可以用于聋人教育，例如，从考试答题纸中提取指纹拼写图像。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CRNN Based OCR for American and British Sign Language Fingerspelling

Optical Character Recognition (OCR) technology is mostly used to convert image containing written text (typed, handwritten, printed or scanned) into machine-readable text data. This work explores the first investigation of American Sign Language (ASL) and British Sign Language (BSL) fingerspelling font images to the corresponding English text conversion system. The proposed system is implemented by the Convolutional Recurrent Neural Network (CRNN) model with three different feature extraction methods. We also investigated two types of hyper-parameters such as hidden size and number of iterations. The experimental results show that our system achieved significantly higher conversion quality on the open-test dataset for both ASL and BSL fingerspelling. Our proposed technique can also be used in deaf education, for example, to extract fingerspelling images from exam answer sheets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 13th International Conference on Knowledge and Systems Engineering (KSE)

自引率

0.00%

发文量