Ni Htwe Aung, Honey Htun, Ye Kyaw Thu, Su Su Maung
{"title":"基于CRNN的OCR用于美国和英国手语手指拼写","authors":"Ni Htwe Aung, Honey Htun, Ye Kyaw Thu, Su Su Maung","doi":"10.1109/KSE53942.2021.9648593","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) technology is mostly used to convert image containing written text (typed, handwritten, printed or scanned) into machine-readable text data. This work explores the first investigation of American Sign Language (ASL) and British Sign Language (BSL) fingerspelling font images to the corresponding English text conversion system. The proposed system is implemented by the Convolutional Recurrent Neural Network (CRNN) model with three different feature extraction methods. We also investigated two types of hyper-parameters such as hidden size and number of iterations. The experimental results show that our system achieved significantly higher conversion quality on the open-test dataset for both ASL and BSL fingerspelling. Our proposed technique can also be used in deaf education, for example, to extract fingerspelling images from exam answer sheets.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"306 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"CRNN Based OCR for American and British Sign Language Fingerspelling\",\"authors\":\"Ni Htwe Aung, Honey Htun, Ye Kyaw Thu, Su Su Maung\",\"doi\":\"10.1109/KSE53942.2021.9648593\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optical Character Recognition (OCR) technology is mostly used to convert image containing written text (typed, handwritten, printed or scanned) into machine-readable text data. This work explores the first investigation of American Sign Language (ASL) and British Sign Language (BSL) fingerspelling font images to the corresponding English text conversion system. The proposed system is implemented by the Convolutional Recurrent Neural Network (CRNN) model with three different feature extraction methods. We also investigated two types of hyper-parameters such as hidden size and number of iterations. The experimental results show that our system achieved significantly higher conversion quality on the open-test dataset for both ASL and BSL fingerspelling. Our proposed technique can also be used in deaf education, for example, to extract fingerspelling images from exam answer sheets.\",\"PeriodicalId\":130986,\"journal\":{\"name\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"volume\":\"306 \",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KSE53942.2021.9648593\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CRNN Based OCR for American and British Sign Language Fingerspelling
Optical Character Recognition (OCR) technology is mostly used to convert image containing written text (typed, handwritten, printed or scanned) into machine-readable text data. This work explores the first investigation of American Sign Language (ASL) and British Sign Language (BSL) fingerspelling font images to the corresponding English text conversion system. The proposed system is implemented by the Convolutional Recurrent Neural Network (CRNN) model with three different feature extraction methods. We also investigated two types of hyper-parameters such as hidden size and number of iterations. The experimental results show that our system achieved significantly higher conversion quality on the open-test dataset for both ASL and BSL fingerspelling. Our proposed technique can also be used in deaf education, for example, to extract fingerspelling images from exam answer sheets.