{"title":"Implementation of a reading device for bengali speaking visually handicapped people","authors":"Md. Mahade Sarkar, Shuvasis Datta, Md. Mahedi Hassan","doi":"10.1109/R10-HTC.2017.8288999","DOIUrl":null,"url":null,"abstract":"A reading device is a compact hardware setup with necessary programmes coded in it which read out printed documents like a human reader. People having eyesight problem can't read books, papers or any kind of printed reading materials. This problem can be solved simply by taking image of the reading materials, extracting words from the image and converting those words to sound, so by hearing that text converted sound they can understand what is written on that paper. A device is implemented for Bengali speaking visually handicapped people. For character recognition tesseract-ocr is used as optical character recognition(OCR) engine. Python gTTS module, a text to speech engine is used to convert the words extracted by tesseract-ocr to sound. Whole process is implemented on Raspberry Pi based a compact hardware design. Accuracy of character detection from the captured image and words to sound conversion is as high as 85 %. It is to be mentioned that accuracy is calculated as percentage of correct words to total words in an image.","PeriodicalId":411099,"journal":{"name":"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/R10-HTC.2017.8288999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
A reading device is a compact hardware setup with necessary programmes coded in it which read out printed documents like a human reader. People having eyesight problem can't read books, papers or any kind of printed reading materials. This problem can be solved simply by taking image of the reading materials, extracting words from the image and converting those words to sound, so by hearing that text converted sound they can understand what is written on that paper. A device is implemented for Bengali speaking visually handicapped people. For character recognition tesseract-ocr is used as optical character recognition(OCR) engine. Python gTTS module, a text to speech engine is used to convert the words extracted by tesseract-ocr to sound. Whole process is implemented on Raspberry Pi based a compact hardware design. Accuracy of character detection from the captured image and words to sound conversion is as high as 85 %. It is to be mentioned that accuracy is calculated as percentage of correct words to total words in an image.