{"title":"为说孟加拉语的视障人士设计一套阅读装置","authors":"Md. Mahade Sarkar, Shuvasis Datta, Md. Mahedi Hassan","doi":"10.1109/R10-HTC.2017.8288999","DOIUrl":null,"url":null,"abstract":"A reading device is a compact hardware setup with necessary programmes coded in it which read out printed documents like a human reader. People having eyesight problem can't read books, papers or any kind of printed reading materials. This problem can be solved simply by taking image of the reading materials, extracting words from the image and converting those words to sound, so by hearing that text converted sound they can understand what is written on that paper. A device is implemented for Bengali speaking visually handicapped people. For character recognition tesseract-ocr is used as optical character recognition(OCR) engine. Python gTTS module, a text to speech engine is used to convert the words extracted by tesseract-ocr to sound. Whole process is implemented on Raspberry Pi based a compact hardware design. Accuracy of character detection from the captured image and words to sound conversion is as high as 85 %. It is to be mentioned that accuracy is calculated as percentage of correct words to total words in an image.","PeriodicalId":411099,"journal":{"name":"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Implementation of a reading device for bengali speaking visually handicapped people\",\"authors\":\"Md. Mahade Sarkar, Shuvasis Datta, Md. Mahedi Hassan\",\"doi\":\"10.1109/R10-HTC.2017.8288999\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A reading device is a compact hardware setup with necessary programmes coded in it which read out printed documents like a human reader. People having eyesight problem can't read books, papers or any kind of printed reading materials. This problem can be solved simply by taking image of the reading materials, extracting words from the image and converting those words to sound, so by hearing that text converted sound they can understand what is written on that paper. A device is implemented for Bengali speaking visually handicapped people. For character recognition tesseract-ocr is used as optical character recognition(OCR) engine. Python gTTS module, a text to speech engine is used to convert the words extracted by tesseract-ocr to sound. Whole process is implemented on Raspberry Pi based a compact hardware design. Accuracy of character detection from the captured image and words to sound conversion is as high as 85 %. It is to be mentioned that accuracy is calculated as percentage of correct words to total words in an image.\",\"PeriodicalId\":411099,\"journal\":{\"name\":\"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/R10-HTC.2017.8288999\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/R10-HTC.2017.8288999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Implementation of a reading device for bengali speaking visually handicapped people
A reading device is a compact hardware setup with necessary programmes coded in it which read out printed documents like a human reader. People having eyesight problem can't read books, papers or any kind of printed reading materials. This problem can be solved simply by taking image of the reading materials, extracting words from the image and converting those words to sound, so by hearing that text converted sound they can understand what is written on that paper. A device is implemented for Bengali speaking visually handicapped people. For character recognition tesseract-ocr is used as optical character recognition(OCR) engine. Python gTTS module, a text to speech engine is used to convert the words extracted by tesseract-ocr to sound. Whole process is implemented on Raspberry Pi based a compact hardware design. Accuracy of character detection from the captured image and words to sound conversion is as high as 85 %. It is to be mentioned that accuracy is calculated as percentage of correct words to total words in an image.