文字识别和人脸检测援助视障人士使用树莓派

M. Rajesh, Bindhu K. Rajan, A. Roy, K. Thomas, A. Thomas, T. B. Tharakan, C. Dinesh
{"title":"文字识别和人脸检测援助视障人士使用树莓派","authors":"M. Rajesh, Bindhu K. Rajan, A. Roy, K. Thomas, A. Thomas, T. B. Tharakan, C. Dinesh","doi":"10.1109/ICCPCT.2017.8074355","DOIUrl":null,"url":null,"abstract":"Speech and text is the main medium for human communication. A person needs vision to access the information in a text. However those who have poor vision can gather information from voice. This paper proposes a camera based assistive text reading to help visually impaired person in reading the text present on the captured image. The faces can also be detected when a person enter into the frame by the mode control. The proposed idea involves text extraction from scanned image using Tesseract Optical Character Recognition (OCR) and converting the text to speech by e-Speak tool, a process which makes visually impaired persons to read the text. This is a prototype for blind people to recognize the products in real world by extracting the text on image and converting it into speech. Proposed method is carried out by using Raspberry pi and portability is achieved by using a battery backup. Thus the user can carry the device anywhere and able to use at any time. Upon entering the camera view previously stored faces are identified and informed which can be implemented as a future technology. This technology helps millions of people in the world who experience a significant loss of vision.","PeriodicalId":208028,"journal":{"name":"2017 International Conference on Circuit ,Power and Computing Technologies (ICCPCT)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":"{\"title\":\"Text recognition and face detection aid for visually impaired person using Raspberry PI\",\"authors\":\"M. Rajesh, Bindhu K. Rajan, A. Roy, K. Thomas, A. Thomas, T. B. Tharakan, C. Dinesh\",\"doi\":\"10.1109/ICCPCT.2017.8074355\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech and text is the main medium for human communication. A person needs vision to access the information in a text. However those who have poor vision can gather information from voice. This paper proposes a camera based assistive text reading to help visually impaired person in reading the text present on the captured image. The faces can also be detected when a person enter into the frame by the mode control. The proposed idea involves text extraction from scanned image using Tesseract Optical Character Recognition (OCR) and converting the text to speech by e-Speak tool, a process which makes visually impaired persons to read the text. This is a prototype for blind people to recognize the products in real world by extracting the text on image and converting it into speech. Proposed method is carried out by using Raspberry pi and portability is achieved by using a battery backup. Thus the user can carry the device anywhere and able to use at any time. Upon entering the camera view previously stored faces are identified and informed which can be implemented as a future technology. This technology helps millions of people in the world who experience a significant loss of vision.\",\"PeriodicalId\":208028,\"journal\":{\"name\":\"2017 International Conference on Circuit ,Power and Computing Technologies (ICCPCT)\",\"volume\":\"63 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-04-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"33\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Circuit ,Power and Computing Technologies (ICCPCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCPCT.2017.8074355\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Circuit ,Power and Computing Technologies (ICCPCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCPCT.2017.8074355","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 33

摘要

语音和文本是人类交流的主要媒介。一个人需要视觉来获取文本中的信息。然而,那些视力差的人可以从声音中收集信息。本文提出了一种基于相机的辅助文本阅读方法,以帮助视障人士阅读所拍摄图像上的文本。当一个人进入帧时,也可以通过模式控制检测到人脸。所提出的想法包括使用Tesseract光学字符识别(OCR)从扫描图像中提取文本,并通过e-Speak工具将文本转换为语音,使视障人士能够阅读文本。通过提取图像上的文字并将其转换为语音,为盲人识别现实世界中的产品提供了一个原型。该方法采用树莓派实现,并通过使用备用电池实现可移植性。因此,用户可以携带设备到任何地方,并能够随时使用。一旦进入摄像头视图,之前存储的人脸就会被识别并告知,这可以作为未来的技术实现。这项技术帮助了世界上数百万视力严重丧失的人。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Text recognition and face detection aid for visually impaired person using Raspberry PI
Speech and text is the main medium for human communication. A person needs vision to access the information in a text. However those who have poor vision can gather information from voice. This paper proposes a camera based assistive text reading to help visually impaired person in reading the text present on the captured image. The faces can also be detected when a person enter into the frame by the mode control. The proposed idea involves text extraction from scanned image using Tesseract Optical Character Recognition (OCR) and converting the text to speech by e-Speak tool, a process which makes visually impaired persons to read the text. This is a prototype for blind people to recognize the products in real world by extracting the text on image and converting it into speech. Proposed method is carried out by using Raspberry pi and portability is achieved by using a battery backup. Thus the user can carry the device anywhere and able to use at any time. Upon entering the camera view previously stored faces are identified and informed which can be implemented as a future technology. This technology helps millions of people in the world who experience a significant loss of vision.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信