Text recognition and face detection aid for visually impaired person using Raspberry PI

2017 International Conference on Circuit, Power and Computing Technologies (ICCPCT). Pub Date: 2017-04-20. DOI: 10.1109/ICCPCT.2017.8074355

M. Rajesh, Bindhu K. Rajan, A. Roy, K. Thomas, A. Thomas, T. B. Tharakan, C. Dinesh

Citations: 33

Abstract

Speech and text are the main media of human communication. A person needs vision to access the information in a text, but those with poor vision can still gather information by listening. This paper proposes a camera-based assistive text reader that helps visually impaired people read the text present in a captured image. A mode control also allows faces to be detected when a person enters the frame. The proposed approach extracts text from the scanned image using Tesseract Optical Character Recognition (OCR) and converts the text to speech with the e-Speak tool, which lets visually impaired people access the text. The prototype lets blind people recognize real-world products by extracting the text in an image and converting it into speech. The method is implemented on a Raspberry Pi, and portability is achieved with a battery backup, so the user can carry the device anywhere and use it at any time. When a person enters the camera view, previously stored faces are identified and announced to the user, a capability that can be implemented as a future technology. This technology can help the millions of people worldwide who experience significant vision loss.
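
The text-reading path described in the abstract (captured image, then Tesseract OCR, then e-Speak) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the `tesseract` and `espeak` command-line tools are installed on the Raspberry Pi, and the function names `clean_text` and `read_image_aloud`, the `run` parameter, and the default speaking rate are hypothetical choices made for this example.

```python
import subprocess


def clean_text(raw):
    """Collapse OCR line breaks and repeated spaces into one speakable line."""
    return " ".join(raw.split())


def read_image_aloud(image_path, run=subprocess.run, lang="eng", wpm=140):
    """Extract text from an image with Tesseract and speak it with eSpeak.

    `run` defaults to subprocess.run; it is a parameter (an assumption of
    this sketch) so the pipeline can be exercised without the real tools.
    """
    # "stdout" as the output base makes the tesseract CLI print the
    # recognized text instead of writing it to a .txt file.
    ocr = run(
        ["tesseract", str(image_path), "stdout", "-l", lang],
        capture_output=True, text=True, check=True,
    )
    text = clean_text(ocr.stdout)
    if text:
        # -s sets the speed in words per minute; --stdin reads the text
        # from standard input, so text starting with "-" is not
        # misread as a command-line option.
        run(["espeak", "-s", str(wpm), "--stdin"],
            input=text, text=True, check=True)
    return text
```

On a device with both tools installed, `read_image_aloud("label.png")` would speak whatever text Tesseract finds in the hypothetical file `label.png` and return it. The face-detection mode mentioned in the abstract is not covered by this sketch.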
