{"title":"基于深度学习的视障可穿戴视觉-听觉感官替代系统","authors":"Zifeng Wang, Heng Li, Jianping Chen, X. Chai, Zhenzhen Zhai","doi":"10.1109/dsins54396.2021.9670599","DOIUrl":null,"url":null,"abstract":"Visual impairment has caused serious influence on the human being and society. Due to more sensitive hearing and touch, for visually impaired people, it is an available solution to improve their quality of live, work and study by Sensory Substitution Devices (SSDs) which transfer visual information to audio or touch. In this paper, we proposed a wearable, vision-to-audio sensory substitution system with scene-perception-based deep learning to help the visually impaired users recognize and locate normal objects in the environment. The system consists of a wireless camera module, a Bluetooth speech feedback module with a microphone, and an Android mobile phone with a customized application. The camera module captures images from the scene and sends them to the application of Android mobile phone. The Bluetooth speech feedback module sends speech commands to application and broadcasts speech guidance to visually impaired users. The application based on Android platform loads speech recognition and object detection models. The system has been proved to provide an effective way to help the visually impaired people recognize and locate objects.","PeriodicalId":243724,"journal":{"name":"2021 International Conference on Digital Society and Intelligent Systems (DSInS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Wearable Vision-To-Audio Sensory Substitution System Based on Deep Learning for the Visually Impaired\",\"authors\":\"Zifeng Wang, Heng Li, Jianping Chen, X. Chai, Zhenzhen Zhai\",\"doi\":\"10.1109/dsins54396.2021.9670599\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Visual impairment has caused serious influence on the human being and society. Due to more sensitive hearing and touch, for visually impaired people, it is an available solution to improve their quality of live, work and study by Sensory Substitution Devices (SSDs) which transfer visual information to audio or touch. In this paper, we proposed a wearable, vision-to-audio sensory substitution system with scene-perception-based deep learning to help the visually impaired users recognize and locate normal objects in the environment. The system consists of a wireless camera module, a Bluetooth speech feedback module with a microphone, and an Android mobile phone with a customized application. The camera module captures images from the scene and sends them to the application of Android mobile phone. The Bluetooth speech feedback module sends speech commands to application and broadcasts speech guidance to visually impaired users. The application based on Android platform loads speech recognition and object detection models. 
The system has been proved to provide an effective way to help the visually impaired people recognize and locate objects.\",\"PeriodicalId\":243724,\"journal\":{\"name\":\"2021 International Conference on Digital Society and Intelligent Systems (DSInS)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Digital Society and Intelligent Systems (DSInS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/dsins54396.2021.9670599\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Digital Society and Intelligent Systems (DSInS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/dsins54396.2021.9670599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Wearable Vision-To-Audio Sensory Substitution System Based on Deep Learning for the Visually Impaired
Visual impairment has a serious impact on individuals and society. Because visually impaired people often rely on more sensitive hearing and touch, Sensory Substitution Devices (SSDs), which convert visual information into audio or tactile signals, offer a practical way to improve their quality of life, work, and study. In this paper, we propose a wearable vision-to-audio sensory substitution system built on scene-perception-based deep learning to help visually impaired users recognize and locate common objects in their environment. The system consists of a wireless camera module, a Bluetooth speech feedback module with a microphone, and an Android mobile phone running a customized application. The camera module captures images of the scene and sends them to the Android application. The Bluetooth speech feedback module relays the user's speech commands to the application and broadcasts speech guidance back to the user. The Android application loads the speech recognition and object detection models. The system has been shown to provide an effective way to help visually impaired people recognize and locate objects.
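To make the described pipeline concrete, the sketch below shows one plausible Android-side loop in Kotlin: a frame received from the camera module is run through a TensorFlow Lite object detector, and the most confident detection is announced through the platform TextToSpeech engine, mirroring the "recognize and locate" feedback described above. The abstract does not specify the model, label set, tensor layout, or thresholds; the SSD-style output format, the label list, `maxDetections`, and `scoreThreshold` here are all illustrative assumptions, not the authors' implementation.

```kotlin
import android.content.Context
import android.speech.tts.TextToSpeech
import org.tensorflow.lite.Interpreter
import java.nio.ByteBuffer

// Hypothetical detection-to-speech pipeline. The model, labels, output
// tensor layout, and threshold are illustrative assumptions only; the
// paper does not report its exact configuration.
class DetectionSpeaker(context: Context, modelBuffer: ByteBuffer) {

    private val interpreter = Interpreter(modelBuffer)
    private val tts = TextToSpeech(context) { /* init status ignored in this sketch */ }
    private val labels = listOf("person", "chair", "cup", "door") // assumed label set
    private val maxDetections = 10
    private val scoreThreshold = 0.5f

    // Accepts one preprocessed RGB frame (e.g., 300x300 uint8 for an
    // SSD-style detector) received from the wireless camera module.
    fun processFrame(frame: ByteBuffer) {
        // Standard SSD-style TFLite outputs: boxes, classes, scores, count.
        val boxes = Array(1) { Array(maxDetections) { FloatArray(4) } }
        val classes = Array(1) { FloatArray(maxDetections) }
        val scores = Array(1) { FloatArray(maxDetections) }
        val count = FloatArray(1)
        val outputs = mapOf(0 to boxes, 1 to classes, 2 to scores, 3 to count)

        interpreter.runForMultipleInputsOutputs(arrayOf<Any>(frame), outputs)

        // Announce only the most confident detection above the threshold.
        val best = scores[0].indices.maxByOrNull { scores[0][it] } ?: return
        if (scores[0][best] < scoreThreshold) return

        val label = labels.getOrElse(classes[0][best].toInt()) { "object" }
        // Boxes are normalized [ymin, xmin, ymax, xmax]; the horizontal
        // center gives a rough bearing for the spoken guidance.
        val xCenter = (boxes[0][best][1] + boxes[0][best][3]) / 2f
        val direction = when {
            xCenter < 0.33f -> "on your left"
            xCenter > 0.66f -> "on your right"
            else -> "ahead"
        }
        tts.speak("$label $direction", TextToSpeech.QUEUE_ADD, null, "detection")
    }
}
```

In the full system a speech recognition model would also run alongside this loop to interpret the user's voice commands arriving from the Bluetooth module; only the detection-to-speech path is sketched here.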