{"title":"盲人阅读器和对象检测器","authors":"M. Murali, Shreya Sharma, Neel Nagansure","doi":"10.1109/ICCSP48568.2020.9182201","DOIUrl":null,"url":null,"abstract":"This work aims to assist the visually impaired people for reading a text material and detect objects in their surroundings. The input is taken in the form of an image captured from the web camera. This image is then processed either for the purpose of text reading or for object detection based on user choice. The Raspberry Pi acts as the microcontroller for processing of the entire process. The text reading is supported by software named OCR. The read text is changed into an audio output using the TTS Synthesis. Other dependencies required for the process include Tesseract Library. The Object Detection is another aspect of the project which is implemented using a TensorFlow Object Detection API. It is able to detect various objects in its surroundings and provide an audio feedback about the same. The dataset can be trained on various different situations depending on the user needs, thus making it scalable","PeriodicalId":321133,"journal":{"name":"2020 International Conference on Communication and Signal Processing (ICCSP)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Reader and Object Detector for Blind\",\"authors\":\"M. Murali, Shreya Sharma, Neel Nagansure\",\"doi\":\"10.1109/ICCSP48568.2020.9182201\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work aims to assist the visually impaired people for reading a text material and detect objects in their surroundings. The input is taken in the form of an image captured from the web camera. This image is then processed either for the purpose of text reading or for object detection based on user choice. The Raspberry Pi acts as the microcontroller for processing of the entire process. The text reading is supported by software named OCR. The read text is changed into an audio output using the TTS Synthesis. Other dependencies required for the process include Tesseract Library. The Object Detection is another aspect of the project which is implemented using a TensorFlow Object Detection API. It is able to detect various objects in its surroundings and provide an audio feedback about the same. The dataset can be trained on various different situations depending on the user needs, thus making it scalable\",\"PeriodicalId\":321133,\"journal\":{\"name\":\"2020 International Conference on Communication and Signal Processing (ICCSP)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Communication and Signal Processing (ICCSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSP48568.2020.9182201\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Communication and Signal Processing (ICCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSP48568.2020.9182201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This work aims to assist the visually impaired people for reading a text material and detect objects in their surroundings. The input is taken in the form of an image captured from the web camera. This image is then processed either for the purpose of text reading or for object detection based on user choice. The Raspberry Pi acts as the microcontroller for processing of the entire process. The text reading is supported by software named OCR. The read text is changed into an audio output using the TTS Synthesis. Other dependencies required for the process include Tesseract Library. The Object Detection is another aspect of the project which is implemented using a TensorFlow Object Detection API. It is able to detect various objects in its surroundings and provide an audio feedback about the same. The dataset can be trained on various different situations depending on the user needs, thus making it scalable