A video based interface to textual information for the visually impaired

Proceedings. Fourth IEEE International Conference on Multimodal Interfaces Pub Date : 2002-10-14 DOI:10.1109/ICMI.2002.1167016

Ali Zandifar, R. Duraiswami, Antoine Chahine, L. Davis

引用次数: 45

Abstract

We describe the development of an interface to textual information for the visually impaired that uses video, image processing, optical-character-recognition (OCR) and text-to-speech (TTS). The video provides a sequence of low resolution images in which text must be detected, rectified and converted into high resolution rectangular blocks that are capable of being analyzed via off-the-shelf OCR. To achieve this, various problems related to feature detection, mosaicing, auto-focus, zoom, and systems integration were solved in the development of the system.

查看原文本刊更多论文

为视障人士提供的基于视频的文本信息界面

我们描述了一个使用视频、图像处理、光学字符识别(OCR)和文本到语音(TTS)的视障人士文本信息接口的开发。视频提供了一系列低分辨率图像，其中的文本必须被检测、校正并转换为高分辨率的矩形块，以便通过现成的OCR进行分析。为了实现这一目标，在系统的开发中解决了与特征检测、拼接、自动对焦、变焦和系统集成相关的各种问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. Fourth IEEE International Conference on Multimodal Interfaces

自引率

0.00%

发文量