CalliScan

The Adjunct Publication of the 32nd Annual ACM Symposium on User Interface Software and Technology Pub Date : 2019-10-14 DOI:10.1145/3332167.3357119

Oleksandr Viatchaninov, Valerii Dziubliuk, Olga Radyvonenko, Yevhenii Yakishyn, Mykhailo Zlotnyk

引用次数: 3

Abstract

In this work, a solution for handwriting text extraction from images with visual user assistance is proposed. Use of end-to-end systems that pipe together text detection and recognition is often awkward because the user cannot influence the detection stage. On the other hand, glossing over the word's regions to help system with text localization requires a manual job and can be unacceptable. This paper proposes a solution that gives visual cues to the user during a detection stage. These hints differ from traditional bounding boxes in two ways. Firstly, the found text is surrounded with polygonal bounding reflecting a possible complex nature of text blocks. Secondly, TextRadar scanning effect provides a non-overloaded camera view, helping the user to capture the most relevant part of the text on image on-the-fly. CalliScan works on-device and keeps the user's privacy. The evaluation study has shown that users need such a solution, but it is necessary to carefully handle the text layout complexity.

查看原文本刊更多论文

CalliScan

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The Adjunct Publication of the 32nd Annual ACM Symposium on User Interface Software and Technology

自引率

0.00%

发文量