High-Quality Capture of Documents on a Cluttered Tabletop with a 4K Video Camera

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date : 2015-09-08 DOI:10.1145/2682571.2797074

Chelhwon Kim, Patrick Chiu, Henry Tang

引用次数: 2

Abstract

We present a novel system for detecting and capturing paper documents on a tabletop using a 4K video camera mounted overhead on pan-tilt servos. Our automated system first finds paper documents on a cluttered tabletop based on a text probability map, and then takes a sequence of high-resolution frames of the located document to reconstruct a high quality and fronto-parallel document page image. The quality of the resulting images enables OCR processing on the whole page. We performed a preliminary evaluation on a small set of 10 document pages and our proposed system achieved 98% accuracy with the open source Tesseract OCR engine.

查看原文本刊更多论文

用4K摄像机在杂乱的桌面上捕获高质量的文档

我们提出了一种新的系统，用于检测和捕获桌面上的纸质文件，该系统使用安装在平移伺服器上的4K摄像机。我们的自动化系统首先根据文本概率图在杂乱的桌面上找到纸质文档，然后对所定位文档的一系列高分辨率帧进行重建，以获得高质量的正面并行文档页面图像。结果图像的质量使OCR能够在整个页面上进行处理。我们对10个文档页面的一小部分进行了初步评估，我们提出的系统使用开源Tesseract OCR引擎达到了98%的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2015 ACM Symposium on Document Engineering

自引率

0.00%

发文量