Laypa: A Novel Framework for Applying Segmentation Networks to Historical Documents

Proceedings of the 7th International Workshop on Historical Document Imaging and Processing. Pub Date: 2023-08-25. DOI: 10.1145/3604951.3605520

Stefan Klut, Rutger van Koert, R. Sluijter

Citations: 0

Abstract

We present novel software for extracting layout information from scans of historical documents. The model uses a ResNet backbone with a feature pyramid head. Region information is extracted directly into PageXML. For baseline extraction, we use a two-stage processing approach. The software has been applied successfully in several projects, and the results show that automatically labeling text lines and regions in historical documents is feasible.
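To make the PageXML output step concrete, the following is a minimal sketch of how region polygons and baselines might be serialized into a PAGE-style XML document. It is not Laypa's code: the function names (`points_attr`, `build_page_xml`), the input layout, and the choice of the 2019-07-15 PAGE schema namespace are all assumptions made for illustration, and the output is not validated against the schema.

```python
import xml.etree.ElementTree as ET

# PAGE content schema namespace. Which schema version Laypa emits is an assumption here.
PAGE_NS = "http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15"


def points_attr(points):
    """Format (x, y) pairs as the PAGE 'points' attribute: 'x1,y1 x2,y2 ...'."""
    return " ".join(f"{int(x)},{int(y)}" for x, y in points)


def build_page_xml(image_filename, width, height, regions):
    """Build a PAGE-style XML string (hypothetical helper, not part of Laypa).

    `regions` is a list of dicts with a 'coords' polygon and a list of 'lines';
    each line is a dict with a 'coords' polygon and a 'baseline' polyline.
    """
    ET.register_namespace("", PAGE_NS)  # serialize with a default namespace
    root = ET.Element(f"{{{PAGE_NS}}}PcGts")
    page = ET.SubElement(
        root,
        f"{{{PAGE_NS}}}Page",
        {
            "imageFilename": image_filename,
            "imageWidth": str(width),
            "imageHeight": str(height),
        },
    )
    for i, region in enumerate(regions):
        region_el = ET.SubElement(page, f"{{{PAGE_NS}}}TextRegion", {"id": f"region_{i}"})
        ET.SubElement(region_el, f"{{{PAGE_NS}}}Coords", {"points": points_attr(region["coords"])})
        for j, line in enumerate(region.get("lines", [])):
            line_el = ET.SubElement(
                region_el, f"{{{PAGE_NS}}}TextLine", {"id": f"region_{i}_line_{j}"}
            )
            ET.SubElement(line_el, f"{{{PAGE_NS}}}Coords", {"points": points_attr(line["coords"])})
            ET.SubElement(line_el, f"{{{PAGE_NS}}}Baseline", {"points": points_attr(line["baseline"])})
    return ET.tostring(root, encoding="unicode")


# Example input: one rectangular region containing a single text line.
example_regions = [
    {
        "coords": [(0, 0), (100, 0), (100, 50), (0, 50)],
        "lines": [
            {
                "coords": [(5, 10), (95, 10), (95, 40), (5, 40)],
                "baseline": [(5, 35), (95, 35)],
            }
        ],
    }
]
xml_text = build_page_xml("scan_0001.jpg", 100, 50, example_regions)
```

In a real pipeline the polygons and baselines would come from the segmentation output rather than being hard-coded as in `example_regions`.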


