The 6th International Workshop on Historical Document Imaging and Processing最新文献

The BIR database – Identifying typographic emphasis in list-like historical documents BIR数据库——在类似列表的历史文档中识别排版重点

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476913

Anna Scius-Bertrand, Simon Gabay, Juliette Janes, L. Petkovic, Caroline Corbieres, Thibault Clérice

引用次数: 2

A survey of OCR evaluation tools and metrics OCR评估工具和指标的调查

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476888

Clemens Neudecker, Konstantin Baierer, Mike Gerber, C. Clausner, A. Antonacopoulos, S. Pletschacher

{"title":"A survey of OCR evaluation tools and metrics","authors":"Clemens Neudecker, Konstantin Baierer, Mike Gerber, C. Clausner, A. Antonacopoulos, S. Pletschacher","doi":"10.1145/3476887.3476888","DOIUrl":"https://doi.org/10.1145/3476887.3476888","url":null,"abstract":"The millions of pages of historical documents that are digitized in libraries are increasingly used in contexts that have more specific requirements for OCR quality than keyword search. How to comprehensively, efficiently and reliably assess the quality of OCR results against the background of mass digitization, when ground truth can only ever be produced for very small numbers? Due to gaps in specifications, results from OCR evaluation tools can return different results, and due to differences in implementation, even commonly used error rates are often not directly comparable. OCR evaluation metrics and sampling methods are also not sufficient where they do not take into account the accuracy of layout analysis, since for advanced use cases like Natural Language Processing or the Digital Humanities, accurate layout analysis and detection of the reading order are crucial. We provide an overview of OCR evaluation metrics and tools, describe two advanced use cases for OCR results, and perform an OCR evaluation experiment with multiple evaluation tools and different metrics for two distinct datasets. We analyze the differences and commonalities in light of the presented use cases and suggest areas for future work.","PeriodicalId":166776,"journal":{"name":"The 6th International Workshop on Historical Document Imaging and Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129010618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Visual Analysis of Chapbooks Printed in Scotland 苏格兰印本的视觉分析

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476893

Abhishek Dutta, G. Bergel, Andrew Zisserman

{"title":"Visual Analysis of Chapbooks Printed in Scotland","authors":"Abhishek Dutta, G. Bergel, Andrew Zisserman","doi":"10.1145/3476887.3476893","DOIUrl":"https://doi.org/10.1145/3476887.3476893","url":null,"abstract":"Chapbooks were short, cheap printed booklets produced in large quantities in Scotland, England, Ireland, North America and much of Europe between roughly the seventeenth and nineteenth centuries. A form of popular literature containing songs, stories, poems, games, riddles, religious writings and other content designed to appeal to a wide readership, they were frequently illustrated, particularly on their title-pages. This paper describes the visual analysis of such chapbook illustrations. We automatically extract all the illustrations contained in the National Library of Scotland Chapbooks Printed in Scotland dataset, and create a visual search engine to search this dataset using full or part-illustrations as queries. We also cluster these illustrations based on their visual content, and provide keyword-based search of the metadata associated with each publication. The visual search; clustering of illustrations based on visual content; and metadata search features enable researchers to forensically analyse the chapbooks dataset and to discover unnoticed relationships between its elements. We release all annotations and software tools described in this paper to enable reproduction of the results presented and to allow extension of the methodology described to datasets of a similar nature.","PeriodicalId":166776,"journal":{"name":"The 6th International Workshop on Historical Document Imaging and Processing","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123958825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Generalized Template Matching for Semi-structured Text 半结构化文本的广义模板匹配

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476895

G. Nagy

引用次数: 0

Text Detection and Recognition by using CNNs in the Austro-Hungarian Historical Military Mapping Survey 奥匈历史军事测绘调查中使用cnn的文本检测与识别

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476904

Y. Can, M. E. Kabadayı

引用次数: 2

Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers 基于图像的历史寄存器动作分割模型中包含关键字位置

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476905

Mélodie Boillet, Martin Maarand, T. Paquet, Christopher Kermorvant

{"title":"Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers","authors":"Mélodie Boillet, Martin Maarand, T. Paquet, Christopher Kermorvant","doi":"10.1145/3476887.3476905","DOIUrl":"https://doi.org/10.1145/3476887.3476905","url":null,"abstract":"The segmentation of complex images into semantic regions has seen a growing interest these last years with the advent of Deep Learning. Until recently, most existing methods for Historical Document Analysis focused on the visual appearance of documents, ignoring the rich information that textual content can offer. However, the segmentation of complex documents into semantic regions is sometimes impossible relying only on visual features and recent models embed both visual and textual information. In this paper, we focus on the use of both visual and textual information for segmenting historical registers into structured and meaningful units such as acts. An act is a text recording containing valuable knowledge such as demographic information (baptism, marriage or death) or royal decisions (donation or pardon). We propose a simple pipeline to enrich document images with the position of text lines containing key-phrases and show that running a standard image-based layout analysis system on these images can lead to significant gains. Our experiments show that the detection of acts increases from 38 % of mAP to 74 % when adding textual information, in real use-case conditions where text lines positions and content are extracted with an automatic recognition system.","PeriodicalId":166776,"journal":{"name":"The 6th International Workshop on Historical Document Imaging and Processing","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132275206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

GloSAT Historical Measurement Table Dataset: Enhanced Table Structure Recognition Annotation for Downstream Historical Data Rescue GloSAT历史测量表数据集:用于下游历史数据救援的增强表结构识别注释

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476890

Juliusz Ziomek, S. Middleton

{"title":"GloSAT Historical Measurement Table Dataset: Enhanced Table Structure Recognition Annotation for Downstream Historical Data Rescue","authors":"Juliusz Ziomek, S. Middleton","doi":"10.1145/3476887.3476890","DOIUrl":"https://doi.org/10.1145/3476887.3476890","url":null,"abstract":"Understanding and extracting tables from documents is a research problem that has been studied for decades. Table structure recognition is the labelling of components within a detected table, which can be detected automatically or manually provided. This paper presents the GloSAT historical measurement table dataset designed to train table structure recognition models for use in downstream historical data rescue applications. The dataset contains 500 scanned and manually annotated images of pages from meteorological measurement logbooks. We enhance standard full table and individual cell annotations by adding additional annotations for headings, headers, and table bodies. We also provide annotations for coarse segmentation cells consisting of multiple data cells logically grouped by ruling lines of ink or whitespace in the table, which often represent data cells that are semantically grouped. Our dataset annotations are provided in VOC2007 and ICDAR-2019 Competition on Table Detection and Recognition (cTDaR-19) XML formats, and our dataset can easily be aggregated with the cTDaR-19 dataset. We report results running a series of benchmark algorithms on our new dataset, concluding that post-processing is very important for performance, and that page style is not as significant a feature as table type on model performance.","PeriodicalId":166776,"journal":{"name":"The 6th International Workshop on Historical Document Imaging and Processing","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134487453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

BiblIA - a General Model for Medieval Hebrew Manuscripts and an Open Annotated Dataset 中世纪希伯来文手稿和开放注释数据集的通用模型

The 6th International Workshop on Historical Document Imaging and Processing Pub Date : 2021-09-05 DOI: 10.1145/3476887.3476896

D. Ezra, Bronson Brown-deVost, P. Jablonski, Hayim Lapin, Benjamin Kiessling, Elena Lolli

引用次数: 6