结合空间背景后ocr地图图像

Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery Pub Date : 2022-11-01 DOI:10.1145/3557918.3565864

M. Namgung, Yao-Yi Chiang

{"title":"结合空间背景后ocr地图图像","authors":"M. Namgung, Yao-Yi Chiang","doi":"10.1145/3557918.3565864","DOIUrl":null,"url":null,"abstract":"Extracting text from historical maps using Optical Character Recognition (OCR) engines often results in partially or incorrectly recognized words due to complex map content. Previous work utilizes lexical-based approaches with linguistic context or applies language models to correct OCR results for documents. However, these post-OCR methods cannot directly consider spatial relations of map text for correction. For example, \"Mississippi\" and \"River\" constitute the place phrase \"Mississippi River\" (linguistic relation), and near \"highway\", there are likely to exist intersected \"road\" to enter the \"highway\" (spatial relation). This paper presents a novel approach that exploits the spatial arrangement of map text using a contextual language model, BART [6] for post-processing of map text from OCR. The approach first structures word-level map text into sentences based on their spatial arrangement while preserving the spatial location of words constituting a place name and corrects imperfect OCR text using neighboring information. To train BART for capturing spatial relations in map text, we automatically generate large numbers of synthetic maps to fine-tune BART with location names and their spatial context. We conduct experiments on synthetic and real-world historical maps of various map styles and scales and show that the proposed method can achieve significant improvement over the commonly used lexical approach.","PeriodicalId":428859,"journal":{"name":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Incorporating spatial context for post-OCR in map images\",\"authors\":\"M. Namgung, Yao-Yi Chiang\",\"doi\":\"10.1145/3557918.3565864\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Extracting text from historical maps using Optical Character Recognition (OCR) engines often results in partially or incorrectly recognized words due to complex map content. Previous work utilizes lexical-based approaches with linguistic context or applies language models to correct OCR results for documents. However, these post-OCR methods cannot directly consider spatial relations of map text for correction. For example, \\\"Mississippi\\\" and \\\"River\\\" constitute the place phrase \\\"Mississippi River\\\" (linguistic relation), and near \\\"highway\\\", there are likely to exist intersected \\\"road\\\" to enter the \\\"highway\\\" (spatial relation). This paper presents a novel approach that exploits the spatial arrangement of map text using a contextual language model, BART [6] for post-processing of map text from OCR. The approach first structures word-level map text into sentences based on their spatial arrangement while preserving the spatial location of words constituting a place name and corrects imperfect OCR text using neighboring information. To train BART for capturing spatial relations in map text, we automatically generate large numbers of synthetic maps to fine-tune BART with location names and their spatial context. We conduct experiments on synthetic and real-world historical maps of various map styles and scales and show that the proposed method can achieve significant improvement over the commonly used lexical approach.\",\"PeriodicalId\":428859,\"journal\":{\"name\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3557918.3565864\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3557918.3565864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

由于复杂的地图内容，使用光学字符识别(OCR)引擎从历史地图中提取文本通常会导致部分或错误地识别单词。以前的工作利用基于词汇的方法与语言上下文或应用语言模型来纠正文档的OCR结果。然而，这些后ocr方法不能直接考虑地图文本的空间关系进行校正。例如，“Mississippi”和“River”构成了地方短语“Mississippi River”(语言关系)，而在“highway”附近，很可能存在交叉的“road”进入“highway”(空间关系)。本文提出了一种新的方法，利用上下文语言模型BART[6]利用地图文本的空间排列对OCR中的地图文本进行后处理。该方法首先将词级地图文本根据其空间排列结构成句子，同时保留构成地名的词的空间位置，并利用相邻信息纠正不完善的OCR文本。为了训练BART捕捉地图文本中的空间关系，我们自动生成大量合成地图，用位置名称及其空间上下文对BART进行微调。我们对各种地图样式和比例尺的合成历史地图和现实世界的历史地图进行了实验，结果表明所提出的方法比常用的词法方法取得了显著的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Incorporating spatial context for post-OCR in map images

Extracting text from historical maps using Optical Character Recognition (OCR) engines often results in partially or incorrectly recognized words due to complex map content. Previous work utilizes lexical-based approaches with linguistic context or applies language models to correct OCR results for documents. However, these post-OCR methods cannot directly consider spatial relations of map text for correction. For example, "Mississippi" and "River" constitute the place phrase "Mississippi River" (linguistic relation), and near "highway", there are likely to exist intersected "road" to enter the "highway" (spatial relation). This paper presents a novel approach that exploits the spatial arrangement of map text using a contextual language model, BART [6] for post-processing of map text from OCR. The approach first structures word-level map text into sentences based on their spatial arrangement while preserving the spatial location of words constituting a place name and corrects imperfect OCR text using neighboring information. To train BART for capturing spatial relations in map text, we automatically generate large numbers of synthetic maps to fine-tune BART with location names and their spatial context. We conduct experiments on synthetic and real-world historical maps of various map styles and scales and show that the proposed method can achieve significant improvement over the commonly used lexical approach.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery

自引率

0.00%

发文量