从扫描文档中提取多方向手写注释

2014 11th IAPR International Workshop on Document Analysis Systems Pub Date : 2014-04-07 DOI:10.1109/DAS.2014.17

M. B. Jlaiel, R. Mullot, A. Alimi

{"title":"从扫描文档中提取多方向手写注释","authors":"M. B. Jlaiel, R. Mullot, A. Alimi","doi":"10.1109/DAS.2014.17","DOIUrl":null,"url":null,"abstract":"In this paper, we present an integrated system able to localize multi-oriented handwritten annotations in scanned documents. Unlike previous single methods which limit colors or types of annotations to be extracted, the proposed method attempts to extract annotations by fusing three feature extraction techniques based on internal and external shape analysis. Our method consists of two processes: 1) a coarse segmentation process which divides the scanned document into text and non-text regions. 2) A fine segmentation process which consists of three steps: a feature extraction process, a classification process and a majority voting process which identifies the segmented regions as machine-printed or handwritten annotations. We find that our adaptive method outperform all individual methods. Experimental results on a set of 301 annotated scanned documents are reported.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"88 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Multi-oriented Handwritten Annotations Extraction from Scanned Documents\",\"authors\":\"M. B. Jlaiel, R. Mullot, A. Alimi\",\"doi\":\"10.1109/DAS.2014.17\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present an integrated system able to localize multi-oriented handwritten annotations in scanned documents. Unlike previous single methods which limit colors or types of annotations to be extracted, the proposed method attempts to extract annotations by fusing three feature extraction techniques based on internal and external shape analysis. Our method consists of two processes: 1) a coarse segmentation process which divides the scanned document into text and non-text regions. 2) A fine segmentation process which consists of three steps: a feature extraction process, a classification process and a majority voting process which identifies the segmented regions as machine-printed or handwritten annotations. We find that our adaptive method outperform all individual methods. Experimental results on a set of 301 annotated scanned documents are reported.\",\"PeriodicalId\":220495,\"journal\":{\"name\":\"2014 11th IAPR International Workshop on Document Analysis Systems\",\"volume\":\"88 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 11th IAPR International Workshop on Document Analysis Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DAS.2014.17\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 11th IAPR International Workshop on Document Analysis Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DAS.2014.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

在本文中，我们提出了一个能够定位扫描文档中多方向手写注释的集成系统。不同于以往单一的方法对提取标注的颜色或类型的限制，该方法尝试融合基于内外形状分析的三种特征提取技术来提取标注。我们的方法包括两个过程:1)粗分割过程，将扫描文档分为文本区域和非文本区域。2)精细分割过程包括三个步骤:特征提取过程，分类过程和多数投票过程，该过程将分割的区域识别为机器打印或手写注释。我们发现我们的自适应方法优于所有单独的方法。报道了301份带注释的扫描文档的实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-oriented Handwritten Annotations Extraction from Scanned Documents

In this paper, we present an integrated system able to localize multi-oriented handwritten annotations in scanned documents. Unlike previous single methods which limit colors or types of annotations to be extracted, the proposed method attempts to extract annotations by fusing three feature extraction techniques based on internal and external shape analysis. Our method consists of two processes: 1) a coarse segmentation process which divides the scanned document into text and non-text regions. 2) A fine segmentation process which consists of three steps: a feature extraction process, a classification process and a majority voting process which identifies the segmented regions as machine-printed or handwritten annotations. We find that our adaptive method outperform all individual methods. Experimental results on a set of 301 annotated scanned documents are reported.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2014 11th IAPR International Workshop on Document Analysis Systems

自引率

0.00%

发文量