基于多尺度笔划的页面分割方法

Mehdi Felhi, S. Tabbone, Maria V. Ortiz Segovia
{"title":"基于多尺度笔划的页面分割方法","authors":"Mehdi Felhi, S. Tabbone, Maria V. Ortiz Segovia","doi":"10.1109/DAS.2014.68","DOIUrl":null,"url":null,"abstract":"In this paper we present a new hybrid page segmentation approach based on connected component and region analysis. We first describe our stroke descriptor that detects text and line component candidates using the skeleton of the binarized document image. Then, an active contour model is applied to segment the rest of the image into photo and background regions. This classification is verified by studying the variation of each detected region. Finally, we cluster the text candidates using mean-shift analysis technique according to their corresponding sizes and we present our adaptive projection profile approach to gather separately horizontal and vertical text regions. The method is applied for segmenting realistic scanned document images (newspapers and magazines) that contain text, lines and photo regions. We evaluate the performances of our approach by comparing it to the existing methods that participated in ICDAR page segmentation competition.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Multiscale Stroke-Based Page Segmentation Approach\",\"authors\":\"Mehdi Felhi, S. Tabbone, Maria V. Ortiz Segovia\",\"doi\":\"10.1109/DAS.2014.68\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we present a new hybrid page segmentation approach based on connected component and region analysis. We first describe our stroke descriptor that detects text and line component candidates using the skeleton of the binarized document image. Then, an active contour model is applied to segment the rest of the image into photo and background regions. This classification is verified by studying the variation of each detected region. Finally, we cluster the text candidates using mean-shift analysis technique according to their corresponding sizes and we present our adaptive projection profile approach to gather separately horizontal and vertical text regions. The method is applied for segmenting realistic scanned document images (newspapers and magazines) that contain text, lines and photo regions. We evaluate the performances of our approach by comparing it to the existing methods that participated in ICDAR page segmentation competition.\",\"PeriodicalId\":220495,\"journal\":{\"name\":\"2014 11th IAPR International Workshop on Document Analysis Systems\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 11th IAPR International Workshop on Document Analysis Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DAS.2014.68\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 11th IAPR International Workshop on Document Analysis Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DAS.2014.68","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

本文提出了一种基于连通分量和区域分析的混合页面分割方法。我们首先描述我们的笔画描述符,它使用二值化文档图像的骨架检测文本和行组件候选。然后,应用活动轮廓模型将图像的其余部分分割为照片和背景区域。通过研究每个检测区域的变化来验证这种分类。最后,根据候选文本对应的大小,采用均值偏移分析技术对候选文本进行聚类,并提出了自适应投影轮廓法,分别对水平和垂直文本区域进行聚类。该方法用于分割包含文本、线条和照片区域的真实扫描文档图像(报纸和杂志)。我们通过将我们的方法与参与ICDAR页面分割竞争的现有方法进行比较来评估我们的方法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multiscale Stroke-Based Page Segmentation Approach
In this paper we present a new hybrid page segmentation approach based on connected component and region analysis. We first describe our stroke descriptor that detects text and line component candidates using the skeleton of the binarized document image. Then, an active contour model is applied to segment the rest of the image into photo and background regions. This classification is verified by studying the variation of each detected region. Finally, we cluster the text candidates using mean-shift analysis technique according to their corresponding sizes and we present our adaptive projection profile approach to gather separately horizontal and vertical text regions. The method is applied for segmenting realistic scanned document images (newspapers and magazines) that contain text, lines and photo regions. We evaluate the performances of our approach by comparing it to the existing methods that participated in ICDAR page segmentation competition.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信