LABA: Logical Layout Analysis of Book Page Images in Arabic Using Multiple Support Vector Machines

Wenda Qin, Randa I. Elanwar, Margrit Betke
{"title":"LABA: Logical Layout Analysis of Book Page Images in Arabic Using Multiple Support Vector Machines","authors":"Wenda Qin, Randa I. Elanwar, Margrit Betke","doi":"10.1109/ASAR.2018.8480095","DOIUrl":null,"url":null,"abstract":"Logical layout analysis, which determines the function of a document region, for example, whether it is a title, paragraph, or caption, is an indispensable part in a document understanding system. Rule-based algorithms have long been used for such systems. The datasets available have been small, and so the generalization of the performance of these systems is difficult to assess. In this paper, we present LABA, a supervised machine learning system based on multiple support vector machines for conducting a logical Layout Analysis of scanned pages of Books in Arabic. Our system labels the function (class) of a document(scanned book pages) region, based on its position on the page and other features. We evaluated LABA with the benchmark \"BCE-Arabic-v1\" dataset, which contains scanned pages of illustrated Arabic books. We obtained high recall and precision values, and found that the F-measure of LABA is higher for all classes except the \"noise\" class compared to a neural network method that was based on prior work.","PeriodicalId":165564,"journal":{"name":"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASAR.2018.8480095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Logical layout analysis, which determines the function of a document region, for example, whether it is a title, paragraph, or caption, is an indispensable part in a document understanding system. Rule-based algorithms have long been used for such systems. The datasets available have been small, and so the generalization of the performance of these systems is difficult to assess. In this paper, we present LABA, a supervised machine learning system based on multiple support vector machines for conducting a logical Layout Analysis of scanned pages of Books in Arabic. Our system labels the function (class) of a document(scanned book pages) region, based on its position on the page and other features. We evaluated LABA with the benchmark "BCE-Arabic-v1" dataset, which contains scanned pages of illustrated Arabic books. We obtained high recall and precision values, and found that the F-measure of LABA is higher for all classes except the "noise" class compared to a neural network method that was based on prior work.
LABA:使用多支持向量机的阿拉伯文图书页面图像逻辑布局分析
逻辑布局分析是文档理解系统中不可缺少的一部分,它决定了文档区域的功能,例如它是标题、段落还是标题。基于规则的算法长期以来一直用于此类系统。可用的数据集很小,因此很难评估这些系统性能的泛化。在本文中,我们提出了LABA,一个基于多个支持向量机的监督机器学习系统,用于对阿拉伯语图书扫描页进行逻辑布局分析。我们的系统根据文档(扫描图书页面)区域在页面上的位置和其他特征来标记其功能(类)。我们使用基准“BCE-Arabic-v1”数据集评估LABA,该数据集包含阿拉伯语插图书籍的扫描页面。我们获得了较高的召回率和精度值,并发现与基于先前工作的神经网络方法相比,LABA的F-measure在除“噪声”类之外的所有类别中都更高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信