Affixal approach for Arabic decomposable word recognition: A validation on the multi-font printed script

Souhir Ben Chobba, Sirine Barkallah, S. Kanoun
Published in: 2014 Information and Communication Technologies Innovation and Application (ICTIA), 2014-03-01
DOI: 10.1109/ICTIA.2014.7883755 (https://doi.org/10.1109/ICTIA.2014.7883755)
Citations: 0

Abstract

This paper presents a feasibility study and a validation of the affixal approach to Arabic decomposable word recognition on multi-font, multi-size printed script, with several improvements integrated. It also details a comparative study between this approach, which is based on linguistic concepts of vocabulary, and the different methods of lexical knowledge integration (a language dictionary and a statistical language model) used within the analytical approach. Results obtained with a dictionary of 159,661 words and a database of 2,010 word images confirm the contribution of the affixal approach over the analytical approach at the level of word-hypothesis filtering, which considerably improves the recognition rate.
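The abstract does not say how word hypotheses are filtered, so the following is only a minimal sketch of the general idea of affix-based filtering, not the authors' method. It assumes a recognizer emits candidate strings and keeps those that split into a known prefix, stem, and suffix. The tables `PREFIXES`, `STEMS`, `SUFFIXES` and the functions `decompose` and `filter_hypotheses` are hypothetical, and the entries are toy Latin transliterations rather than data from the paper.

```python
from typing import Iterable, List, Optional, Set, Tuple

# Hypothetical toy tables (Latin transliteration); not taken from the paper.
PREFIXES: Set[str] = {"", "al", "wa", "bi"}
SUFFIXES: Set[str] = {"", "at", "un", "ha"}
STEMS: Set[str] = {"kitab", "qalam", "madrasa"}


def decompose(word: str) -> Optional[Tuple[str, str, str]]:
    """Return the first (prefix, stem, suffix) split whose parts are all known, else None."""
    for i in range(len(word) + 1):
        prefix = word[:i]
        if prefix not in PREFIXES:
            continue
        for j in range(i, len(word) + 1):
            stem, suffix = word[i:j], word[j:]
            if stem in STEMS and suffix in SUFFIXES:
                return prefix, stem, suffix
    return None


def filter_hypotheses(hypotheses: Iterable[str]) -> List[str]:
    """Keep only the hypotheses that admit a valid affix decomposition."""
    return [h for h in hypotheses if decompose(h) is not None]


# Candidate strings such as a character recognizer might propose (made up here).
candidates = ["alkitab", "alkitah", "waqalamun", "bimadrasaha", "xkitab"]
print(filter_hypotheses(candidates))
# → ['alkitab', 'waqalamun', 'bimadrasaha']
```

In this sketch, three small tables stand in for a list of every inflected surface form, which is one plausible reason a decomposition-based filter could be attractive next to a full-form dictionary lookup. Whether the paper organizes its lexical knowledge this way is not stated in the abstract.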