Degraded Script Identification of Urdu and Devanagari Document-A Survey

2019 4th International Conference on Information Systems and Computer Networks (ISCON) Pub Date : 2019-11-01 DOI:10.1109/ISCON47742.2019.9036305

S. Habib, M. Shukla, Rajiv Kapoor

引用次数: 0

Abstract

Script identification especially for non-Latin script have gained the attention of researchers from both the academics and industry. There are lots of challenges associated with this since most of the existing research focuses on Latin scripts. Most of the researches in this field are working only with the latest or modern documents and font types. Historical and Degraded documents have not been given much importance in the OCR research. This paper provides the different stages required for the identification of the scripts. A brief overview of the different techniques for identifying and classifying the characters in Devanagari and Urdu Script. This has been performed especially for degraded and historical texts. The paper has been concluded with a strong future scope.

查看原文本刊更多论文

乌尔都语和德文语文献的退化文字鉴定综述

文字识别，尤其是非拉丁文字的识别，已经引起了学术界和工业界的广泛关注。由于大多数现有的研究都集中在拉丁文字上，因此与此相关的挑战很多。该领域的大多数研究都只使用最新或现代的文档和字体类型。历史文献和退化文献在OCR研究中一直没有得到重视。本文提供了识别脚本所需的不同阶段。德文加里语和乌尔都语文字的不同识别和分类技术概述。这已经被执行，特别是对退化和历史文本。这篇论文的结论具有很强的未来前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 4th International Conference on Information Systems and Computer Networks (ISCON)

自引率

0.00%

发文量