Degraded Script Identification of Urdu and Devanagari Document-A Survey
S. Habib, M. Shukla, Rajiv Kapoor
2019 4th International Conference on Information Systems and Computer Networks (ISCON), 2019-11-01
DOI: https://doi.org/10.1109/ISCON47742.2019.9036305
Citations: 0
Abstract
Script identification, especially for non-Latin scripts, has gained the attention of researchers in both academia and industry. The task remains challenging because most existing research focuses on Latin scripts. Most work in this field deals only with recent or modern documents and font types; historical and degraded documents have received little attention in OCR research. This paper describes the stages required for script identification and gives a brief overview of the techniques for identifying and classifying characters in the Devanagari and Urdu scripts, with particular attention to degraded and historical texts. The paper concludes with directions for future work.