{"title":"普什图语文字的形状分析及OCR图像数据库的建立","authors":"Mehreen Wahab, Hassan Amin, F. Ahmed","doi":"10.1109/ICET.2009.5353160","DOIUrl":null,"url":null,"abstract":"Development of optical character recognition for the cursive script such as Pashto requires detailed knowledge of shape variation within Pashto script. The development of image dataset is essential for training/testing of various OCR approaches. This paper outlines various features of Pashto script, and describes the development of an image dataset for an optical character recognition system.","PeriodicalId":307661,"journal":{"name":"2009 International Conference on Emerging Technologies","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Shape analysis of Pashto script and creation of image database for OCR\",\"authors\":\"Mehreen Wahab, Hassan Amin, F. Ahmed\",\"doi\":\"10.1109/ICET.2009.5353160\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Development of optical character recognition for the cursive script such as Pashto requires detailed knowledge of shape variation within Pashto script. The development of image dataset is essential for training/testing of various OCR approaches. This paper outlines various features of Pashto script, and describes the development of an image dataset for an optical character recognition system.\",\"PeriodicalId\":307661,\"journal\":{\"name\":\"2009 International Conference on Emerging Technologies\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Emerging Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICET.2009.5353160\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Emerging Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICET.2009.5353160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Shape analysis of Pashto script and creation of image database for OCR
Development of optical character recognition for the cursive script such as Pashto requires detailed knowledge of shape variation within Pashto script. The development of image dataset is essential for training/testing of various OCR approaches. This paper outlines various features of Pashto script, and describes the development of an image dataset for an optical character recognition system.