{"title":"Text independent writer identification of Arabic manuscripts and the effects of writers increase","authors":"S. Awaida","doi":"10.1109/ICCVIA.2015.7351881","DOIUrl":null,"url":null,"abstract":"This article addresses text-independent writer identification of Arabic manuscripts. Several types of statistical features are extracted from historical Arabic manuscripts. Gradient distribution features for Arabic handwritten text as well as windowed gradient distribution features, contour chain code distribution features, and windowed contour chain code distribution features are extracted. A nearest neighbor (NN) classifier is used with the Euclidean distance measure. Due to the lack of publicly available Arabic manuscript database, this work designed and collected a database of 10,000 Arabic manuscript images handwritten by 200 different historical scholars. Using 8,000 images for training and 2,000 images for testing, the proposed writer identification classifier achieved a top-1, top-5, and top-10 recognition rates of 93.95%, 98.30%, and 99.10%, respectively. The effects of increasing the number of writers on the accuracy results are presented and analyzed.","PeriodicalId":419122,"journal":{"name":"International Conference on Computer Vision and Image Analysis Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Computer Vision and Image Analysis Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCVIA.2015.7351881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This article addresses text-independent writer identification of Arabic manuscripts. Several types of statistical features are extracted from historical Arabic manuscripts. Gradient distribution features for Arabic handwritten text as well as windowed gradient distribution features, contour chain code distribution features, and windowed contour chain code distribution features are extracted. A nearest neighbor (NN) classifier is used with the Euclidean distance measure. Due to the lack of publicly available Arabic manuscript database, this work designed and collected a database of 10,000 Arabic manuscript images handwritten by 200 different historical scholars. Using 8,000 images for training and 2,000 images for testing, the proposed writer identification classifier achieved a top-1, top-5, and top-10 recognition rates of 93.95%, 98.30%, and 99.10%, respectively. The effects of increasing the number of writers on the accuracy results are presented and analyzed.