{"title":"A method for text-line segmentation for unconstrained Arabic and Persian handwritten text image","authors":"Reza Shakoori","doi":"10.1109/IRI.2014.7051909","DOIUrl":null,"url":null,"abstract":"One of the challenging parts of freestyle handwritten text documents recognition area is text line segmentation problem. Curvilinear text lines and small gaps between neighboring text lines present a challenge to algorithms developed for machine printed or hand-printed documents. In this paper, we propose a novel approach based on painting algorithm by dividing of a text image into number of vertical segments which is called striping. As Arabic and Persian scripts present a lot of dots, we considered historical available nastaliq scanned pages for experiments. Results show the proposed algorithm is robust to scale change, rotation, and noise. The proposed method may contribute significantly for the development of applications related to OCR.","PeriodicalId":360013,"journal":{"name":"Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2014.7051909","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
One of the challenging parts of freestyle handwritten text documents recognition area is text line segmentation problem. Curvilinear text lines and small gaps between neighboring text lines present a challenge to algorithms developed for machine printed or hand-printed documents. In this paper, we propose a novel approach based on painting algorithm by dividing of a text image into number of vertical segments which is called striping. As Arabic and Persian scripts present a lot of dots, we considered historical available nastaliq scanned pages for experiments. Results show the proposed algorithm is robust to scale change, rotation, and noise. The proposed method may contribute significantly for the development of applications related to OCR.