Segmentation of Unconstrained Handwritten Hindi Words Using Polygonal Approximation
Kapil K. Upreti, Soumen Bag
2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), published 2016-10-01
DOI: https://doi.org/10.1109/ICFHR.2016.0039
Segmenting unconstrained handwritten words into characters in optically scanned document images is an essential task. It is challenging because of the wide variety of handwriting styles and pen types, poor image quality, and the lack of stroke-order information. This paper contributes methods for accurate full segmentation of Hindi word images into their constituent characters and modifiers. It follows a polygonal approximation approach and uses structural properties together with directional measures to determine segmentation points in Hindi word images. The main methodological contribution is the use of a polygonal approximation technique for word segmentation that is based on certain structural properties of the Hindi language. A second focus of this work is that segmentation is performed without removing the shirorekha (the horizontal headline that joins the characters of a word), which avoids the complexities present in earlier works. Experiments on real-world data show that the proposed method is always competitive and achieves top performance more often than any of the other measures.
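The abstract does not say which polygonal approximation algorithm the authors use. As a rough illustration of the general idea only, the sketch below applies the well-known Ramer-Douglas-Peucker algorithm, chosen here as a stand-in, to a traced stroke given as a list of (x, y) points. The function names, the sample stroke, and the tolerance value are all hypothetical and are not taken from the paper.

```python
from math import hypot


def point_segment_distance(p, a, b):
    """Return the distance from point p to the line segment a-b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0:
        # Degenerate segment: a and b coincide.
        return hypot(px - ax, py - ay)
    # Project p onto the segment and clamp to its endpoints.
    t = ((px - ax) * dx + (py - ay) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))
    return hypot(px - (ax + t * dx), py - (ay + t * dy))


def approximate_polyline(points, tolerance):
    """Simplify an open polyline with Ramer-Douglas-Peucker.

    Assumption: this is a generic stand-in, not the paper's algorithm.
    """
    if len(points) < 3:
        return list(points)
    first, last = points[0], points[-1]
    # Find the interior point farthest from the chord first-last.
    worst_index, worst_dist = 0, 0.0
    for i in range(1, len(points) - 1):
        d = point_segment_distance(points[i], first, last)
        if d > worst_dist:
            worst_index, worst_dist = i, d
    if worst_dist <= tolerance:
        # Every interior point is close enough: keep only the chord.
        return [first, last]
    # Otherwise split at the farthest point and recurse on both halves.
    left = approximate_polyline(points[: worst_index + 1], tolerance)
    right = approximate_polyline(points[worst_index:], tolerance)
    return left[:-1] + right


# Hypothetical stroke: a slightly wobbly horizontal bar followed by a
# slightly wobbly vertical bar.
stroke = [
    (0.0, 0.0), (1.0, 0.05), (2.0, -0.04), (3.0, 0.0),
    (3.05, 1.0), (2.96, 2.0), (3.0, 3.0),
]
print(approximate_polyline(stroke, tolerance=0.2))
```

The vertices that survive the simplification mark places where the stroke changes direction. Corner points of this kind are the sort of structural and directional cue that a segmentation method could inspect when looking for candidate cut points, although how the paper actually selects its segmentation points is not described in the abstract.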