{"title":"Thai Character Segmentation in Handwriting Images using Four Directional Depth First Search","authors":"Kittikhun Thongkanchorn, Sarattha Kanchanapreechakorn, Punyanuch Borwarnginn, Worapan Kusakunniran","doi":"10.1109/ICITEED.2019.8929972","DOIUrl":null,"url":null,"abstract":"One of the key processes for converting handwriting images into digital texts is the character segmentation. It is very challenge especially for the case of segmenting the hand-writing due to intra-variations of various writing styles and overlapping of characters between consecutive characters. This paper works on Thai characters in handwriting images. Thai characters consist of different types of consonants, tones and vowels, which are written in different manners. This paper proposes the 4 directional depth first search based approach for segmenting individual characters in both vertical and horizontal cutting aspects. The vertical cut is applied to segment each text column, while the horizontal cut is applied to segment individual characters. Then, the erosion with two structuring elements is used to split overlapped consecutive characters that may be remained after the main segmentation process. The proposed method is validated with 11,949 Thai characters in handwriting images. It achieves up to 90.76 % of the successful segmentation.","PeriodicalId":6598,"journal":{"name":"2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"107 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEED.2019.8929972","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
One of the key processes for converting handwriting images into digital texts is the character segmentation. It is very challenge especially for the case of segmenting the hand-writing due to intra-variations of various writing styles and overlapping of characters between consecutive characters. This paper works on Thai characters in handwriting images. Thai characters consist of different types of consonants, tones and vowels, which are written in different manners. This paper proposes the 4 directional depth first search based approach for segmenting individual characters in both vertical and horizontal cutting aspects. The vertical cut is applied to segment each text column, while the horizontal cut is applied to segment individual characters. Then, the erosion with two structuring elements is used to split overlapped consecutive characters that may be remained after the main segmentation process. The proposed method is validated with 11,949 Thai characters in handwriting images. It achieves up to 90.76 % of the successful segmentation.