Thai Character Segmentation in Handwriting Images using Four Directional Depth First Search

Kittikhun Thongkanchorn, Sarattha Kanchanapreechakorn, Punyanuch Borwarnginn, Worapan Kusakunniran
{"title":"Thai Character Segmentation in Handwriting Images using Four Directional Depth First Search","authors":"Kittikhun Thongkanchorn, Sarattha Kanchanapreechakorn, Punyanuch Borwarnginn, Worapan Kusakunniran","doi":"10.1109/ICITEED.2019.8929972","DOIUrl":null,"url":null,"abstract":"One of the key processes for converting handwriting images into digital texts is the character segmentation. It is very challenge especially for the case of segmenting the hand-writing due to intra-variations of various writing styles and overlapping of characters between consecutive characters. This paper works on Thai characters in handwriting images. Thai characters consist of different types of consonants, tones and vowels, which are written in different manners. This paper proposes the 4 directional depth first search based approach for segmenting individual characters in both vertical and horizontal cutting aspects. The vertical cut is applied to segment each text column, while the horizontal cut is applied to segment individual characters. Then, the erosion with two structuring elements is used to split overlapped consecutive characters that may be remained after the main segmentation process. The proposed method is validated with 11,949 Thai characters in handwriting images. It achieves up to 90.76 % of the successful segmentation.","PeriodicalId":6598,"journal":{"name":"2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"107 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEED.2019.8929972","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

One of the key processes for converting handwriting images into digital texts is the character segmentation. It is very challenge especially for the case of segmenting the hand-writing due to intra-variations of various writing styles and overlapping of characters between consecutive characters. This paper works on Thai characters in handwriting images. Thai characters consist of different types of consonants, tones and vowels, which are written in different manners. This paper proposes the 4 directional depth first search based approach for segmenting individual characters in both vertical and horizontal cutting aspects. The vertical cut is applied to segment each text column, while the horizontal cut is applied to segment individual characters. Then, the erosion with two structuring elements is used to split overlapped consecutive characters that may be remained after the main segmentation process. The proposed method is validated with 11,949 Thai characters in handwriting images. It achieves up to 90.76 % of the successful segmentation.
使用四方向深度优先搜索的手写图像中泰文字符分割
将手写图像转换为数字文本的关键过程之一是字符分割。由于各种书写风格的内部变化和连续字符之间的重叠,对笔迹的分割是非常具有挑战性的。本文研究的是手写体图像中的泰文。泰语由不同类型的辅音、声调和元音组成,它们以不同的方式书写。本文提出了基于4方向深度优先搜索的纵向和横向字符分割方法。垂直切割应用于分割每个文本列,而水平切割应用于分割单个字符。然后,使用两个结构元素的侵蚀对主分割过程后可能保留的重叠连续字符进行分割。用11,949个泰文手写图像对该方法进行了验证。分割成功率达90.76%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信