Using projection and loop for segmentation of touching Thai typewritten

S. Watcharabutsarakham
IEEE International Symposium on Communications and Information Technology, 2004 (ISCIT 2004)
Published: 2004-10-26
DOI: 10.1109/ISCIT.2004.1412896 (https://doi.org/10.1109/ISCIT.2004.1412896)
Citations: 4

Abstract

This paper proposes a segmentation technique for touching Thai typewritten characters. Thai characters vary in size and position within a sentence. A Thai word is composed of consonants, vowels, and tone marks. Characters can touch in both the horizontal and vertical directions. The proposed technique uses structural characteristics to detect suitable segmentation points in both directions. The segmentation process consists of four steps. First, the height and then the position of the characters are used to identify character zones. Next, size together with the horizontal and vertical projections is used to classify the type of touching. Lastly, the touching characters are segmented using the directions and positions identified in the previous steps. The edge of the touching characters is used to identify the edges of the two isolated characters. The technique was tested on text from both electronic typewriters and manual portable typewriters. A segmentation accuracy of 95.4% was obtained on two hundred sentences from typewritten thesis documents.
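
The abstract does not give the algorithm in detail, so the following is only a minimal sketch of the general projection idea it mentions, not the paper's method. It assumes a binary image stored as a list of rows (1 = ink, 0 = background). The function names and the `max_ink` threshold are hypothetical choices made for illustration. Columns where the vertical projection is a thin local minimum are reported as candidate cut points between horizontally touching characters.

```python
from typing import List

# Assumed representation: binary image as rows of 0/1 (1 = ink).
Image = List[List[int]]


def vertical_projection(img: Image) -> List[int]:
    """Count ink pixels in each column."""
    if not img:
        return []
    return [sum(row[x] for row in img) for x in range(len(img[0]))]


def horizontal_projection(img: Image) -> List[int]:
    """Count ink pixels in each row."""
    return [sum(row) for row in img]


def candidate_cut_columns(img: Image, max_ink: int = 1) -> List[int]:
    """Return interior columns that still contain ink, hold at most
    `max_ink` ink pixels, and are local minima of the vertical projection.

    `max_ink` is a hypothetical threshold, not a value from the paper.
    """
    profile = vertical_projection(img)
    cuts = []
    for x in range(1, len(profile) - 1):
        thin = 0 < profile[x] <= max_ink
        local_min = profile[x] <= profile[x - 1] and profile[x] <= profile[x + 1]
        if thin and local_min:
            cuts.append(x)
    return cuts


# Two 3x3 blobs joined by a one-pixel bridge in column 3.
sample = [
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
]
print(vertical_projection(sample))    # [3, 3, 3, 1, 3, 3, 3]
print(horizontal_projection(sample))  # [6, 7, 6]
print(candidate_cut_columns(sample))  # [3]
```

Applying the same test to the horizontal projection of a character zone would give candidate rows for vertically touching characters. A real system would also need the zone and size information the abstract describes in order to reject false cuts inside a single character.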