粗糙集用于印刷体泰文字识别的实验结果

S. Mitatha, K. Dejharn, F. Chevasuvit, B. Chankuang, W. Kasemsiri
{"title":"粗糙集用于印刷体泰文字识别的实验结果","authors":"S. Mitatha, K. Dejharn, F. Chevasuvit, B. Chankuang, W. Kasemsiri","doi":"10.1109/TENCON.2001.949608","DOIUrl":null,"url":null,"abstract":"This paper proposes the experimental results of using rough sets for the recognition of printed Thai characters. We segment each character into 32 pieces sized 4/spl times/4 pixels, and then find the distribution of pixels that match a value of \"1\" (black dot) in each section. Following this, we use the resulting 32 values as the attributes for each given object. Afterwards, we create 3 sets of decision making rules from 3 different training sets and use those 3 sets of rules to classify each member of the unknown set. This set is composed of 42 Thai characters, excluding the two that are very rarely used, with 7 fonts and 7 sizes, for a total of 2058. The results are 46.20%, 63.15%, 73.12% for the first set of rules, the second set of rules and the third set of rules respectively. And the results when applying the set of rules to the unknowns related to each set of rules' training sets are 100% for all three set of rules.","PeriodicalId":358168,"journal":{"name":"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Experimental results of using rough sets for printed Thai characters recognition\",\"authors\":\"S. Mitatha, K. Dejharn, F. Chevasuvit, B. Chankuang, W. Kasemsiri\",\"doi\":\"10.1109/TENCON.2001.949608\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes the experimental results of using rough sets for the recognition of printed Thai characters. We segment each character into 32 pieces sized 4/spl times/4 pixels, and then find the distribution of pixels that match a value of \\\"1\\\" (black dot) in each section. Following this, we use the resulting 32 values as the attributes for each given object. Afterwards, we create 3 sets of decision making rules from 3 different training sets and use those 3 sets of rules to classify each member of the unknown set. This set is composed of 42 Thai characters, excluding the two that are very rarely used, with 7 fonts and 7 sizes, for a total of 2058. The results are 46.20%, 63.15%, 73.12% for the first set of rules, the second set of rules and the third set of rules respectively. And the results when applying the set of rules to the unknowns related to each set of rules' training sets are 100% for all three set of rules.\",\"PeriodicalId\":358168,\"journal\":{\"name\":\"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON.2001.949608\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.2001.949608","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

摘要

本文给出了用粗糙集识别印刷泰文字的实验结果。我们将每个字符分割成32个大小为4/spl乘以/4像素的部分,然后找到每个部分中匹配值为“1”(黑点)的像素分布。接下来,我们使用得到的32个值作为每个给定对象的属性。然后,我们从3个不同的训练集中创建3组决策规则,并使用这3组规则对未知集的每个成员进行分类。这套字符集由42个泰文字符组成,其中不包括两个很少使用的字符,共有7种字体,7种大小,共2058个。第一套规则、第二套规则和第三套规则的结果分别为46.20%、63.15%、73.12%。当将一组规则应用于与每组规则的训练集相关的未知数时,所有三组规则的结果都是100%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Experimental results of using rough sets for printed Thai characters recognition
This paper proposes the experimental results of using rough sets for the recognition of printed Thai characters. We segment each character into 32 pieces sized 4/spl times/4 pixels, and then find the distribution of pixels that match a value of "1" (black dot) in each section. Following this, we use the resulting 32 values as the attributes for each given object. Afterwards, we create 3 sets of decision making rules from 3 different training sets and use those 3 sets of rules to classify each member of the unknown set. This set is composed of 42 Thai characters, excluding the two that are very rarely used, with 7 fonts and 7 sizes, for a total of 2058. The results are 46.20%, 63.15%, 73.12% for the first set of rules, the second set of rules and the third set of rules respectively. And the results when applying the set of rules to the unknowns related to each set of rules' training sets are 100% for all three set of rules.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信