粗糙集用于印刷体泰文字识别的实验结果

Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239) Pub Date : 2001-08-19 DOI:10.1109/TENCON.2001.949608

S. Mitatha, K. Dejharn, F. Chevasuvit, B. Chankuang, W. Kasemsiri

{"title":"粗糙集用于印刷体泰文字识别的实验结果","authors":"S. Mitatha, K. Dejharn, F. Chevasuvit, B. Chankuang, W. Kasemsiri","doi":"10.1109/TENCON.2001.949608","DOIUrl":null,"url":null,"abstract":"This paper proposes the experimental results of using rough sets for the recognition of printed Thai characters. We segment each character into 32 pieces sized 4/spl times/4 pixels, and then find the distribution of pixels that match a value of \"1\" (black dot) in each section. Following this, we use the resulting 32 values as the attributes for each given object. Afterwards, we create 3 sets of decision making rules from 3 different training sets and use those 3 sets of rules to classify each member of the unknown set. This set is composed of 42 Thai characters, excluding the two that are very rarely used, with 7 fonts and 7 sizes, for a total of 2058. The results are 46.20%, 63.15%, 73.12% for the first set of rules, the second set of rules and the third set of rules respectively. And the results when applying the set of rules to the unknowns related to each set of rules' training sets are 100% for all three set of rules.","PeriodicalId":358168,"journal":{"name":"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Experimental results of using rough sets for printed Thai characters recognition\",\"authors\":\"S. Mitatha, K. Dejharn, F. Chevasuvit, B. Chankuang, W. Kasemsiri\",\"doi\":\"10.1109/TENCON.2001.949608\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes the experimental results of using rough sets for the recognition of printed Thai characters. We segment each character into 32 pieces sized 4/spl times/4 pixels, and then find the distribution of pixels that match a value of \\\"1\\\" (black dot) in each section. Following this, we use the resulting 32 values as the attributes for each given object. Afterwards, we create 3 sets of decision making rules from 3 different training sets and use those 3 sets of rules to classify each member of the unknown set. This set is composed of 42 Thai characters, excluding the two that are very rarely used, with 7 fonts and 7 sizes, for a total of 2058. The results are 46.20%, 63.15%, 73.12% for the first set of rules, the second set of rules and the third set of rules respectively. And the results when applying the set of rules to the unknowns related to each set of rules' training sets are 100% for all three set of rules.\",\"PeriodicalId\":358168,\"journal\":{\"name\":\"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON.2001.949608\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.2001.949608","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

摘要

本文给出了用粗糙集识别印刷泰文字的实验结果。我们将每个字符分割成32个大小为4/spl乘以/4像素的部分，然后找到每个部分中匹配值为“1”(黑点)的像素分布。接下来，我们使用得到的32个值作为每个给定对象的属性。然后，我们从3个不同的训练集中创建3组决策规则，并使用这3组规则对未知集的每个成员进行分类。这套字符集由42个泰文字符组成，其中不包括两个很少使用的字符，共有7种字体，7种大小，共2058个。第一套规则、第二套规则和第三套规则的结果分别为46.20%、63.15%、73.12%。当将一组规则应用于与每组规则的训练集相关的未知数时，所有三组规则的结果都是100%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Experimental results of using rough sets for printed Thai characters recognition

This paper proposes the experimental results of using rough sets for the recognition of printed Thai characters. We segment each character into 32 pieces sized 4/spl times/4 pixels, and then find the distribution of pixels that match a value of "1" (black dot) in each section. Following this, we use the resulting 32 values as the attributes for each given object. Afterwards, we create 3 sets of decision making rules from 3 different training sets and use those 3 sets of rules to classify each member of the unknown set. This set is composed of 42 Thai characters, excluding the two that are very rarely used, with 7 fonts and 7 sizes, for a total of 2058. The results are 46.20%, 63.15%, 73.12% for the first set of rules, the second set of rules and the third set of rules respectively. And the results when applying the set of rules to the unknowns related to each set of rules' training sets are 100% for all three set of rules.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)

自引率

0.00%

发文量