分区、直方图和结构方法对手写巽他语汉字分类的实验研究

Eki Nugraha, Alifia Chinka Rizal Muhammad, L. Riza, Haviluddin
{"title":"分区、直方图和结构方法对手写巽他语汉字分类的实验研究","authors":"Eki Nugraha, Alifia Chinka Rizal Muhammad, L. Riza, Haviluddin","doi":"10.1109/EIConCIT.2018.8878640","DOIUrl":null,"url":null,"abstract":"Sundanese characters are one of the original Sundanese historical relics that have existed since the 5th century and have become the writing language at that time. Classification of handwriting characters is a challenge because the results of handwriting are very diverse, including the characters of handwritten characters. The number of feature extraction methods that can be used in the classification process, but not all feature extraction methods are in accordance with the characteristics of the Sundanese characters. Therefore, the focus of this research is to find the optimal feature extraction method to classify the character of Sundanese characters, in order to get better accuracy by running some experiments. Feature extraction methods proposed in this research are zoning, histograms and structural approaches. Then, some following classifier methods are used for constructing models and prediction over new data: Random Forest (RF), K-Nearest Neighbor (KNN), Artificial Neural Network (ANN), and Support Vector Machine (SVM). Based on the experiments, we can state that RF provided the best results (i.e., 89.84% in average) while the optimal feature-constructing method is by using the structural approach.","PeriodicalId":424909,"journal":{"name":"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)","volume":"48 19","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Experimental Study on Zoning, Histogram, and Structural Methods to Classify Sundanese Characters from Handwriting\",\"authors\":\"Eki Nugraha, Alifia Chinka Rizal Muhammad, L. Riza, Haviluddin\",\"doi\":\"10.1109/EIConCIT.2018.8878640\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sundanese characters are one of the original Sundanese historical relics that have existed since the 5th century and have become the writing language at that time. Classification of handwriting characters is a challenge because the results of handwriting are very diverse, including the characters of handwritten characters. The number of feature extraction methods that can be used in the classification process, but not all feature extraction methods are in accordance with the characteristics of the Sundanese characters. Therefore, the focus of this research is to find the optimal feature extraction method to classify the character of Sundanese characters, in order to get better accuracy by running some experiments. Feature extraction methods proposed in this research are zoning, histograms and structural approaches. Then, some following classifier methods are used for constructing models and prediction over new data: Random Forest (RF), K-Nearest Neighbor (KNN), Artificial Neural Network (ANN), and Support Vector Machine (SVM). Based on the experiments, we can state that RF provided the best results (i.e., 89.84% in average) while the optimal feature-constructing method is by using the structural approach.\",\"PeriodicalId\":424909,\"journal\":{\"name\":\"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)\",\"volume\":\"48 19\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIConCIT.2018.8878640\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIConCIT.2018.8878640","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

Sundanese汉字是最早的Sundanese历史遗迹之一,自5世纪以来一直存在,并成为当时的书写语言。手写字符的分类是一个挑战,因为手写的结果非常多样化,包括手写字符的字符。分类过程中可以使用的特征提取方法的数量,但并不是所有的特征提取方法都符合巽他语字符的特征。因此,本研究的重点是寻找最优的特征提取方法来对巽他语字符进行分类,并通过一些实验来获得更好的准确率。本研究提出的特征提取方法有分区法、直方图法和结构法。然后,利用随机森林(Random Forest, RF)、k近邻(K-Nearest Neighbor, KNN)、人工神经网络(Artificial Neural Network, ANN)和支持向量机(Support Vector Machine, SVM)等分类器方法对新数据进行建模和预测。通过实验,我们可以得出RF提供了最好的结果(平均为89.84%),而最优的特征构建方法是使用结构方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Experimental Study on Zoning, Histogram, and Structural Methods to Classify Sundanese Characters from Handwriting
Sundanese characters are one of the original Sundanese historical relics that have existed since the 5th century and have become the writing language at that time. Classification of handwriting characters is a challenge because the results of handwriting are very diverse, including the characters of handwritten characters. The number of feature extraction methods that can be used in the classification process, but not all feature extraction methods are in accordance with the characteristics of the Sundanese characters. Therefore, the focus of this research is to find the optimal feature extraction method to classify the character of Sundanese characters, in order to get better accuracy by running some experiments. Feature extraction methods proposed in this research are zoning, histograms and structural approaches. Then, some following classifier methods are used for constructing models and prediction over new data: Random Forest (RF), K-Nearest Neighbor (KNN), Artificial Neural Network (ANN), and Support Vector Machine (SVM). Based on the experiments, we can state that RF provided the best results (i.e., 89.84% in average) while the optimal feature-constructing method is by using the structural approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信