分区、直方图和结构方法对手写巽他语汉字分类的实验研究

2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT) Pub Date : 2018-11-01 DOI:10.1109/EIConCIT.2018.8878640

Eki Nugraha, Alifia Chinka Rizal Muhammad, L. Riza, Haviluddin

{"title":"分区、直方图和结构方法对手写巽他语汉字分类的实验研究","authors":"Eki Nugraha, Alifia Chinka Rizal Muhammad, L. Riza, Haviluddin","doi":"10.1109/EIConCIT.2018.8878640","DOIUrl":null,"url":null,"abstract":"Sundanese characters are one of the original Sundanese historical relics that have existed since the 5th century and have become the writing language at that time. Classification of handwriting characters is a challenge because the results of handwriting are very diverse, including the characters of handwritten characters. The number of feature extraction methods that can be used in the classification process, but not all feature extraction methods are in accordance with the characteristics of the Sundanese characters. Therefore, the focus of this research is to find the optimal feature extraction method to classify the character of Sundanese characters, in order to get better accuracy by running some experiments. Feature extraction methods proposed in this research are zoning, histograms and structural approaches. Then, some following classifier methods are used for constructing models and prediction over new data: Random Forest (RF), K-Nearest Neighbor (KNN), Artificial Neural Network (ANN), and Support Vector Machine (SVM). Based on the experiments, we can state that RF provided the best results (i.e., 89.84% in average) while the optimal feature-constructing method is by using the structural approach.","PeriodicalId":424909,"journal":{"name":"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)","volume":"48 19","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Experimental Study on Zoning, Histogram, and Structural Methods to Classify Sundanese Characters from Handwriting\",\"authors\":\"Eki Nugraha, Alifia Chinka Rizal Muhammad, L. Riza, Haviluddin\",\"doi\":\"10.1109/EIConCIT.2018.8878640\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sundanese characters are one of the original Sundanese historical relics that have existed since the 5th century and have become the writing language at that time. Classification of handwriting characters is a challenge because the results of handwriting are very diverse, including the characters of handwritten characters. The number of feature extraction methods that can be used in the classification process, but not all feature extraction methods are in accordance with the characteristics of the Sundanese characters. Therefore, the focus of this research is to find the optimal feature extraction method to classify the character of Sundanese characters, in order to get better accuracy by running some experiments. Feature extraction methods proposed in this research are zoning, histograms and structural approaches. Then, some following classifier methods are used for constructing models and prediction over new data: Random Forest (RF), K-Nearest Neighbor (KNN), Artificial Neural Network (ANN), and Support Vector Machine (SVM). Based on the experiments, we can state that RF provided the best results (i.e., 89.84% in average) while the optimal feature-constructing method is by using the structural approach.\",\"PeriodicalId\":424909,\"journal\":{\"name\":\"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)\",\"volume\":\"48 19\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIConCIT.2018.8878640\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIConCIT.2018.8878640","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

Sundanese汉字是最早的Sundanese历史遗迹之一，自5世纪以来一直存在，并成为当时的书写语言。手写字符的分类是一个挑战，因为手写的结果非常多样化，包括手写字符的字符。分类过程中可以使用的特征提取方法的数量，但并不是所有的特征提取方法都符合巽他语字符的特征。因此，本研究的重点是寻找最优的特征提取方法来对巽他语字符进行分类，并通过一些实验来获得更好的准确率。本研究提出的特征提取方法有分区法、直方图法和结构法。然后，利用随机森林(Random Forest, RF)、k近邻(K-Nearest Neighbor, KNN)、人工神经网络(Artificial Neural Network, ANN)和支持向量机(Support Vector Machine, SVM)等分类器方法对新数据进行建模和预测。通过实验，我们可以得出RF提供了最好的结果(平均为89.84%)，而最优的特征构建方法是使用结构方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Experimental Study on Zoning, Histogram, and Structural Methods to Classify Sundanese Characters from Handwriting

Sundanese characters are one of the original Sundanese historical relics that have existed since the 5th century and have become the writing language at that time. Classification of handwriting characters is a challenge because the results of handwriting are very diverse, including the characters of handwritten characters. The number of feature extraction methods that can be used in the classification process, but not all feature extraction methods are in accordance with the characteristics of the Sundanese characters. Therefore, the focus of this research is to find the optimal feature extraction method to classify the character of Sundanese characters, in order to get better accuracy by running some experiments. Feature extraction methods proposed in this research are zoning, histograms and structural approaches. Then, some following classifier methods are used for constructing models and prediction over new data: Random Forest (RF), K-Nearest Neighbor (KNN), Artificial Neural Network (ANN), and Support Vector Machine (SVM). Based on the experiments, we can state that RF provided the best results (i.e., 89.84% in average) while the optimal feature-constructing method is by using the structural approach.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 2nd East Indonesia Conference on Computer and Information Technology (EIConCIT)

自引率

0.00%

发文量