Convolutional Neural Network Method for Classification of Syllables in Javanese Script

Yulianti Fauziah, Kevin Aprilianta, H. Rustamaji
{"title":"Convolutional Neural Network Method for Classification of Syllables in Javanese Script","authors":"Yulianti Fauziah, Kevin Aprilianta, H. Rustamaji","doi":"10.25139/ijair.v3i2.4395","DOIUrl":null,"url":null,"abstract":"Javanese script is one of the languages which are a typical Javanese culture. Javanese script is seen in its use in writing the name of a particular agency or location that has historical and tourism value. The use of Javanese script in public places makes the existence of this script seen by many people, not only by the Javanese people. Some of them have difficulty recognizing the Javanese characters they encounter. One method of pattern recognition and image processing is Convolutional Neural Network (CNN). CNN is a method that uses convolution operations in performing feature extraction on images as a basis for classification. The process consists of initial data processing, classification, and syllable formation. The classification consists of 48 classes covering Javanese script types, namely basic letters (Carakan) and voice-modifying scripts (Sandhangan). It is tested with multi-class confusion matrix scenarios to determine the accuracy, precision, and recall of the built CNN model. The CNN architecture consists of three convolution layers with max-pooling operations. The training configuration includes a learning rate of 0.0001, and the number of filters for each convolution layer is 32, 64, and 128 filters. The dropout value used is 0.5, and the number of neurons in the fully-connected layer is 1,024 neurons. The average performance value of accuracy reached 87.65%, the average precision value was 88.01%, and the average recall value was 87.70%.","PeriodicalId":208192,"journal":{"name":"International Journal of Artificial Intelligence & Robotics (IJAIR)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Artificial Intelligence & Robotics (IJAIR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25139/ijair.v3i2.4395","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Javanese script is one of the languages which are a typical Javanese culture. Javanese script is seen in its use in writing the name of a particular agency or location that has historical and tourism value. The use of Javanese script in public places makes the existence of this script seen by many people, not only by the Javanese people. Some of them have difficulty recognizing the Javanese characters they encounter. One method of pattern recognition and image processing is Convolutional Neural Network (CNN). CNN is a method that uses convolution operations in performing feature extraction on images as a basis for classification. The process consists of initial data processing, classification, and syllable formation. The classification consists of 48 classes covering Javanese script types, namely basic letters (Carakan) and voice-modifying scripts (Sandhangan). It is tested with multi-class confusion matrix scenarios to determine the accuracy, precision, and recall of the built CNN model. The CNN architecture consists of three convolution layers with max-pooling operations. The training configuration includes a learning rate of 0.0001, and the number of filters for each convolution layer is 32, 64, and 128 filters. The dropout value used is 0.5, and the number of neurons in the fully-connected layer is 1,024 neurons. The average performance value of accuracy reached 87.65%, the average precision value was 88.01%, and the average recall value was 87.70%.
爪哇文字音节分类的卷积神经网络方法
爪哇文字是爪哇文化的典型语言之一。爪哇文字用于书写具有历史和旅游价值的特定机构或地点的名称。在公共场所使用爪哇文字,使得这种文字的存在被很多人看到,而不仅仅是爪哇人。他们中的一些人很难识别他们遇到的爪哇文字。卷积神经网络(CNN)是模式识别和图像处理的一种方法。CNN是一种利用卷积运算对图像进行特征提取作为分类基础的方法。该过程包括初始数据处理、分类和音节形成。该分类包括48类爪哇文字类型,即基本字母(Carakan)和语音修改脚本(Sandhangan)。用多类混淆矩阵场景对其进行测试,以确定所建CNN模型的准确性、精密度和召回率。CNN架构由三个具有最大池化操作的卷积层组成。训练配置包括学习率为0.0001,每个卷积层的过滤器数量分别为32、64和128个过滤器。使用的dropout值为0.5,全连接层的神经元数为1024个神经元。准确率的平均性能值达到87.65%,准确率的平均性能值达到88.01%,召回率的平均性能值达到87.70%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信