Convolutional Neural Network Method for Classification of Syllables in Javanese Script

International Journal of Artificial Intelligence & Robotics (IJAIR) Pub Date : 2021-11-30 DOI:10.25139/ijair.v3i2.4395

Yulianti Fauziah, Kevin Aprilianta, H. Rustamaji


Citations: 1

Abstract

Javanese script is a writing system that forms a characteristic part of Javanese culture. It is commonly used to write the names of institutions and locations that have historical or tourism value. Because the script appears in public places, it is seen by many people, not only the Javanese, and some of them have difficulty recognizing the characters they encounter. The Convolutional Neural Network (CNN) is a pattern recognition and image processing method that uses convolution operations to extract features from images as the basis for classification. The process consists of initial data processing, classification, and syllable formation. The classification covers 48 classes of Javanese script types, namely basic letters (Carakan) and sound-modifying marks (Sandhangan). The resulting CNN model is evaluated with multi-class confusion matrix scenarios to determine its accuracy, precision, and recall. The CNN architecture consists of three convolution layers with max-pooling operations. The training configuration uses a learning rate of 0.0001, with 32, 64, and 128 filters in the three convolution layers, respectively. The dropout value is 0.5, and the fully connected layer has 1,024 neurons. The model achieved an average accuracy of 87.65%, an average precision of 88.01%, and an average recall of 87.70%.
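
The abstract reports accuracy, precision, and recall derived from a multi-class confusion matrix. As a minimal sketch of how such figures can be computed, the Python function below takes a square confusion matrix and returns overall accuracy together with macro-averaged precision and recall. The function name `macro_metrics`, the 3-class toy matrix, and the choice of macro averaging are assumptions made for illustration; the abstract does not state how the authors averaged their per-class values, and the toy counts are not data from the paper.

```python
from typing import List, Tuple


def macro_metrics(cm: List[List[int]]) -> Tuple[float, float, float]:
    """Return (accuracy, macro precision, macro recall).

    cm[i][j] is the number of samples whose true class is i and whose
    predicted class is j. The matrix is assumed to be square.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    correct = sum(cm[k][k] for k in range(n))
    accuracy = correct / total if total else 0.0

    precisions, recalls = [], []
    for k in range(n):
        tp = cm[k][k]
        predicted_k = sum(cm[i][k] for i in range(n))  # column sum: TP + FP
        actual_k = sum(cm[k])                          # row sum: TP + FN
        # A class that is never predicted (or never occurs) contributes 0.0
        # instead of raising a division-by-zero error.
        precisions.append(tp / predicted_k if predicted_k else 0.0)
        recalls.append(tp / actual_k if actual_k else 0.0)

    return accuracy, sum(precisions) / n, sum(recalls) / n


if __name__ == "__main__":
    # Hypothetical 3-class matrix; the paper's model has 48 classes.
    toy = [
        [9, 1, 0],
        [2, 7, 1],
        [0, 3, 7],
    ]
    acc, prec, rec = macro_metrics(toy)
    print(f"accuracy={acc:.4f} precision={prec:.4f} recall={rec:.4f}")
```

The same function applies unchanged to a 48 x 48 matrix. Macro averaging weights every class equally, which is one plausible reading of "average precision" and "average recall" when the classes are of similar size.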

