Alphabet Classification of Sign System Using Convolutional Neural Network with Contrast Limited Adaptive Histogram Equalization and Canny Edge Detection

Ahmad Solikhin Gayuh Raharjo, E. Sugiharti
{"title":"Alphabet Classification of Sign System Using Convolutional Neural Network with Contrast Limited Adaptive Histogram Equalization and Canny Edge Detection","authors":"Ahmad Solikhin Gayuh Raharjo, E. Sugiharti","doi":"10.15294/sji.v10i3.44137","DOIUrl":null,"url":null,"abstract":"Purpose: There are deaf people who have problems in communicating orally because they do not have the ability to speak and hear. The sign system is used as a solution to this problem, but not everyone understands the use and meaning of the sign system, even in terms of the alphabet. Therefore, it is necessary to classify a sign system in the form of American Sign Language (ASL) using Artificial Intelligence technology to get good results.Methods: This research focuses on improving the accuracy of ASL alphabet classification using the VGG-19 and ResNet50 architecture of the Convolutional Neural Network (CNN) method combined with Contrast Limited Adaptive Histogram Equalization (CLAHE) to improve the detail quality of images and Canny Edge Detection to produce images that focus on the objects in it. The focused result is the accuracy value. This study uses the ASL alphabet dataset from Kaggle.Result: Based on the test results, there are three best accuracy results. The first is using the ResNet50 architecture, CLAHE, and an image size of 128 x 128 pixels with an accuracy of 99.9%, followed by the ResNet50 architecture, CLAHE + Canny Edge Detection, and an image size of 128 x 128 pixels with an accuracy of 99.82 %, and in third place are the VGG-19 architecture, CLAHE, and an image size of 128 x 128 pixels with an accuracy of 98.93%.Novelty: The novelty of this study is the increase in the accuracy value of ASL image classification from previous studies.","PeriodicalId":30781,"journal":{"name":"Scientific Journal of Informatics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Journal of Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15294/sji.v10i3.44137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: There are deaf people who have problems in communicating orally because they do not have the ability to speak and hear. The sign system is used as a solution to this problem, but not everyone understands the use and meaning of the sign system, even in terms of the alphabet. Therefore, it is necessary to classify a sign system in the form of American Sign Language (ASL) using Artificial Intelligence technology to get good results.Methods: This research focuses on improving the accuracy of ASL alphabet classification using the VGG-19 and ResNet50 architecture of the Convolutional Neural Network (CNN) method combined with Contrast Limited Adaptive Histogram Equalization (CLAHE) to improve the detail quality of images and Canny Edge Detection to produce images that focus on the objects in it. The focused result is the accuracy value. This study uses the ASL alphabet dataset from Kaggle.Result: Based on the test results, there are three best accuracy results. The first is using the ResNet50 architecture, CLAHE, and an image size of 128 x 128 pixels with an accuracy of 99.9%, followed by the ResNet50 architecture, CLAHE + Canny Edge Detection, and an image size of 128 x 128 pixels with an accuracy of 99.82 %, and in third place are the VGG-19 architecture, CLAHE, and an image size of 128 x 128 pixels with an accuracy of 98.93%.Novelty: The novelty of this study is the increase in the accuracy value of ASL image classification from previous studies.
基于对比度有限自适应直方图均衡化和Canny边缘检测的卷积神经网络符号系统字母表分类
目的:有些聋人在口头交流方面有问题,因为他们没有说话和听的能力。符号系统被用来解决这个问题,但并不是每个人都理解符号系统的用途和含义,即使是在字母表方面。因此,有必要利用人工智能技术对美国手语形式的手语系统进行分类,以获得良好的效果。方法:本研究的重点是提高ASL字母分类的准确性,使用卷积神经网络(CNN)方法的VGG-19和ResNet50架构,结合对比度有限自适应直方图均衡(CLAHE)来提高图像的细节质量,并使用Canny边缘检测来生成聚焦于其中对象的图像。所关注的结果是准确度值。本研究使用了Kaggle的ASL字母数据集。结果:根据测试结果,有三个准确度最好的结果。第一种是使用ResNet50架构CLAHE,其图像大小为128 x 128像素,准确率为99.9%,其次是ResNet50体系结构CLAHE+Canny边缘检测,其图像尺寸为128 x 28像素,准确度为99.82%,第三种是VGG-19架构CLAHE,图像大小为128 x 128像素,准确率为98.93%。新颖性:本研究的新颖性在于ASL图像分类的准确度值比以前的研究有所提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
13
审稿时长
24 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信