Asroni, Mukhtar Hanafi, Cahya Damarjati, Priyangga Zulfajri, Dias Wirahastra Biwada, K. Ku-Mahamud
{"title":"Arabic pronunciation system based on padding pre-processing and deep learning techniques","authors":"Asroni, Mukhtar Hanafi, Cahya Damarjati, Priyangga Zulfajri, Dias Wirahastra Biwada, K. Ku-Mahamud","doi":"10.1063/5.0105052","DOIUrl":null,"url":null,"abstract":"Problems in learning due to the Covid-19 pandemic have occurred in several activities e.g. teaching in Taman Pendidikan Al-Qur'an (TPA). In carrying out its activities, TPA relies heavily on the teacher to make a learning pattern on how to pronounce the Arabic alphabet of 28 letters adequately. It requires a unique approach due to the various types of sound pronunciation in reading the Qur'an. In view of health protocol rules by the World Health Organization, face-to-face meetings are replaced with online sessions, thereby affecting the learning quality. To solve this problem, a system has been developed which consists of a deep learning model. Voice data were collected for TPA students and preprocessed before the voice data were used to develop the deep learning model. Four techniques were tested to identify the best technique for pre-processing the voice data. The techniques were Spectrogram, Padding, Mel-Spectrogram, and Mel-Frequency Cepstral Coefficient techniques. The pre-processing process is to prepare the training and testing data for the deep learning stage. Test questions were later appeared with an application designed with Tkinter to lay out the exam questions. Once the user pronounced a letter, the voice is recorded and pre-processed to be predicted with a deep learning model. The quality and similarity of pronunciation of the letter is validated an Arabic speech grading algorithm. Results showed that the used of Padding technique in the pre-processing stage provides the best classification accuracy for the Arabic letter pronunciation. © 2022 American Institute of Physics Inc.. All rights reserved.","PeriodicalId":298649,"journal":{"name":"THE 2ND INTERNATIONAL CONFERENCE OF SCIENCE AND INFORMATION TECHNOLOGY IN SMART ADMINISTRATION (ICSINTESA 2021)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"THE 2ND INTERNATIONAL CONFERENCE OF SCIENCE AND INFORMATION TECHNOLOGY IN SMART ADMINISTRATION (ICSINTESA 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1063/5.0105052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

由于Covid-19大流行,在一些活动中出现了学习问题,例如在Taman Pendidikan al - quuran (TPA)的教学。在开展活动的过程中,TPA很大程度上依赖于教师制定一种学习模式,即如何充分地发音阿拉伯字母的28个字母。由于诵读《古兰经》时有不同类型的读音,因此需要一种独特的方法。鉴于世界卫生组织的卫生议定书规则,面对面会议被在线会议取代,从而影响了学习质量。为了解决这个问题,我们开发了一个由深度学习模型组成的系统。收集TPA学生的语音数据并进行预处理,然后将语音数据用于开发深度学习模型。对四种技术进行了测试,以确定语音数据预处理的最佳技术。技术包括谱图技术、填充技术、梅尔谱图技术和梅尔频率倒谱系数技术。预处理过程是为深度学习阶段准备训练和测试数据。后来出现了一个由Tkinter设计的应用程序来列出考试问题。一旦用户念出一个字母,语音就会被记录下来,并经过预处理,用深度学习模型进行预测。用阿拉伯语语音分级算法对字母的发音质量和相似度进行了验证。结果表明,在预处理阶段使用填充技术对阿拉伯字母发音的分类精度最高。©2022美国物理学会。版权所有。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Arabic pronunciation system based on padding pre-processing and deep learning techniques
Problems in learning due to the Covid-19 pandemic have occurred in several activities e.g. teaching in Taman Pendidikan Al-Qur'an (TPA). In carrying out its activities, TPA relies heavily on the teacher to make a learning pattern on how to pronounce the Arabic alphabet of 28 letters adequately. It requires a unique approach due to the various types of sound pronunciation in reading the Qur'an. In view of health protocol rules by the World Health Organization, face-to-face meetings are replaced with online sessions, thereby affecting the learning quality. To solve this problem, a system has been developed which consists of a deep learning model. Voice data were collected for TPA students and preprocessed before the voice data were used to develop the deep learning model. Four techniques were tested to identify the best technique for pre-processing the voice data. The techniques were Spectrogram, Padding, Mel-Spectrogram, and Mel-Frequency Cepstral Coefficient techniques. The pre-processing process is to prepare the training and testing data for the deep learning stage. Test questions were later appeared with an application designed with Tkinter to lay out the exam questions. Once the user pronounced a letter, the voice is recorded and pre-processed to be predicted with a deep learning model. The quality and similarity of pronunciation of the letter is validated an Arabic speech grading algorithm. Results showed that the used of Padding technique in the pre-processing stage provides the best classification accuracy for the Arabic letter pronunciation. © 2022 American Institute of Physics Inc.. All rights reserved.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信