Moroccan Dialect Speech Recognition System Based on CMU SphinxTools

2020 International Conference on Intelligent Systems and Computer Vision (ISCV). Pub Date: 2020-06-01. DOI: 10.1109/ISCV49265.2020.9204250

Abderrahim Ezzine, H. Satori, Mohamed Hamidi, K. Satori


Citations: 5

Abstract



The main aim of an automatic speech recognition (ASR) system is to emulate a human listener by learning from speech data of the language under study. In this paper, we describe a speech recognition system for Darija, the Moroccan Arabic dialect, built to recognize the first ten Arabic digits spoken in Darija, using recordings collected from 20 male and female speakers. The system is built with the CMU Sphinx tools and uses Hidden Markov Models (HMMs) trained on a small dataset, with Mel-frequency cepstral coefficients (MFCCs) extracted in the feature extraction phase. The best accuracy obtained is 96.27%, achieved with 8 GMMs.
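The abstract reports an accuracy figure without stating how it is computed. As a minimal sketch, assuming the common word-accuracy convention (reference words minus substitutions, deletions, and insertions, divided by the number of reference words), the metric could be computed as below. The function names and the placeholder tokens `d1`, `d2`, and so on are hypothetical and are not taken from the paper.

```python
def word_errors(reference, hypothesis):
    """Minimum number of substitutions, deletions and insertions
    needed to turn the reference word list into the hypothesis."""
    n, m = len(reference), len(hypothesis)
    # dist[i][j] = edit distance between reference[:i] and hypothesis[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i  # delete all i reference words
    for j in range(m + 1):
        dist[0][j] = j  # insert all j hypothesis words
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if reference[i - 1] == hypothesis[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,         # deletion
                dist[i][j - 1] + 1,         # insertion
                dist[i - 1][j - 1] + cost,  # match or substitution
            )
    return dist[n][m]


def word_accuracy(reference, hypothesis):
    """Word accuracy in percent (assumed convention, not confirmed by the paper)."""
    if not reference:
        raise ValueError("reference must contain at least one word")
    errors = word_errors(reference, hypothesis)
    return 100.0 * (len(reference) - errors) / len(reference)


if __name__ == "__main__":
    # Hypothetical digit labels; one of four reference words is misrecognized.
    ref = ["d1", "d2", "d3", "d4"]
    hyp = ["d1", "d2", "d7", "d4"]
    print(word_accuracy(ref, hyp))
```

For isolated digit recognition, where each utterance contains a single word, this reduces to the percentage of utterances recognized correctly.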
