{"title":"基于顺序浮动前向选择和数据挖掘分类器的阿拉伯语发音错误检测特征选择","authors":"M. Maqsood","doi":"10.57041/pjs.v68i4.230","DOIUrl":null,"url":null,"abstract":"Feature selection process is used to reduce the feature vector length and identify thediscriminative features. Many acoustic-phonetic features including Mel-Frequency CepstralCoefficient (MFCC), Energy, Pitch, Zero-crossing, spectrum were tested individually for Arabicmispronunciation detection using three classifiers; Random Forest, Bayesian classifier, and BaggedSupport Vector Machine (SVM). The results for Bagged SVM were better than the other twoclassifiers. Top three individual features with highest accuracies were identified for each isolatedArabic consonant. To validate the results, a modified form of Sequential Floating Forward Selection(SFFS) process was used. Results showed that MFCC along with its first and second derivatives,energy, spectrum, and zero-crossing were the most suitable acoustic features for Arabicmispronunciation detection system. The proposed approach provided an average accuracy of 94.9%which was better than the previous best 92.95% for Arabic consonants.","PeriodicalId":19787,"journal":{"name":"Pakistan journal of science","volume":"51 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FEATURE SELECTION FOR ARABIC MISPRONUNCIATION DETECTION BASED ON SEQUENTIAL FLOATING FORWARD SELECTION AND DATA MINING CLASSIFIERS\",\"authors\":\"M. Maqsood\",\"doi\":\"10.57041/pjs.v68i4.230\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature selection process is used to reduce the feature vector length and identify thediscriminative features. Many acoustic-phonetic features including Mel-Frequency CepstralCoefficient (MFCC), Energy, Pitch, Zero-crossing, spectrum were tested individually for Arabicmispronunciation detection using three classifiers; Random Forest, Bayesian classifier, and BaggedSupport Vector Machine (SVM). The results for Bagged SVM were better than the other twoclassifiers. Top three individual features with highest accuracies were identified for each isolatedArabic consonant. To validate the results, a modified form of Sequential Floating Forward Selection(SFFS) process was used. Results showed that MFCC along with its first and second derivatives,energy, spectrum, and zero-crossing were the most suitable acoustic features for Arabicmispronunciation detection system. The proposed approach provided an average accuracy of 94.9%which was better than the previous best 92.95% for Arabic consonants.\",\"PeriodicalId\":19787,\"journal\":{\"name\":\"Pakistan journal of science\",\"volume\":\"51 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pakistan journal of science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.57041/pjs.v68i4.230\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pakistan journal of science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.57041/pjs.v68i4.230","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
FEATURE SELECTION FOR ARABIC MISPRONUNCIATION DETECTION BASED ON SEQUENTIAL FLOATING FORWARD SELECTION AND DATA MINING CLASSIFIERS
Feature selection process is used to reduce the feature vector length and identify thediscriminative features. Many acoustic-phonetic features including Mel-Frequency CepstralCoefficient (MFCC), Energy, Pitch, Zero-crossing, spectrum were tested individually for Arabicmispronunciation detection using three classifiers; Random Forest, Bayesian classifier, and BaggedSupport Vector Machine (SVM). The results for Bagged SVM were better than the other twoclassifiers. Top three individual features with highest accuracies were identified for each isolatedArabic consonant. To validate the results, a modified form of Sequential Floating Forward Selection(SFFS) process was used. Results showed that MFCC along with its first and second derivatives,energy, spectrum, and zero-crossing were the most suitable acoustic features for Arabicmispronunciation detection system. The proposed approach provided an average accuracy of 94.9%which was better than the previous best 92.95% for Arabic consonants.