{"title":"Detection and classification of voice pathology using feature selection","authors":"Malak Al Mojaly, Muhammad Ghulam, M. Alsulaiman","doi":"10.1109/AICCSA.2014.7073250","DOIUrl":null,"url":null,"abstract":"The aim of this study is to apply automatic speech recognition (ASR) mechanism to improve the amount of information extracted from the voice and to increase the accuracy of the system by using selective highly discriminative features among different types of acoustic features. For feature extraction, we applied three techniques which are Mel Frequency Cepstral Coefficient (MFCC), Linear Prediction Cepstral Coefficients (LPCC), and RelAtive SpecTrA - Perceptual Linear Predictive (RASTA-PLP) with a number of selected coefficients from each technique by using t-test, Kruskal-Wallis test, or genetic algorithm (GA). Then for classification, either support vector machine (SVM) or Gaussian Mixture Model (GMM) is used. The experimental results on a selected MEEI subset database show that the proposed method gives high accuracies compared with some recent related methods both in detection and classification tasks. The highest accuracy of 99.9875 % with a standard deviation of 0.0263 is achieved in case of detection, and 99.8578 % with a standard deviation of 0.1657 in case of multi-class pathology classification.","PeriodicalId":412749,"journal":{"name":"2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA.2014.7073250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
The aim of this study is to apply automatic speech recognition (ASR) mechanism to improve the amount of information extracted from the voice and to increase the accuracy of the system by using selective highly discriminative features among different types of acoustic features. For feature extraction, we applied three techniques which are Mel Frequency Cepstral Coefficient (MFCC), Linear Prediction Cepstral Coefficients (LPCC), and RelAtive SpecTrA - Perceptual Linear Predictive (RASTA-PLP) with a number of selected coefficients from each technique by using t-test, Kruskal-Wallis test, or genetic algorithm (GA). Then for classification, either support vector machine (SVM) or Gaussian Mixture Model (GMM) is used. The experimental results on a selected MEEI subset database show that the proposed method gives high accuracies compared with some recent related methods both in detection and classification tasks. The highest accuracy of 99.9875 % with a standard deviation of 0.0263 is achieved in case of detection, and 99.8578 % with a standard deviation of 0.1657 in case of multi-class pathology classification.