G. Muhammad, Khalid Almalki, Tamer A. Mesallam, M. Farahat, M. Alsulaiman
{"title":"Automatic Arabic digit speech recognition and formant analysis for voicing disordered people","authors":"G. Muhammad, Khalid Almalki, Tamer A. Mesallam, M. Farahat, M. Alsulaiman","doi":"10.1109/ISCI.2011.5959001","DOIUrl":null,"url":null,"abstract":"In this paper, analysis of speech from voice disordered people is performed from automatic speech recognition (ASR) point of view. Six different types of voicing disorder (pathological voice) are analyzed to show the difficulty of automatically recognizing their corresponding speech. As a case study, Arabic spoken digits are taken as input. The distribution of first four formants of vowel /a/ is extracted to show a significant deviation of formants from the normal speech to disordered speech. Experiment result reveals that current ASR technique is far from reliable performance in case of pathological speech, and thereby we need attention to this.","PeriodicalId":166647,"journal":{"name":"2011 IEEE Symposium on Computers & Informatics","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Computers & Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCI.2011.5959001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16
Abstract
In this paper, analysis of speech from voice disordered people is performed from automatic speech recognition (ASR) point of view. Six different types of voicing disorder (pathological voice) are analyzed to show the difficulty of automatically recognizing their corresponding speech. As a case study, Arabic spoken digits are taken as input. The distribution of first four formants of vowel /a/ is extracted to show a significant deviation of formants from the normal speech to disordered speech. Experiment result reveals that current ASR technique is far from reliable performance in case of pathological speech, and thereby we need attention to this.