{"title":"Missing Features Restoration Using Clustering Methods","authors":"H. T. Rassem, P. Girija","doi":"10.1109/SITIS.2010.30","DOIUrl":null,"url":null,"abstract":"The performance of the Automatic Speech Recognition (ASR) system reduces greatly when speech is corrupted by noise. In spectrogram representation of a speech signal, after deleting low SNR elements, incomplete spectrogram is obtained. In this case, the speech recognizer should make modifications to spectrogram to restore the missing elements, which is one direction. In another direction speech recognizer should be restoring the missing elements due to deleting low SNR elements before the recognition is performed, which can be done using the spectrogram reconstruction methods. In this paper, some spectrogram reconstruction methods suggested by some researchers are implemented as a toolbox using MATLAB and tested using Sphinx III software under different conditions such as different length of window and different length of utterances. These methods are called clustering statistical methods and tested with Sphinx III software developed by CMU, USA. Our speech corpus consists of 20 males and 20 females, each one has two different utterances.","PeriodicalId":128396,"journal":{"name":"2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2010.30","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The performance of the Automatic Speech Recognition (ASR) system reduces greatly when speech is corrupted by noise. In spectrogram representation of a speech signal, after deleting low SNR elements, incomplete spectrogram is obtained. In this case, the speech recognizer should make modifications to spectrogram to restore the missing elements, which is one direction. In another direction speech recognizer should be restoring the missing elements due to deleting low SNR elements before the recognition is performed, which can be done using the spectrogram reconstruction methods. In this paper, some spectrogram reconstruction methods suggested by some researchers are implemented as a toolbox using MATLAB and tested using Sphinx III software under different conditions such as different length of window and different length of utterances. These methods are called clustering statistical methods and tested with Sphinx III software developed by CMU, USA. Our speech corpus consists of 20 males and 20 females, each one has two different utterances.