{"title":"利用聚类方法恢复缺失特征","authors":"H. T. Rassem, P. Girija","doi":"10.1109/SITIS.2010.30","DOIUrl":null,"url":null,"abstract":"The performance of the Automatic Speech Recognition (ASR) system reduces greatly when speech is corrupted by noise. In spectrogram representation of a speech signal, after deleting low SNR elements, incomplete spectrogram is obtained. In this case, the speech recognizer should make modifications to spectrogram to restore the missing elements, which is one direction. In another direction speech recognizer should be restoring the missing elements due to deleting low SNR elements before the recognition is performed, which can be done using the spectrogram reconstruction methods. In this paper, some spectrogram reconstruction methods suggested by some researchers are implemented as a toolbox using MATLAB and tested using Sphinx III software under different conditions such as different length of window and different length of utterances. These methods are called clustering statistical methods and tested with Sphinx III software developed by CMU, USA. Our speech corpus consists of 20 males and 20 females, each one has two different utterances.","PeriodicalId":128396,"journal":{"name":"2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Missing Features Restoration Using Clustering Methods\",\"authors\":\"H. T. Rassem, P. Girija\",\"doi\":\"10.1109/SITIS.2010.30\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The performance of the Automatic Speech Recognition (ASR) system reduces greatly when speech is corrupted by noise. In spectrogram representation of a speech signal, after deleting low SNR elements, incomplete spectrogram is obtained. In this case, the speech recognizer should make modifications to spectrogram to restore the missing elements, which is one direction. In another direction speech recognizer should be restoring the missing elements due to deleting low SNR elements before the recognition is performed, which can be done using the spectrogram reconstruction methods. In this paper, some spectrogram reconstruction methods suggested by some researchers are implemented as a toolbox using MATLAB and tested using Sphinx III software under different conditions such as different length of window and different length of utterances. These methods are called clustering statistical methods and tested with Sphinx III software developed by CMU, USA. Our speech corpus consists of 20 males and 20 females, each one has two different utterances.\",\"PeriodicalId\":128396,\"journal\":{\"name\":\"2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SITIS.2010.30\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2010.30","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Missing Features Restoration Using Clustering Methods
The performance of the Automatic Speech Recognition (ASR) system reduces greatly when speech is corrupted by noise. In spectrogram representation of a speech signal, after deleting low SNR elements, incomplete spectrogram is obtained. In this case, the speech recognizer should make modifications to spectrogram to restore the missing elements, which is one direction. In another direction speech recognizer should be restoring the missing elements due to deleting low SNR elements before the recognition is performed, which can be done using the spectrogram reconstruction methods. In this paper, some spectrogram reconstruction methods suggested by some researchers are implemented as a toolbox using MATLAB and tested using Sphinx III software under different conditions such as different length of window and different length of utterances. These methods are called clustering statistical methods and tested with Sphinx III software developed by CMU, USA. Our speech corpus consists of 20 males and 20 females, each one has two different utterances.