{"title":"语音信号特征波形的低复杂度分解","authors":"Guiping Wang, C. Bao","doi":"10.1109/CHINSL.2004.1409607","DOIUrl":null,"url":null,"abstract":"For efficient coding of speech, it is desirable to separate the slowly and rapidly evolving spectral components to take advantage of their different perceptual qualities. Existing decomposition methods are too inflexible to model transient changes in the speech signals, require high delay or produce a large parameter set that is not scalable to low rates. We present a low complexity decomposition method, based on SVD, applied to waveform interpolation (WI) coding. This scheme reduces the computational complexity of the common SVD method in WI by exploiting the properties of human auditory perception to lower the dimensions of the decomposition matrix. This method requires only a single frame of speech and overcomes the substantial delay problems. The quantization solution involves the use of vector quantization on the separately decomposed singular matrices, U, V, and the diagonal matrix of singular values, S. The quality of the reconstructed speech can be varied according to the scalable decomposition and the bit rate available.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Low complexity decomposition for the characteristic waveform of speech signal\",\"authors\":\"Guiping Wang, C. Bao\",\"doi\":\"10.1109/CHINSL.2004.1409607\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For efficient coding of speech, it is desirable to separate the slowly and rapidly evolving spectral components to take advantage of their different perceptual qualities. Existing decomposition methods are too inflexible to model transient changes in the speech signals, require high delay or produce a large parameter set that is not scalable to low rates. We present a low complexity decomposition method, based on SVD, applied to waveform interpolation (WI) coding. This scheme reduces the computational complexity of the common SVD method in WI by exploiting the properties of human auditory perception to lower the dimensions of the decomposition matrix. This method requires only a single frame of speech and overcomes the substantial delay problems. The quantization solution involves the use of vector quantization on the separately decomposed singular matrices, U, V, and the diagonal matrix of singular values, S. The quality of the reconstructed speech can be varied according to the scalable decomposition and the bit rate available.\",\"PeriodicalId\":212562,\"journal\":{\"name\":\"2004 International Symposium on Chinese Spoken Language Processing\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2004 International Symposium on Chinese Spoken Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CHINSL.2004.1409607\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2004 International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2004.1409607","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Low complexity decomposition for the characteristic waveform of speech signal
For efficient coding of speech, it is desirable to separate the slowly and rapidly evolving spectral components to take advantage of their different perceptual qualities. Existing decomposition methods are too inflexible to model transient changes in the speech signals, require high delay or produce a large parameter set that is not scalable to low rates. We present a low complexity decomposition method, based on SVD, applied to waveform interpolation (WI) coding. This scheme reduces the computational complexity of the common SVD method in WI by exploiting the properties of human auditory perception to lower the dimensions of the decomposition matrix. This method requires only a single frame of speech and overcomes the substantial delay problems. The quantization solution involves the use of vector quantization on the separately decomposed singular matrices, U, V, and the diagonal matrix of singular values, S. The quality of the reconstructed speech can be varied according to the scalable decomposition and the bit rate available.