{"title":"Index assignment for predictive wideband LSF quantization","authors":"V.T. Ruoppila, S. Ragot","doi":"10.1109/SCFT.2000.878415","DOIUrl":"https://doi.org/10.1109/SCFT.2000.878415","url":null,"abstract":"In this paper we summarize some results derived earlier for the mean-square channel distortion of an autoregressive moving average (ARMA) vector quantizer with a maximum entropy encoder when the channel is assumed binary symmetric and memoryless. We discuss the required assumptions and their practical consequences in index assignment of ARMA vector quantizers. The discussion relates also to channel optimization of these quantizers. Furthermore, we compare noisy channel performance of memoryless, moving average, and autoregressive two-stage vector quantizers in line spectrum frequency quantization applied to wideband speech coding.","PeriodicalId":359453,"journal":{"name":"2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128517926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Regularized linear prediction all-pole models","authors":"M. Murthi, W. Kleijn","doi":"10.1109/SCFT.2000.878410","DOIUrl":"https://doi.org/10.1109/SCFT.2000.878410","url":null,"abstract":"For many cases of voiced speech, linear prediction (LP) based all-pole spectral envelopes exhibit unnatural vocal tract transfer functions that underestimate the formant bandwidths. To obtain smoother contoured all-pole spectral envelopes, we employ a regularization measure which discourages nonsmooth behavior of the transfer function. In particular, we demonstrate how a simple regularization scheme can be incorporated into the LP framework without the need for iterative numerical optimization or spectral sampling. Our results indicate that regularized LP all-pole models can provide more accurate vocal tract transfer function modeling than conventional LP, particularly at the formants.","PeriodicalId":359453,"journal":{"name":"2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127092456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Trellis-based optimization of MPEG-4 advanced audio coding","authors":"A. Aggarwal, S. Regunathan, K. Rose","doi":"10.1109/SCFT.2000.878430","DOIUrl":"https://doi.org/10.1109/SCFT.2000.878430","url":null,"abstract":"We outline a method to perform efficient low rate quantization for MPEG-4 advanced audio coding (AAC). The AAC bit stream consists of indices for quantized spectral coefficients as well as side information about quantizer step sizes and Huffman codebooks. The MPEG-4 Verification Model does not explicitly account for side information bits in its optimization and suffers from poor compression efficiency at low bit rates. We reformulate the encoding problem as one of optimal parameter selection, where the side information bits are taken into account, so as to minimize the noise to mask ratio for the given target bit rate. The optimal solution is determined by a dynamic programming procedure that efficiently searches through a trellis. This trellis-based optimization greatly improves the low bit rate performance of AAC and, consequently, the performance of a multi-layer AAC system. The resulting bit stream is standard-compatible, and additional complexity due to the proposed optimization is only incurred at the encoder.","PeriodicalId":359453,"journal":{"name":"2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130396079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A pseudo-cepstrum based short-term postfilter","authors":"H. Kim, Hong-Goo Kang","doi":"10.1109/SCFT.2000.878412","DOIUrl":"https://doi.org/10.1109/SCFT.2000.878412","url":null,"abstract":"We propose an adaptive short-term postfilter for speech coders by incorporating the properties of the pseudo-cepstrum. Since the proposed postfilter implicitly has a characteristic of tilt compensation, it does not require an additional tilt compensation filter, as conventional techniques do. We derive a relationship between the parameters of the proposed postfilter based on a minimum phase distortion criterion, and show a simple tuning procedure for the parameters. It is also shown that the postfilter can be implemented with a lower order. By applying this postfilter to several international speech coding standards, we reduce the complexity of the speech coders while obtaining comparable performance to conventional approaches.","PeriodicalId":359453,"journal":{"name":"2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122867557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}