2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421): Latest Articles

Index assignment for predictive wideband LSF quantization
V.T. Ruoppila, S. Ragot
DOI: https://doi.org/10.1109/SCFT.2000.878415 (published 2000-09-17)
Abstract: In this paper we summarize some results derived earlier for the mean-square channel distortion of an autoregressive moving average (ARMA) vector quantizer with a maximum entropy encoder when the channel is assumed binary symmetric and memoryless. We discuss the required assumptions and their practical consequences in index assignment of ARMA vector quantizers. The discussion relates also to channel optimization of these quantizers. Furthermore, we compare noisy channel performance of memoryless, moving average, and autoregressive two-stage vector quantizers in line spectrum frequency quantization applied to wideband speech coding.
Citations: 3
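As an illustration of why index assignment matters on a binary symmetric channel, the following is a minimal sketch for the memoryless case only. It is not the paper's ARMA analysis. It assumes equiprobable code vectors and independent bit errors, and the function name `channel_distortion` and the toy codebook are hypothetical.

```python
def channel_distortion(codebook, assignment, bit_error_rate):
    """Mean-square channel distortion of a memoryless VQ over a
    binary symmetric channel (illustrative sketch, not the paper's method).

    codebook:       list of N = 2**b code vectors (lists of floats)
    assignment:     assignment[i] is the b-bit index sent for code vector i
                    (must be a permutation of 0..N-1)
    bit_error_rate: crossover probability q of the channel
    Assumes every code vector is selected with probability 1/N.
    """
    n = len(codebook)
    bits = n.bit_length() - 1
    assert 1 << bits == n, "codebook size must be a power of two"
    q = bit_error_rate
    total = 0.0
    for i in range(n):
        for j in range(n):
            # Number of bit flips that turn index i's word into index j's word
            d_h = bin(assignment[i] ^ assignment[j]).count("1")
            p = q ** d_h * (1.0 - q) ** (bits - d_h)
            dist = sum((a - b) ** 2 for a, b in zip(codebook[i], codebook[j]))
            total += p * dist / n
    return total


# Toy scalar codebook: neighbouring values get neighbouring binary words
# in the natural assignment, but not in the scrambled one.
toy_codebook = [[0.0], [1.0], [2.0], [3.0]]
natural = channel_distortion(toy_codebook, [0, 1, 2, 3], 0.01)
scrambled = channel_distortion(toy_codebook, [0, 3, 1, 2], 0.01)
```

In this toy case the natural assignment yields lower channel distortion than the scrambled one, because single-bit errors map to closer code vectors.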
Regularized linear prediction all-pole models
M. Murthi, W. Kleijn
DOI: https://doi.org/10.1109/SCFT.2000.878410
Abstract: For many cases of voiced speech, linear prediction (LP) based all-pole spectral envelopes exhibit unnatural vocal tract transfer functions that underestimate the formant bandwidths. To obtain smoother contoured all-pole spectral envelopes, we employ a regularization measure which discourages nonsmooth behavior of the transfer function. In particular, we demonstrate how a simple regularization scheme can be incorporated into the LP framework without the need for iterative numerical optimization or spectral sampling. Our results indicate that regularized LP all-pole models can provide more accurate vocal tract transfer function modeling than conventional LP, particularly at the formants.
Citations: 16
Trellis-based optimization of MPEG-4 advanced audio coding
A. Aggarwal, S. Regunathan, K. Rose
DOI: https://doi.org/10.1109/SCFT.2000.878430
Abstract: We outline a method to perform efficient low rate quantization for MPEG-4 advanced audio coding (AAC). The AAC bit stream consists of indices for quantized spectral coefficients as well as side information about quantizer step sizes and Huffman codebooks. The MPEG-4 Verification Model does not explicitly account for side information bits in its optimization and suffers from poor compression efficiency at low bit rates. We reformulate the encoding problem as one of optimal parameter selection, where the side information bits are taken into account, so as to minimize the noise to mask ratio for the given target bit rate. The optimal solution is determined by a dynamic programming procedure that efficiently searches through a trellis. This trellis-based optimization greatly improves the low bit rate performance of AAC and, consequently, the performance of a multi-layer AAC system. The resulting bit stream is standard-compatible, and additional complexity due to the proposed optimization is only incurred at the encoder.
Citations: 21
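The dynamic-programming trellis search mentioned in the abstract can be illustrated with a generic Viterbi-style sketch. This is not the authors' encoder. It assumes each band has a precomputed per-state cost (for example a weighted sum of distortion and coefficient bits) and that side-information bits depend only on the previous and current band's choices. The names `trellis_search`, `node_cost`, and `transition_cost` are hypothetical.

```python
def trellis_search(node_cost, transition_cost):
    """Find the minimum-cost path through a trellis (generic sketch).

    node_cost[k][s]:        cost of choosing state s (e.g. a quantizer
                            parameter) in band k
    transition_cost(p, s):  cost of moving from state p in one band to
                            state s in the next (e.g. side-information bits)
    Returns (total_cost, path) where path[k] is the chosen state in band k.
    """
    num_states = len(node_cost[0])
    best = list(node_cost[0])   # best cost of any path ending in each state
    back = []                   # back[k-1][s] = best predecessor of s in band k
    for k in range(1, len(node_cost)):
        new_best, pointers = [], []
        for s in range(num_states):
            prev = min(range(num_states),
                       key=lambda p: best[p] + transition_cost(p, s))
            new_best.append(best[prev] + transition_cost(prev, s) + node_cost[k][s])
            pointers.append(prev)
        best = new_best
        back.append(pointers)
    # Trace back from the cheapest final state
    state = min(range(num_states), key=lambda s: best[s])
    total = best[state]
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return total, path


# Toy example: choosing the locally cheapest state per band would switch
# states twice; counting the switching cost favours staying in state 0.
costs = [[1, 5], [5, 1], [1, 5]]
total, path = trellis_search(costs, lambda p, s: 3 * abs(p - s))
```

The toy example shows the effect the abstract describes: once the cost of signalling parameter changes is included, the best joint choice can differ from the band-by-band greedy choice.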
A pseudo-cepstrum based short-term postfilter
H. Kim, Hong-Goo Kang
DOI: https://doi.org/10.1109/SCFT.2000.878412
Abstract: We propose an adaptive short-term postfilter for speech coders by incorporating the properties of the pseudo-cepstrum. Since the proposed postfilter implicitly has a characteristic of tilt compensation, it does not require an additional tilt compensation filter as conventional techniques. We derive a relationship between the parameters of the proposed postfilter based on a minimum phase distortion criterion, and show a simple tuning procedure for the parameters. It is also shown that the postfilter can be implemented with a lower order. By applying this postfilter to several international speech coding standards, we reduce the complexity of the speech coders while obtaining comparable performance to conventional approaches.
Citations: 3