{"title":"宽带语音编码的低复杂度LSF量化","authors":"S. Ragot, J. Adoul, R. Lefebvre, R. Salami","doi":"10.1109/SCFT.1999.781471","DOIUrl":null,"url":null,"abstract":"State-of-the-art narrowband speech coders operating from 4 to 16 kbit/s are mostly based on the code-excited linear predictive (CELP) model. They achieve a good synthesis quality usually at the expense of a high coding complexity. For example, in the 8 kbit/s G.729 coder the innovation codebook search is responsible for approximately half the total coder complexity, the latter being close to 20 MIPS in fixed-point DSP implementation. Less known is the relative part of spectral quantization, which is around 8% of the total complexity. CELP coders are still relevant for wideband speech coding but their complexity is greater than in the narrowband case, which becomes critical for real-time implementations. We propose in this article a two-stage algebraic-stochastic line spectral frequency (LSF) quantization scheme. It combines the strengths of algebraic and stochastic techniques, namely low computation and storage cost and good performance. The generalized Lloyd-Max algorithm is adapted for optimizing lattice codebooks obtained by spherical truncation. Simulations with a Gaussian source show that the quantization method exhibits good quality/complexity tradeoffs. Several stochastic-algebraic LSF quantizers are derived and compared to a more conventional technique.","PeriodicalId":372569,"journal":{"name":"1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)","volume":"232 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Low complexity LSF quantization for wideband speech coding\",\"authors\":\"S. Ragot, J. Adoul, R. Lefebvre, R. Salami\",\"doi\":\"10.1109/SCFT.1999.781471\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"State-of-the-art narrowband speech coders operating from 4 to 16 kbit/s are mostly based on the code-excited linear predictive (CELP) model. They achieve a good synthesis quality usually at the expense of a high coding complexity. For example, in the 8 kbit/s G.729 coder the innovation codebook search is responsible for approximately half the total coder complexity, the latter being close to 20 MIPS in fixed-point DSP implementation. Less known is the relative part of spectral quantization, which is around 8% of the total complexity. CELP coders are still relevant for wideband speech coding but their complexity is greater than in the narrowband case, which becomes critical for real-time implementations. We propose in this article a two-stage algebraic-stochastic line spectral frequency (LSF) quantization scheme. It combines the strengths of algebraic and stochastic techniques, namely low computation and storage cost and good performance. The generalized Lloyd-Max algorithm is adapted for optimizing lattice codebooks obtained by spherical truncation. Simulations with a Gaussian source show that the quantization method exhibits good quality/complexity tradeoffs. Several stochastic-algebraic LSF quantizers are derived and compared to a more conventional technique.\",\"PeriodicalId\":372569,\"journal\":{\"name\":\"1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)\",\"volume\":\"232 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1999-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCFT.1999.781471\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCFT.1999.781471","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Low complexity LSF quantization for wideband speech coding
State-of-the-art narrowband speech coders operating from 4 to 16 kbit/s are mostly based on the code-excited linear predictive (CELP) model. They achieve a good synthesis quality usually at the expense of a high coding complexity. For example, in the 8 kbit/s G.729 coder the innovation codebook search is responsible for approximately half the total coder complexity, the latter being close to 20 MIPS in fixed-point DSP implementation. Less known is the relative part of spectral quantization, which is around 8% of the total complexity. CELP coders are still relevant for wideband speech coding but their complexity is greater than in the narrowband case, which becomes critical for real-time implementations. We propose in this article a two-stage algebraic-stochastic line spectral frequency (LSF) quantization scheme. It combines the strengths of algebraic and stochastic techniques, namely low computation and storage cost and good performance. The generalized Lloyd-Max algorithm is adapted for optimizing lattice codebooks obtained by spherical truncation. Simulations with a Gaussian source show that the quantization method exhibits good quality/complexity tradeoffs. Several stochastic-algebraic LSF quantizers are derived and compared to a more conventional technique.