{"title":"A low delay speech coding system at 4.8 kb/s","authors":"Jian Zhang, Hong Shen Wang","doi":"10.1109/ICCS.1994.474279","DOIUrl":null,"url":null,"abstract":"Low-delay speech coding has drawn much attention because of its many potential applications. A lot of low delay speech coders have been proposed in the past few years. But almost all of them focus on the speech bit rate between 8 kb/s to 16 kb/s. The authors present a low delay speech coder operating at 4.8 kb/s which provides good speech quality with a coding delay only about 2.5 msec. This new low delay coder uses backward LPC analysis to remove the short-term correlation from speech and adopts a new approach to predict the long term prediction (LTP) coefficients based on a \"dual-decision\" scheme. At the excitation part, the authors exploit a new excitation model which contains a sinusoid signal codebook and a stochastic codebook to excite the whole speech system. The synthesized speech of this coder has comparable quality to that of FS-1016 CELP coder while having a delay constraint more than an order of magnitude smaller than that of the FS-1016 coder.","PeriodicalId":158681,"journal":{"name":"Proceedings of ICCS '94","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of ICCS '94","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCS.1994.474279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 3
Abstract
Low-delay speech coding has drawn much attention because of its many potential applications. Many low-delay speech coders have been proposed in the past few years, but almost all of them target bit rates between 8 kb/s and 16 kb/s. The authors present a low-delay speech coder operating at 4.8 kb/s that provides good speech quality with a coding delay of only about 2.5 ms. The new coder uses backward LPC analysis to remove the short-term correlation from speech and adopts a new approach, based on a "dual-decision" scheme, to predicting the long-term prediction (LTP) coefficients. For the excitation, the authors use a new excitation model that combines a sinusoidal signal codebook with a stochastic codebook to excite the whole speech system. The synthesized speech has quality comparable to that of the FS-1016 CELP coder, while the coder meets a delay constraint more than an order of magnitude smaller than that of the FS-1016 coder.
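
To illustrate the general idea behind backward LPC analysis, the sketch below derives short-term predictor coefficients from previously decoded samples only, using the autocorrelation method and the Levinson-Durbin recursion. Because the decoder holds the same past samples, it can repeat the computation, so no LPC parameters need to be transmitted and no look-ahead frame has to be buffered. This is a generic sketch and not the authors' implementation: the function names, the predictor order of 10, the 160-sample analysis window, and the white-noise correction factor are all assumptions chosen for illustration.

```python
def autocorrelation(x, order):
    """Return autocorrelation values r[0..order] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i - lag] for i in range(lag, n))
            for lag in range(order + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations with the Levinson-Durbin recursion.

    Convention: x_hat[n] = sum_{k=1..order} a[k] * x[n - k].
    Returns (coefficients a[1..order], final prediction error energy).
    """
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # degenerate input (e.g. all zeros): keep remaining taps at 0
        acc = r[i] - sum(a[k] * r[i - k] for k in range(1, i))
        k_i = acc / err  # reflection coefficient
        new_a = a[:]
        new_a[i] = k_i
        for k in range(1, i):
            new_a[k] = a[k] - k_i * a[i - k]
        a = new_a
        err *= 1.0 - k_i * k_i
    return a[1:], err


def backward_lpc(past_decoded, order=10, window=160):
    """Estimate predictor coefficients from past decoded samples only.

    order and window are illustrative assumptions, not values from the paper.
    """
    history = past_decoded[-window:]
    r = autocorrelation(history, order)
    r[0] *= 1.0001  # small white-noise correction (assumed) for conditioning
    coeffs, _ = levinson_durbin(r, order)
    return coeffs


# Usage: a decaying exponential is predicted almost perfectly by one tap.
signal = [0.9 ** n for n in range(200)]
first_order = backward_lpc(signal, order=1)
```

In this usage example the single coefficient comes out close to 0.9, the decay factor of the test signal. A real coder would rerun the analysis on each new block of decoded speech and would typically apply a tapered analysis window, which is omitted here for brevity.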