Min Gang, Guo Jian, Yang Jibin, Tan Wei, Chen Yanpu
2009 International Conference on Wireless Communications & Signal Processing. Published 2009-12-31. DOI: 10.1109/WCSP.2009.5371427 (https://doi.org/10.1109/WCSP.2009.5371427)
An extreme low bit rate speech coding algorithm around 300bps
This paper proposes an extremely low bit rate speech coding algorithm operating at around 300 bps. The algorithm builds a mixed excitation segment coding model that combines the advantages of the segment coder and the MELP coder. Variable dimension matrix quantization (VDMQ) and variable dimension vector quantization (VDVQ) schemes are presented for quantizing the LSP and excitation parameters. These quantization schemes achieve acceptable performance at very low bit rates and also reduce codebook storage dramatically. An informal subjective listening test shows that the reconstructed speech has high intelligibility and moderate naturalness, and the PESQ score reaches 2.02.
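To give a sense of how tight a 300 bps budget is, the following minimal sketch computes the average number of bits available per frame and per multi-frame segment. The 22.5 ms frame length is that of the standard 2400 bps MELP coder and is only an assumption here, since the abstract does not state the frame length the authors use. The function names and the 4-frame segment are likewise hypothetical illustrations, not the paper's bit allocation.

```python
def bits_per_frame(bit_rate_bps: float, frame_ms: float) -> float:
    """Average number of bits available for one frame at the given bit rate."""
    return bit_rate_bps * frame_ms / 1000.0


def bits_per_segment(bit_rate_bps: float, frame_ms: float, frames: int) -> float:
    """Average number of bits available for a segment of `frames` consecutive frames."""
    return bits_per_frame(bit_rate_bps, frame_ms) * frames


# Assumption: 22.5 ms frames, as in standard 2400 bps MELP.
# The abstract does not give the frame length of the proposed coder.
FRAME_MS = 22.5

if __name__ == "__main__":
    print(bits_per_frame(2400, FRAME_MS))      # 54.0 bits per frame at 2400 bps
    print(bits_per_frame(300, FRAME_MS))       # 6.75 bits per frame at 300 bps
    print(bits_per_segment(300, FRAME_MS, 4))  # 27.0 bits for a hypothetical 4-frame segment
```

Under these assumptions, fewer than 7 bits per frame remain for all spectral and excitation parameters combined, which suggests why quantizing several frames jointly as one segment is attractive at this rate.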