带线性预测的多模多频带激励声码器

2010 IEEE International Conference on Wireless Communications, Networking and Information Security Pub Date : 2010-06-25 DOI:10.1109/WCINS.2010.5541932

Yanxia Liang, Jiawei Yang, Ye Li

{"title":"带线性预测的多模多频带激励声码器","authors":"Yanxia Liang, Jiawei Yang, Ye Li","doi":"10.1109/WCINS.2010.5541932","DOIUrl":null,"url":null,"abstract":"This paper presents a Multimode Multi-Band Excitation with Linear Prediction model (MMBE-LP) at 2.35kbps. Unvoiced/Voiced (U/V) decisions and spectrum amplitudes estimations are improved in this vocoder compared with MBE vocoder. For better quantization results, different codebooks are used for different modes of U/V decision, so the number of sub-bands in a frame is fixed. Spectral amplitudes are estimated by Linear Prediction Mode and quantized by MSVQ (Multi-Stage Vector Quantization). Simulation results show that the unvoiced and voiced parts of synthetic speech are coherent with the parts of original speech obviously.","PeriodicalId":156036,"journal":{"name":"2010 IEEE International Conference on Wireless Communications, Networking and Information Security","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Multimode Multi-Band Excitation with Linear Prediction vocoder\",\"authors\":\"Yanxia Liang, Jiawei Yang, Ye Li\",\"doi\":\"10.1109/WCINS.2010.5541932\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a Multimode Multi-Band Excitation with Linear Prediction model (MMBE-LP) at 2.35kbps. Unvoiced/Voiced (U/V) decisions and spectrum amplitudes estimations are improved in this vocoder compared with MBE vocoder. For better quantization results, different codebooks are used for different modes of U/V decision, so the number of sub-bands in a frame is fixed. Spectral amplitudes are estimated by Linear Prediction Mode and quantized by MSVQ (Multi-Stage Vector Quantization). Simulation results show that the unvoiced and voiced parts of synthetic speech are coherent with the parts of original speech obviously.\",\"PeriodicalId\":156036,\"journal\":{\"name\":\"2010 IEEE International Conference on Wireless Communications, Networking and Information Security\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE International Conference on Wireless Communications, Networking and Information Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WCINS.2010.5541932\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Wireless Communications, Networking and Information Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCINS.2010.5541932","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

提出了一种2.35kbps速率的多模多频带线性预测激励模型(MMBE-LP)。与MBE声码器相比，该声码器改进了清/清(U/V)判断和频谱幅度估计。为了获得更好的量化结果，不同的码本用于不同的U/V判定模式，因此一帧中的子带数量是固定的。采用线性预测方法估计谱幅值，采用多阶段矢量量化方法量化谱幅值。仿真结果表明，合成语音的不浊音部分和浊音部分与原始语音有明显的一致性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Multimode Multi-Band Excitation with Linear Prediction vocoder

This paper presents a Multimode Multi-Band Excitation with Linear Prediction model (MMBE-LP) at 2.35kbps. Unvoiced/Voiced (U/V) decisions and spectrum amplitudes estimations are improved in this vocoder compared with MBE vocoder. For better quantization results, different codebooks are used for different modes of U/V decision, so the number of sub-bands in a frame is fixed. Spectral amplitudes are estimated by Linear Prediction Mode and quantized by MSVQ (Multi-Stage Vector Quantization). Simulation results show that the unvoiced and voiced parts of synthetic speech are coherent with the parts of original speech obviously.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2010 IEEE International Conference on Wireless Communications, Networking and Information Security

自引率

0.00%

发文量