Parametric models of the magnitude/phase spectrum for harmonic speech coding

ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing. Pub Date: 1988-04-11. DOI: 10.1109/ICASSP.1988.196596

D. Thomson


Citations: 12

Abstract



A method is described for representing magnitude and phase in a sinusoidal transform coder. Instead of transmitting individual sinusoids, the entire speech spectrum is transmitted. The synthesizer estimates the frequency, amplitude, and phase of each harmonic from the spectrum. Relatively high-quality speech in the 4.8-9.6 kb/s range is obtained by modeling the magnitude/phase spectrum with a combination of pole-zero analysis, phase prediction and vector quantization. A window subtraction method ensures proper synthesis of unvoiced speech. The system is robust since it does not depend on pitch estimates or voicing decisions.
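The idea of recovering each harmonic's frequency, amplitude, and phase from a spectrum can be illustrated with a minimal sketch. The abstract does not say how the synthesizer performs this estimate, so the code below assumes plain peak picking on a Hann-windowed, zero-padded DFT of one frame, standing in for the transmitted spectrum. The function names `estimate_sinusoids` and `synthesize`, the frame length, and the FFT size are all hypothetical choices for illustration; the pole-zero model, phase prediction, vector quantization, and window subtraction steps are not shown.

```python
import numpy as np


def estimate_sinusoids(frame, sample_rate, n_fft=1024, max_peaks=3):
    """Estimate (frequency in Hz, amplitude, phase) of the strongest spectral peaks.

    Assumption: simple local-maximum peak picking, not the paper's method.
    Phases are referenced to the first sample of the frame.
    """
    window = np.hanning(len(frame))
    spectrum = np.fft.rfft(frame * window, n_fft)  # zero-padded to n_fft
    mag = np.abs(spectrum)

    # Interior bins that are strictly larger than both neighbours.
    is_peak = (mag[1:-1] > mag[:-2]) & (mag[1:-1] > mag[2:])
    bins = np.nonzero(is_peak)[0] + 1

    # Keep the strongest peaks, then restore ascending frequency order.
    strongest = bins[np.argsort(mag[bins])[::-1][:max_peaks]]
    strongest = np.sort(strongest)

    freqs = strongest * sample_rate / n_fft
    # A real sinusoid of amplitude A contributes about A/2 * sum(window) at its peak.
    amps = 2.0 * mag[strongest] / window.sum()
    phases = np.angle(spectrum[strongest])
    return freqs, amps, phases


def synthesize(freqs, amps, phases, n_samples, sample_rate):
    """Sum of cosines built from the estimated parameters."""
    n = np.arange(n_samples)
    freqs, amps, phases = (np.asarray(x, dtype=float)[:, None]
                           for x in (freqs, amps, phases))
    return (amps * np.cos(2.0 * np.pi * freqs * n / sample_rate + phases)).sum(axis=0)


if __name__ == "__main__":
    # Synthetic "voiced" frame: three harmonics of an assumed 250 Hz fundamental.
    sr = 8000
    frame = synthesize([250.0, 500.0, 750.0], [1.0, 0.6, 0.3], [0.3, -1.0, 2.0], 256, sr)
    f, a, p = estimate_sinusoids(frame, sr)
    rebuilt = synthesize(f, a, p, len(frame), sr)
    print("frequencies (Hz):", f)
    print("max resynthesis error:", np.max(np.abs(rebuilt - frame)))
```

The test signal places each harmonic exactly on a DFT bin, so no interpolation between bins is needed; for real speech a finer frequency estimate would be required. Note that nothing in this sketch uses a pitch estimate or a voicing decision, only the peaks of the spectrum.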
