Speech compression by vector quantization of epochs

Peter Veprek, A. B. Bradley
Published in: ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications (IEEE Cat. No.99EX359), 1999-08-22
DOI: 10.1109/ISSPA.1999.818220 (https://doi.org/10.1109/ISSPA.1999.818220)
Citation count: 3

Abstract

A pitch epoch is a fundamental unit of voiced speech. This paper introduces a speech compression method based on vector quantization of epochs. Pitch determination, epoch marking, vector quantization procedure, and a technique for epoch extrapolation are described. The compression method is then evaluated and briefly compared to other waveform coders. The quality is objectively measured by the segmental signal-to-noise ratio and the results are tabulated. The (automatic) epoch vector quantization yields the following SNRseg: 10.03 dB at 12.0 kbps, 11.35 dB at 21.3 kbps, 13.31 dB at 39.7 kbps, 18.32 dB at 76.4 kbps, and 62.69 dB at 112.9 kbps.
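The quality figures above are segmental signal-to-noise ratios. As a rough illustration of how such a figure can be computed, the sketch below averages per-frame SNR values in dB over fixed-length frames. The function name `segmental_snr`, the 160-sample frame length, and the rule of skipping silent frames are assumptions made for this example; the abstract does not state the framing or silence handling the authors used.

```python
import math


def segmental_snr(reference, decoded, frame_len=160, eps=1e-12):
    """Return the mean per-frame SNR in dB.

    frame_len and the silent-frame rule are illustrative assumptions,
    not parameters taken from the paper.
    """
    if len(reference) != len(decoded):
        raise ValueError("signals must have the same length")

    frame_snrs = []
    # Walk over complete, non-overlapping frames only.
    for start in range(0, len(reference) - frame_len + 1, frame_len):
        ref = reference[start:start + frame_len]
        dec = decoded[start:start + frame_len]
        signal_energy = sum(x * x for x in ref)
        noise_energy = sum((x - y) ** 2 for x, y in zip(ref, dec))
        if signal_energy <= eps:
            continue  # skip silent frames so they do not dominate the mean
        ratio = signal_energy / max(noise_energy, eps)
        frame_snrs.append(10.0 * math.log10(ratio))

    if not frame_snrs:
        raise ValueError("no non-silent frames to evaluate")
    return sum(frame_snrs) / len(frame_snrs)


# Usage: a decoded signal that is a uniformly attenuated copy of the original.
original = [math.sin(2 * math.pi * 5 * n / 320) for n in range(320)]
attenuated = [0.9 * x for x in original]
score = segmental_snr(original, attenuated)
```

Because the measure averages in the log domain frame by frame, quiet frames count as much as loud ones, which is why it is often preferred over a single whole-signal SNR for speech. Many practical implementations also clamp each frame's value to a fixed range before averaging; that step is omitted here.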