Bandwidth Extension of a Narrowband Speech Coder for Music Streaming Services Over IP Networks

Young Han Lee, H. Kim
Proceedings. IEEE Workshop on Signal Processing Systems (2007-2014), pp. 552-555. Published 2007-11-21. DOI: 10.1109/SIPS.2007.4387608 (https://doi.org/10.1109/SIPS.2007.4387608)
Citations: 0

Abstract

In this paper, we propose a bandwidth extension (BWE) algorithm that turns a low-bit-rate narrowband CELP coder into a wideband speech coder using a spectral envelope sharing approach. The resulting wideband coder, referred to here as the BWE coder, has an embedded structure: an enhancement layer is added on top of the narrowband CELP coder. To minimize the bit-rate increase caused by the enhancement layer, the BWE coder shares the spectral envelope and excitation parameters between the narrowband CELP coder and the enhancement layer. We choose G.729EV layer 2 as the baseline narrowband speech coder and use mel-frequency cepstral coefficients (MFCCs) to reconstruct the higher-frequency components at the enhancement layer. The resulting bit-rate of the BWE coder is 12.7 kbit/s, just 0.7 kbit/s higher than that of G.729EV layer 2. A MUSHRA test with audio signals from four different music genres shows that the BWE coder gives better quality than G.729EV layer 2 and quality comparable to G.729EV layer 3, corresponding to an overall bit-rate reduction of 1.3 kbit/s.
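
The bit-rate figures in the abstract can be cross-checked with a little arithmetic. The sketch below is only an illustration: the three rates come from the abstract, while the 20 ms frame length and the names `FRAME_MS` and `bits_per_frame` are assumptions introduced here to express the enhancement-layer overhead as bits per frame. They are not taken from the paper.

```python
# Rates are in bit/s and kept as integers to avoid floating-point rounding.
BWE_RATE = 12_700           # stated in the abstract: 12.7 kbit/s
DELTA_OVER_LAYER2 = 700     # stated: 0.7 kbit/s above G.729EV layer 2
SAVING_VS_LAYER3 = 1_300    # stated: 1.3 kbit/s below G.729EV layer 3
FRAME_MS = 20               # ASSUMPTION: frame length, not given in the abstract

# Layer rates implied by the abstract's own numbers.
layer2_rate = BWE_RATE - DELTA_OVER_LAYER2
layer3_rate = BWE_RATE + SAVING_VS_LAYER3


def bits_per_frame(rate_bps: int, frame_ms: int = FRAME_MS) -> int:
    """Convert a bit-rate in bit/s to whole bits per frame (hypothetical helper)."""
    return rate_bps * frame_ms // 1000


if __name__ == "__main__":
    print("implied layer 2 rate (bit/s):", layer2_rate)
    print("implied layer 3 rate (bit/s):", layer3_rate)
    print("enhancement bits per frame:", bits_per_frame(DELTA_OVER_LAYER2))
```

Under the assumed 20 ms frame, the 0.7 kbit/s enhancement layer corresponds to a budget of only a few bits per frame, which is consistent with the abstract's point that sharing parameters with the core coder keeps the overhead small.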