在极低比特率下窄带语音编码的长期谐波加噪声模型

International Conference on Telecommunications and Signal Processing Pub Date : 2017-07-01 DOI:10.1109/TSP.2017.8076008

F. Ali, S. Larbi

{"title":"在极低比特率下窄带语音编码的长期谐波加噪声模型","authors":"F. Ali, S. Larbi","doi":"10.1109/TSP.2017.8076008","DOIUrl":null,"url":null,"abstract":"This paper presents a very low bit-rate speech codec based on the long-term Harmonic plus Noise Model (LT-HNM). The HNM is known to be efficient in terms of speech signal representation, thanks to the use of natural parameters: fundamental and voicing cut-off frequencies, harmonics and noise frequencies. Besides, the long-term modeling is particularly efficient in reducing the data size of the model parameters. In this paper we combine both approaches, long-term modeling and HNM, to develop a very low bit-rate coder for narrowband speech. The obtained bit-rates are as low as 2.3 kbps with objective listening quality (perceptual evaluation of speech quality PESQ) of 2.3.","PeriodicalId":236767,"journal":{"name":"International Conference on Telecommunications and Signal Processing","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A long term harmonic plus noise model for narrow-band speech coding at very low bit-rates\",\"authors\":\"F. Ali, S. Larbi\",\"doi\":\"10.1109/TSP.2017.8076008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a very low bit-rate speech codec based on the long-term Harmonic plus Noise Model (LT-HNM). The HNM is known to be efficient in terms of speech signal representation, thanks to the use of natural parameters: fundamental and voicing cut-off frequencies, harmonics and noise frequencies. Besides, the long-term modeling is particularly efficient in reducing the data size of the model parameters. In this paper we combine both approaches, long-term modeling and HNM, to develop a very low bit-rate coder for narrowband speech. The obtained bit-rates are as low as 2.3 kbps with objective listening quality (perceptual evaluation of speech quality PESQ) of 2.3.\",\"PeriodicalId\":236767,\"journal\":{\"name\":\"International Conference on Telecommunications and Signal Processing\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Telecommunications and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSP.2017.8076008\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Telecommunications and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSP.2017.8076008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

提出了一种基于长期谐波加噪声模型(LT-HNM)的极低比特率语音编解码器。由于使用了自然参数:基本和语音截止频率、谐波和噪声频率，HNM在语音信号表示方面被认为是有效的。此外，长期建模在减少模型参数的数据量方面特别有效。在本文中，我们结合两种方法，长期建模和HNM，开发了一个非常低比特率的窄带语音编码器。获得的比特率低至2.3 kbps，客观收听质量(语音质量感知评价PESQ)为2.3。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A long term harmonic plus noise model for narrow-band speech coding at very low bit-rates

This paper presents a very low bit-rate speech codec based on the long-term Harmonic plus Noise Model (LT-HNM). The HNM is known to be efficient in terms of speech signal representation, thanks to the use of natural parameters: fundamental and voicing cut-off frequencies, harmonics and noise frequencies. Besides, the long-term modeling is particularly efficient in reducing the data size of the model parameters. In this paper we combine both approaches, long-term modeling and HNM, to develop a very low bit-rate coder for narrowband speech. The obtained bit-rates are as low as 2.3 kbps with objective listening quality (perceptual evaluation of speech quality PESQ) of 2.3.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Telecommunications and Signal Processing

自引率

0.00%

发文量