Speech Coding, 2002, IEEE Workshop Proceedings: Latest Articles

MMSE decoding for vector quantization over channels with memory

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-09 DOI: 10.1109/SCW.2002.1215728

Heng-Iang Hsu, Wen-Whei Chang, Xiaobei Liu, S. Koh

Citations: 0

An iterative interpolative transform method for modeling harmonic magnitudes

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215716

T. Ramabadran, A. Smith, M. Jasiuk

Citations: 3

Perceptual QoS assessment methodologies for coded speech in networks

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215730

N. Kitawaki

Citations: 3

A packet loss concealment method using pitch waveform repetition and internal state update on the decoded speech for the sub-band ADPCM wideband speech codec

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215726

M. Serizawa, Y. Nozawa

Abstract: The paper proposes a packet loss concealment (PLC) method for the SB-ADPCM (sub-band adaptive differential pulse code modulation) wideband speech codec. When a packet loss occurs, the concealment repeats a pitch waveform of the speech decoded in the past with attenuation to generate a speech waveform corresponding to the lost packet. The packet loss causes differences in the internal states, such as prediction filter states, between encoding and decoding of the SB-ADPCM codec. This difference results in an annoying click noise during the period following the packet loss. The proposed method reduces this difference by updating the internal state based on the speech decoded by the concealment in the past. It also employs a forgetting factor control for the internal states, which reduces the impact on the internal states from the packet loss. Results from a five-grade mean opinion test show that the proposed method achieves around 3 (fair) or 4 (good) speech quality at a loss rate lower than 5%, and 0.4 through 1.0 higher quality compared to the conventional muting PLC method at packet loss rates of 1 to 10% with a packet size of 10 or 20 msec.

Citations: 18
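The waveform-repetition step described in the Serizawa and Nozawa abstract above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the normalized-correlation pitch search, and the per-cycle attenuation schedule are all assumptions made for illustration, and the internal state update and forgetting factor control that the abstract also mentions are not modeled here.

```python
import math


def estimate_pitch_lag(history, min_lag, max_lag, window):
    """Return the lag whose shifted copy best matches the last `window` samples.

    Hypothetical helper: a plain normalized cross-correlation search,
    chosen for illustration only.
    """
    n = len(history)
    if n < window + max_lag:
        raise ValueError("history too short for the requested lag range")
    recent = history[n - window:]
    energy_recent = sum(x * x for x in recent)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        shifted = history[n - window - lag:n - lag]
        cross = sum(x * y for x, y in zip(recent, shifted))
        energy = math.sqrt(energy_recent * sum(y * y for y in shifted))
        score = cross / energy if energy > 0 else 0.0
        if score > best_score:  # strict comparison: ties keep the shortest lag
            best_lag, best_score = lag, score
    return best_lag


def conceal_lost_packet(history, packet_len, pitch_lag, attenuation=0.9):
    """Fill a lost packet by repeating the last pitch cycle of `history`.

    Assumption: the gain is multiplied by `attenuation` once per repeated
    cycle; the abstract only says the repetition is attenuated.
    """
    cycle = history[-pitch_lag:]
    concealed = []
    gain = attenuation
    for i in range(packet_len):
        if i > 0 and i % pitch_lag == 0:
            gain *= attenuation  # each further repetition is quieter
        concealed.append(gain * cycle[i % pitch_lag])
    return concealed
```

As a usage example, assuming 16 kHz sampling, a 10 msec packet corresponds to `packet_len=160`; the decoder would call `estimate_pitch_lag` on its recent output and pass the result to `conceal_lost_packet` whenever a packet is missing.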

The analysis of speech codecs using psychoacoustic measures

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215740

Mohammed Raad, C. Ritz, I. Burnett, A. Mertins

Citations: 2

Wideband speech coder employing T-codes and reversible variable length codes

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215743

Hongqiang Wang, S. Koh, G. Shu

Citations: 0

Speech and noise separations using comb filtering method for high quality speech coding

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215739

Y. Wang, K. Yoshida

Citations: 4

Quantization noise spectral shaping in instantaneous coding of spectrally unbalanced speech signals

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215722

G. Mahé, A. Gilloire

Citations: 4

A scalable coder designed for 10-kHz bandwidth speech

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215741

M. Oshikiri, H. Ehara, K. Yoshida

Abstract: This paper presents a scalable speech coder with a rate of 23.85 kbit/s to encode 10-kHz bandwidth speech signals. The perceptual quality of 10-kHz bandwidth speech signals is much better than that of 7-kHz bandwidth ones, and it is close to that of 20-kHz bandwidth ones. The 10-kHz bandwidth is therefore promising for high-fidelity conversational applications. The scalable coder consists of two layers: a base layer and an enhancement layer. The adaptive multi-rate wideband speech coder (AMR-WB) at 15.85 kbit/s and a transform coding method at 8 kbit/s are used for the base layer and the enhancement layer, respectively. This hybrid structure ensures efficient coding of the 10-kHz bandwidth speech. In the enhancement layer, the modified discrete cosine transform (MDCT) is used. Its analysis frame size is set to be short in order to minimize additional algorithmic delay. The total additional algorithmic delay of the enhancement layer is 5 ms. Since it is difficult to quantize all the MDCT coefficients at 8 kbit/s, the region for quantization is limited to 6 kHz through 9 kHz to improve the perceptual quality of decoded speech. Subjective evaluation test results indicate that the quality of the proposed coder clearly exceeds that of AMR-WB at 23.85 kbit/s under both clean and noise conditions.

Citations: 5
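The layer rates quoted in the Oshikiri, Ehara and Yoshida abstract above add up to the stated total, which is why the coder can be compared with AMR-WB at 23.85 kbit/s at equal rate. The short Python sketch below checks that arithmetic and converts each rate to a bit count per interval. The 20 ms interval is an assumption chosen for illustration; the abstract does not state the frame lengths, and the names are hypothetical.

```python
# Layer rates taken from the abstract, in bits per second.
BASE_LAYER_BPS = 15_850         # AMR-WB base layer
ENHANCEMENT_LAYER_BPS = 8_000   # MDCT-based enhancement layer

# Assumption: a 20 ms accounting interval, used only for illustration.
INTERVAL_MS = 20


def bits_per_interval(rate_bps, interval_ms=INTERVAL_MS):
    """Number of bits produced in `interval_ms` milliseconds at `rate_bps`."""
    return rate_bps * interval_ms // 1000


total_bps = BASE_LAYER_BPS + ENHANCEMENT_LAYER_BPS
base_bits = bits_per_interval(BASE_LAYER_BPS)
enhancement_bits = bits_per_interval(ENHANCEMENT_LAYER_BPS)
total_bits = bits_per_interval(total_bps)
```

Integer rates in bit/s are used instead of floating-point kbit/s so that the per-interval counts come out exact.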

A 1200/2400 bps coding suite based on MELP

Speech Coding, 2002, IEEE Workshop Proceedings. Pub Date : 2002-10-06 DOI: 10.1109/SCW.2002.1215734

Tian Wang, K. Koishida, V. Cuperman, A. Gersho, J. Collura

Citations: 63