A novel algorithm for low bit rate speech compression using a hybrid LP-harmonics model
N. Abu-Shikhah, Mohamed Deriche
2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421), 2000-09-17
DOI: 10.1109/SCFT.2000.878388 (https://doi.org/10.1109/SCFT.2000.878388)
Citations: 0
Abstract
We present a new LP-harmonic speech codec. At the coder, the speech signal is pre-processed, and an LP analysis is performed together with pitch estimation and a voicing decision. At the decoder, when the frame is voiced, the encoded parameters are used to estimate the spectral envelope and to extract the harmonics, which are classified as either strong or weak depending on their relative distance from multiples of the fundamental frequency. The parameters of the strong harmonics are then used to generate pure sinusoids, while the weak harmonics are used to generate a mixture of a pure sinusoid and a random-like signal. For unvoiced frames, the excitation of the LP filter is generated as white noise. The proposed model thus mixes strong and weak periodic signals with random signals to produce an excitation that results in natural-sounding speech. Informal testing of the coder operating at 1.82 kb/s showed that the output speech is highly intelligible, with quality comparable to that of a 4 kb/s sinusoidal codec.
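The decoder-side excitation described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no thresholds, amplitudes, or mixing rules, so the names `classify_harmonics` and `synth_excitation`, the relative-distance threshold `rel_tol`, the mixing weight `noise_mix`, and the unit harmonic amplitudes are all assumptions chosen only for illustration.

```python
import math
import random


def classify_harmonics(peak_freqs_hz, f0_hz, rel_tol=0.1):
    """Label each spectral peak as 'strong' or 'weak'.

    A peak counts as 'strong' when its distance from the nearest multiple
    of f0, measured relative to f0, is at most rel_tol. The threshold is
    an assumed value, not one taken from the paper.
    Returns a list of (harmonic_number, label) tuples.
    """
    labels = []
    for f in peak_freqs_hz:
        k = max(1, round(f / f0_hz))           # nearest harmonic number
        rel_dist = abs(f - k * f0_hz) / f0_hz  # relative distance from k*f0
        labels.append((k, "strong" if rel_dist <= rel_tol else "weak"))
    return labels


def synth_excitation(n, fs_hz, voiced, f0_hz=0.0, labels=(), noise_mix=0.5, seed=0):
    """Generate n samples of excitation for one frame.

    Unvoiced frames: white Gaussian noise.
    Voiced frames: strong harmonics become pure sinusoids; weak harmonics
    become a weighted mix of a sinusoid and noise (noise_mix is assumed).
    All harmonics use unit amplitude here, which is a simplification.
    """
    rng = random.Random(seed)
    if not voiced:
        return [rng.gauss(0.0, 1.0) for _ in range(n)]

    out = [0.0] * n
    for k, label in labels:
        if k * f0_hz >= fs_hz / 2.0:
            continue  # skip harmonics at or above the Nyquist frequency
        w = 2.0 * math.pi * k * f0_hz / fs_hz  # radians per sample
        for t in range(n):
            s = math.sin(w * t)
            if label == "weak":
                s = (1.0 - noise_mix) * s + noise_mix * rng.gauss(0.0, 1.0)
            out[t] += s
    return out


if __name__ == "__main__":
    labels = classify_harmonics([100.0, 205.0, 340.0], f0_hz=100.0)
    frame = synth_excitation(160, 8000.0, True, 100.0, labels)
    print(labels, len(frame))
```

In a full decoder this excitation would then drive the LP synthesis filter; that step, and the per-harmonic amplitudes taken from the spectral envelope, are omitted here.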