{"title":"High-quality harmonic coding at very low bit rates","authors":"Gao Yang, H. Leich","doi":"10.1109/ICASSP.1994.389325","DOIUrl":null,"url":null,"abstract":"The paper presents a harmonic vocoder to produce high-quality speech at very low bit rates (below 2 kb/s). Voiced speech is decomposed into forward and backward signals which consist of interpolated harmonics. Unvoiced speech is reconstructed in the time domain with an approach similar to CELP. To remove the \"buzzy\" quality and avoid the \"hoarse\" quality, three methods are presented: the randomness of the harmonic phases is controlled according to pitch value and the continuity of synthetic speech is maintained; the spectral envelope determined by the LP model is modified; some noise components can be introduced for voiced synthetic speech. The harmonic vocoder produces quite natural, clear speech. Its perceptual quality is much better than that of the LPC-10 vocoder.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1994.389325","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
The paper presents a harmonic vocoder to produce high-quality speech at very low bit rates (below 2 kb/s). Voiced speech is decomposed into forward and backward signals which consist of interpolated harmonics. Unvoiced speech is reconstructed in the time domain with an approach similar to CELP. To remove the "buzzy" quality and avoid the "hoarse" quality, three methods are presented: the randomness of the harmonic phases is controlled according to pitch value and the continuity of synthetic speech is maintained; the spectral envelope determined by the LP model is modified; some noise components can be introduced for voiced synthetic speech. The harmonic vocoder produces quite natural, clear speech. Its perceptual quality is much better than that of the LPC-10 vocoder.<>