{"title":"高质量的谐波编码在非常低的比特率","authors":"Gao Yang, H. Leich","doi":"10.1109/ICASSP.1994.389325","DOIUrl":null,"url":null,"abstract":"The paper presents a harmonic vocoder to produce high-quality speech at very low bit rates (below 2 kb/s). Voiced speech is decomposed into forward and backward signals which consist of interpolated harmonics. Unvoiced speech is reconstructed in the time domain with an approach similar to CELP. To remove the \"buzzy\" quality and avoid the \"hoarse\" quality, three methods are presented: the randomness of the harmonic phases is controlled according to pitch value and the continuity of synthetic speech is maintained; the spectral envelope determined by the LP model is modified; some noise components can be introduced for voiced synthetic speech. The harmonic vocoder produces quite natural, clear speech. Its perceptual quality is much better than that of the LPC-10 vocoder.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"High-quality harmonic coding at very low bit rates\",\"authors\":\"Gao Yang, H. Leich\",\"doi\":\"10.1109/ICASSP.1994.389325\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper presents a harmonic vocoder to produce high-quality speech at very low bit rates (below 2 kb/s). Voiced speech is decomposed into forward and backward signals which consist of interpolated harmonics. Unvoiced speech is reconstructed in the time domain with an approach similar to CELP. To remove the \\\"buzzy\\\" quality and avoid the \\\"hoarse\\\" quality, three methods are presented: the randomness of the harmonic phases is controlled according to pitch value and the continuity of synthetic speech is maintained; the spectral envelope determined by the LP model is modified; some noise components can be introduced for voiced synthetic speech. The harmonic vocoder produces quite natural, clear speech. Its perceptual quality is much better than that of the LPC-10 vocoder.<<ETX>>\",\"PeriodicalId\":290798,\"journal\":{\"name\":\"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1994.389325\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1994.389325","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
High-quality harmonic coding at very low bit rates
The paper presents a harmonic vocoder to produce high-quality speech at very low bit rates (below 2 kb/s). Voiced speech is decomposed into forward and backward signals which consist of interpolated harmonics. Unvoiced speech is reconstructed in the time domain with an approach similar to CELP. To remove the "buzzy" quality and avoid the "hoarse" quality, three methods are presented: the randomness of the harmonic phases is controlled according to pitch value and the continuity of synthetic speech is maintained; the spectral envelope determined by the LP model is modified; some noise components can be introduced for voiced synthetic speech. The harmonic vocoder produces quite natural, clear speech. Its perceptual quality is much better than that of the LPC-10 vocoder.<>