{"title":"Fractal modelling of residues in linear predictive coding of speech","authors":"W. Kinsner, E. Vera","doi":"10.1109/COGINF.2009.5250762","DOIUrl":null,"url":null,"abstract":"This paper describes a novel approach of fractal modelling and coding of residuals for excitation in the linear predictive coding of speech. This work was motivated by reducing the bit rate to 1200 bps, while maintaining a good quality of speech. Linear prediction based speech coders differ primarily in the modelling of the residual. The design trade-off in the modelling of the residual is between quality and bit-rate. In this paper fractal modelling is used to model the residual. We show that fractal modelling reduces the bit-rate while maintaining quality. A 6 kbps speech coder was implemented using the piecewise self-affine fractal model. The new coder has a signal-to-noise ratio of 10.9 dB. An informal subjective measure found the perceptual quality to be comparable to that of the 13 kbps GSM coder.","PeriodicalId":420853,"journal":{"name":"2009 8th IEEE International Conference on Cognitive Informatics","volume":"311 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 8th IEEE International Conference on Cognitive Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COGINF.2009.5250762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper describes a novel approach of fractal modelling and coding of residuals for excitation in the linear predictive coding of speech. This work was motivated by reducing the bit rate to 1200 bps, while maintaining a good quality of speech. Linear prediction based speech coders differ primarily in the modelling of the residual. The design trade-off in the modelling of the residual is between quality and bit-rate. In this paper fractal modelling is used to model the residual. We show that fractal modelling reduces the bit-rate while maintaining quality. A 6 kbps speech coder was implemented using the piecewise self-affine fractal model. The new coder has a signal-to-noise ratio of 10.9 dB. An informal subjective measure found the perceptual quality to be comparable to that of the 13 kbps GSM coder.