{"title":"Joint source-channel MMSE-decoding of speech parameters","authors":"S. Heinen, P. Vary","doi":"10.1109/ICASSP.2000.861929","DOIUrl":null,"url":null,"abstract":"For speech transmission in digital land mobile telephony, effective compression algorithms have to be used to achieve a high bandwidth efficiency. Furthermore, a variety of adverse transmission effects make it necessary to employ powerful error control techniques to keep bit error rates tolerably low and thus to guarantee a high speech duality. Speech compression is designed to remove irrelevancy and redundancy from the speech signal. Yet measuring the statistical properties of speech parameters extracted by practical compression schemes shows that a considerable amount of redundancy still remains, either in terms of non-uniform distribution or due to time-correlation of parameters extracted from subsequent speech segments. In this contribution, we propose a new minimum mean square error (MMSE) decoder for block-oriented trellis codes, that is able to exploit the time-correlation of subsequent parameter sets. The decoder yields non-discrete speech parameter mean square (MS) estimates. Thus it combines two approaches to exploit residual redundancy: source controlled channel decoding (SCCD) (Hagenauer 1995) and soft bit source decoding (SBSD) (Fingscheidt and Vary 1997) in one algorithm.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.2000.861929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
For speech transmission in digital land mobile telephony, effective compression algorithms have to be used to achieve a high bandwidth efficiency. Furthermore, a variety of adverse transmission effects make it necessary to employ powerful error control techniques to keep bit error rates tolerably low and thus to guarantee a high speech duality. Speech compression is designed to remove irrelevancy and redundancy from the speech signal. Yet measuring the statistical properties of speech parameters extracted by practical compression schemes shows that a considerable amount of redundancy still remains, either in terms of non-uniform distribution or due to time-correlation of parameters extracted from subsequent speech segments. In this contribution, we propose a new minimum mean square error (MMSE) decoder for block-oriented trellis codes, that is able to exploit the time-correlation of subsequent parameter sets. The decoder yields non-discrete speech parameter mean square (MS) estimates. Thus it combines two approaches to exploit residual redundancy: source controlled channel decoding (SCCD) (Hagenauer 1995) and soft bit source decoding (SBSD) (Fingscheidt and Vary 1997) in one algorithm.