{"title":"Complex Wavelet Modulation Subbands for Speech Compression","authors":"J. Luneau, J. Lebrun, S. H. Jensen","doi":"10.1109/DCC.2009.52","DOIUrl":null,"url":null,"abstract":"Low-frequency modulation of sound carry essential information for speech and music. They must be preserved for compression. The complex modulation spectrum is commonly obtained by spectral analysis of the sole temporal envelopes of the subbands out of a time/frequency analysis. Amplitudes and tones of speech or music tend to vary slowly over time thus the temporal envelopes are mostly of polynomial type. Processing in this domain usually creates undesirable distortions because only the magnitudes are taken into account and the phase data is often neglected. We remedy this problem with the use of a complex wavelet transform as a more appropriate envelope and phase processing tool. Complex wavelets carry both magnitude and phase explicitly with great sparsity and preserve well polynomials. Moreover an analytic Hilbert-like transform is possible with complex wavelets implemented as an orthogonal filter bank. By working in this alternative transform domain coined as ``Modulation Subbands\", this transform shows very promising compression capabilities thanks to interesting sparsity properties and suggests new approaches for joint spectro-temporal analytic processing of slow frequency and phase varying audio signals.","PeriodicalId":377880,"journal":{"name":"2009 Data Compression Conference","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Data Compression Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCC.2009.52","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Low-frequency modulation of sound carry essential information for speech and music. They must be preserved for compression. The complex modulation spectrum is commonly obtained by spectral analysis of the sole temporal envelopes of the subbands out of a time/frequency analysis. Amplitudes and tones of speech or music tend to vary slowly over time thus the temporal envelopes are mostly of polynomial type. Processing in this domain usually creates undesirable distortions because only the magnitudes are taken into account and the phase data is often neglected. We remedy this problem with the use of a complex wavelet transform as a more appropriate envelope and phase processing tool. Complex wavelets carry both magnitude and phase explicitly with great sparsity and preserve well polynomials. Moreover an analytic Hilbert-like transform is possible with complex wavelets implemented as an orthogonal filter bank. By working in this alternative transform domain coined as ``Modulation Subbands", this transform shows very promising compression capabilities thanks to interesting sparsity properties and suggests new approaches for joint spectro-temporal analytic processing of slow frequency and phase varying audio signals.