感知音频编码中IntMDCT舍入误差的研究

Te Li, R. Yu, S. Koh
{"title":"感知音频编码中IntMDCT舍入误差的研究","authors":"Te Li, R. Yu, S. Koh","doi":"10.1109/ISM.2005.111","DOIUrl":null,"url":null,"abstract":"With the proliferation of broadband access and continuous decline of storage prize per gigabyte, there has been an increasing demand of audio solution that provides high sampling rate and high resolution. Lossless audio is undoubtedly the ultimate solution. In response to this demand, MPEG issued a call for proposal soliciting technology contributions that provides a state-of-art solution. At the technology end, lossless compression requires the usage of integer transform. The integer modified discrete cosine transform (IntMDCT) has been adopted in MPEG-4 scalable to lossless (SLS) coding to enable this efficient lossless operation. Because of rounding operations, rounding errors introduced by IntMDCT exist during the whole coding process. With the SLS having capability of using operations that spreads over the bitrate spectrum which ranges from lossy to lossless, it is of interest to study the effect of rounding errors in IntMDCT for operation of SLS in lossy mode. This paper analyzes the contributions of noise due to these errors. It is found that the noise introduced by rounding operations of IntMDCT does not affect the perceptual quality of the coded audio under any circumstances. As such, it concludes that the MDCT and IntMDCT filterbanks are interchangeable at lossy bitrate. With the fact that SLS uses both MDCT and IntMDCT, the finding in this paper suggests the possibility of using only IntMDCT filterbank.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"296 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Study on rounding errors of IntMDCT in perceptual audio coding\",\"authors\":\"Te Li, R. Yu, S. Koh\",\"doi\":\"10.1109/ISM.2005.111\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the proliferation of broadband access and continuous decline of storage prize per gigabyte, there has been an increasing demand of audio solution that provides high sampling rate and high resolution. Lossless audio is undoubtedly the ultimate solution. In response to this demand, MPEG issued a call for proposal soliciting technology contributions that provides a state-of-art solution. At the technology end, lossless compression requires the usage of integer transform. The integer modified discrete cosine transform (IntMDCT) has been adopted in MPEG-4 scalable to lossless (SLS) coding to enable this efficient lossless operation. Because of rounding operations, rounding errors introduced by IntMDCT exist during the whole coding process. With the SLS having capability of using operations that spreads over the bitrate spectrum which ranges from lossy to lossless, it is of interest to study the effect of rounding errors in IntMDCT for operation of SLS in lossy mode. This paper analyzes the contributions of noise due to these errors. It is found that the noise introduced by rounding operations of IntMDCT does not affect the perceptual quality of the coded audio under any circumstances. As such, it concludes that the MDCT and IntMDCT filterbanks are interchangeable at lossy bitrate. With the fact that SLS uses both MDCT and IntMDCT, the finding in this paper suggests the possibility of using only IntMDCT filterbank.\",\"PeriodicalId\":322363,\"journal\":{\"name\":\"Seventh IEEE International Symposium on Multimedia (ISM'05)\",\"volume\":\"296 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Seventh IEEE International Symposium on Multimedia (ISM'05)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISM.2005.111\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Seventh IEEE International Symposium on Multimedia (ISM'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISM.2005.111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着宽带接入的普及和每千兆字节存储价值的不断下降,人们对高采样率和高分辨率的音频解决方案的需求越来越大。无损音频无疑是最终的解决方案。为了响应这一需求,MPEG发布了一项提案,征求提供最先进解决方案的技术贡献。在技术端,无损压缩需要使用整数变换。整数修正离散余弦变换(IntMDCT)被用于MPEG-4可扩展到无损(SLS)编码,以实现这种高效的无损操作。由于存在舍入操作,IntMDCT引入的舍入误差在整个编码过程中都存在。由于SLS具有使用在比特率频谱(从有损到无损)上扩展的操作的能力,因此研究IntMDCT中舍入误差对SLS在有损模式下操作的影响是有意义的。本文分析了这些误差对噪声的贡献。研究发现,在任何情况下,IntMDCT舍入运算引入的噪声都不会影响编码音频的感知质量。因此,它得出结论MDCT和IntMDCT滤波器组在有损比特率下是可互换的。由于SLS同时使用MDCT和IntMDCT,本文的发现表明仅使用IntMDCT滤波器组的可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Study on rounding errors of IntMDCT in perceptual audio coding
With the proliferation of broadband access and continuous decline of storage prize per gigabyte, there has been an increasing demand of audio solution that provides high sampling rate and high resolution. Lossless audio is undoubtedly the ultimate solution. In response to this demand, MPEG issued a call for proposal soliciting technology contributions that provides a state-of-art solution. At the technology end, lossless compression requires the usage of integer transform. The integer modified discrete cosine transform (IntMDCT) has been adopted in MPEG-4 scalable to lossless (SLS) coding to enable this efficient lossless operation. Because of rounding operations, rounding errors introduced by IntMDCT exist during the whole coding process. With the SLS having capability of using operations that spreads over the bitrate spectrum which ranges from lossy to lossless, it is of interest to study the effect of rounding errors in IntMDCT for operation of SLS in lossy mode. This paper analyzes the contributions of noise due to these errors. It is found that the noise introduced by rounding operations of IntMDCT does not affect the perceptual quality of the coded audio under any circumstances. As such, it concludes that the MDCT and IntMDCT filterbanks are interchangeable at lossy bitrate. With the fact that SLS uses both MDCT and IntMDCT, the finding in this paper suggests the possibility of using only IntMDCT filterbank.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信