A robust characterization of audio signals using the level of information content per Chroma

A. Manzo-Martinez, José Antonio Camarena Ibarrola
Published in: 2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)
Publication date: 2011-12-14
DOI: 10.1109/ISSPIT.2011.6151562 (https://doi.org/10.1109/ISSPIT.2011.6151562)
Citations: 5

Abstract

In this paper we propose a new technique to characterize audio signals. We use Shannon's entropy to estimate the level of information content per chroma, and we show that involving entropy contributes to a more robust audio characterization. Based on this feature, we propose a new audio fingerprint (AFP), which we call the Entropy-Chroma Fingerprint (ECFP). Two approaches to estimating entropy were considered: the first assumes that the spectral coefficients are normally distributed, while the second estimates their probability density function (PDF) with the Parzen window estimation method. We compared the robustness of the ECFP against that of the Chromagram-Based Audio Fingerprint (CBFP), which is determined using the Constant Q Transform (CQT). A total of 3,500 AFPs were determined from songs of several genres. A subset of 350 songs was severely degraded and then searched for using 5-second excerpts. The ECFP determined under the Gaussian assumption on the PDF turned out to be much more robust than the CBFP. It is also much faster to compute than both the CBFP and the ECFP determined with Parzen windows, and it is still more robust.
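The two entropy estimators named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact formulation, so everything below is an assumption made for illustration: the function names (`gaussian_entropy`, `parzen_entropy`, `entropy_per_chroma`), the treatment of each chroma as a one-dimensional set of real-valued samples, the Gaussian kernel, and the `bandwidth` parameter. The first function uses the closed-form differential entropy of a normal distribution, H = 0.5 ln(2πeσ²); the second uses a resubstitution estimate, averaging -ln p̂(x) over the samples, where p̂ is a Parzen window density.

```python
import numpy as np

def gaussian_entropy(samples, eps=1e-12):
    """Differential entropy (in nats) of 1-D samples, assuming normality."""
    # eps guards against log(0) when the samples are constant.
    var = np.var(samples) + eps
    return 0.5 * np.log(2.0 * np.pi * np.e * var)

def parzen_entropy(samples, bandwidth):
    """Resubstitution entropy estimate (in nats) from a Parzen window density.

    A Gaussian kernel is assumed here; the kernel choice is illustrative.
    """
    x = np.asarray(samples, dtype=float)
    # Pairwise scaled differences between all samples, shape (n, n).
    diffs = (x[:, None] - x[None, :]) / bandwidth
    kernel = np.exp(-0.5 * diffs**2) / (bandwidth * np.sqrt(2.0 * np.pi))
    # Estimated density evaluated at each sample point.
    density = kernel.mean(axis=1)
    return -np.mean(np.log(density))

def entropy_per_chroma(chroma_frames, estimator=gaussian_entropy, **kwargs):
    """Apply an entropy estimator to each chroma column.

    chroma_frames: array of shape (n_frames, n_chroma), hypothetical layout.
    Returns one entropy value per chroma.
    """
    frames = np.asarray(chroma_frames, dtype=float)
    return np.array(
        [estimator(frames[:, k], **kwargs) for k in range(frames.shape[1])]
    )

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-in for per-chroma coefficients: 200 frames, 12 chromas.
    frames = rng.normal(size=(200, 12)) * np.linspace(0.5, 2.0, 12)
    print(entropy_per_chroma(frames))
    print(entropy_per_chroma(frames, estimator=parzen_entropy, bandwidth=0.3))
```

The closed-form estimator needs only one variance per chroma, whereas the Parzen estimate in this sketch evaluates a kernel for every pair of samples, so its cost grows quadratically with the number of samples. That difference is in line with the abstract's remark that the Gaussian variant is much faster to process.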