基于快速群稀疏学习的多基音估计

Ted Kronvall, Filip Elvander, Stefan Ingi Adalbjornsson, A. Jakobsson
{"title":"基于快速群稀疏学习的多基音估计","authors":"Ted Kronvall, Filip Elvander, Stefan Ingi Adalbjornsson, A. Jakobsson","doi":"10.1109/EUSIPCO.2016.7760417","DOIUrl":null,"url":null,"abstract":"In this work, we consider the problem of multi-pitch estimation using sparse heuristics and convex modeling. In general, this is a difficult non-linear optimization problem, as the frequencies belonging to one pitch often overlap the frequencies belonging to other pitches, thereby causing ambiguity between pitches with similar frequency content. The problem is further complicated by the fact that the number of pitches is typically not known. In this work, we propose a sparse modeling framework using a generalized chroma representation in order to remove redundancy and lower the dictionary's block-coherency. The found chroma estimates are then used to solve a small convex problem, whereby spectral smoothness is enforced, resulting in the corresponding pitch estimates. Compared with previously published sparse approaches, the resulting algorithm reduces the computational complexity of each iteration, as well as speeding up the overall convergence.","PeriodicalId":127068,"journal":{"name":"2016 24th European Signal Processing Conference (EUSIPCO)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Multi-pitch estimation via fast group sparse learning\",\"authors\":\"Ted Kronvall, Filip Elvander, Stefan Ingi Adalbjornsson, A. Jakobsson\",\"doi\":\"10.1109/EUSIPCO.2016.7760417\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we consider the problem of multi-pitch estimation using sparse heuristics and convex modeling. In general, this is a difficult non-linear optimization problem, as the frequencies belonging to one pitch often overlap the frequencies belonging to other pitches, thereby causing ambiguity between pitches with similar frequency content. The problem is further complicated by the fact that the number of pitches is typically not known. In this work, we propose a sparse modeling framework using a generalized chroma representation in order to remove redundancy and lower the dictionary's block-coherency. The found chroma estimates are then used to solve a small convex problem, whereby spectral smoothness is enforced, resulting in the corresponding pitch estimates. Compared with previously published sparse approaches, the resulting algorithm reduces the computational complexity of each iteration, as well as speeding up the overall convergence.\",\"PeriodicalId\":127068,\"journal\":{\"name\":\"2016 24th European Signal Processing Conference (EUSIPCO)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 24th European Signal Processing Conference (EUSIPCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EUSIPCO.2016.7760417\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 24th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUSIPCO.2016.7760417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

在这项工作中,我们考虑了使用稀疏启发式和凸建模的多音高估计问题。一般来说,这是一个困难的非线性优化问题,因为属于一个音高的频率经常与属于其他音高的频率重叠,从而导致具有相似频率内容的音高之间的模糊。这个问题由于通常不知道音调的数量而变得更加复杂。在这项工作中,我们提出了一个使用广义色度表示的稀疏建模框架,以消除冗余并降低字典的块相干性。然后使用发现的色度估计来解决一个小的凸问题,从而强制执行光谱平滑,从而产生相应的音高估计。与先前发表的稀疏方法相比,所得到的算法降低了每次迭代的计算复杂度,并加快了整体收敛速度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-pitch estimation via fast group sparse learning
In this work, we consider the problem of multi-pitch estimation using sparse heuristics and convex modeling. In general, this is a difficult non-linear optimization problem, as the frequencies belonging to one pitch often overlap the frequencies belonging to other pitches, thereby causing ambiguity between pitches with similar frequency content. The problem is further complicated by the fact that the number of pitches is typically not known. In this work, we propose a sparse modeling framework using a generalized chroma representation in order to remove redundancy and lower the dictionary's block-coherency. The found chroma estimates are then used to solve a small convex problem, whereby spectral smoothness is enforced, resulting in the corresponding pitch estimates. Compared with previously published sparse approaches, the resulting algorithm reduces the computational complexity of each iteration, as well as speeding up the overall convergence.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信