{"title":"Perceptual tempo estimation using GMM-regression","authors":"G. Peeters, Joachim Flocon-Cholet","doi":"10.1145/2390848.2390861","DOIUrl":null,"url":null,"abstract":"Most current tempo estimation algorithms suffer from the so-called octave estimation problems (estimating twice, thrice, half or one-third of a reference tempo). However, it is difficult to qualify an error as octave error without a clear definition of what is the reference tempo. For this reason, and given that tempo is mostly a perceptual notion, we study here the estimation of perceptual tempo. We consider the perceptual tempo as defined by the results of the large-scale experiment made at Last-FM in 2011. We assume that the perception of tempo is related to the rate of variation of four musical attributes: the variation of energy, of harmonic changes, of spectral balance and short-term-event-repetitions. We then propose the use of GMM-Regression to find the relationship between the perceptual tempo and the four musical attributes. In an experiment, we show that the estimation of the tempo provided by GMM-Regression over these attributes outperforms the one provided by a state-of-the-art tempo estimation algorithm. For this task GMM-Regression also largely outperforms SVM-Regression. We finally study the estimation of three perceptual tempo classes (\"Slow\", \"In Between\", \"Fast\") using both GMM-Regression and SVM-Classification.","PeriodicalId":199844,"journal":{"name":"MIRUM '12","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MIRUM '12","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2390848.2390861","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28
Abstract
Most current tempo estimation algorithms suffer from the so-called octave estimation problems (estimating twice, thrice, half or one-third of a reference tempo). However, it is difficult to qualify an error as octave error without a clear definition of what is the reference tempo. For this reason, and given that tempo is mostly a perceptual notion, we study here the estimation of perceptual tempo. We consider the perceptual tempo as defined by the results of the large-scale experiment made at Last-FM in 2011. We assume that the perception of tempo is related to the rate of variation of four musical attributes: the variation of energy, of harmonic changes, of spectral balance and short-term-event-repetitions. We then propose the use of GMM-Regression to find the relationship between the perceptual tempo and the four musical attributes. In an experiment, we show that the estimation of the tempo provided by GMM-Regression over these attributes outperforms the one provided by a state-of-the-art tempo estimation algorithm. For this task GMM-Regression also largely outperforms SVM-Regression. We finally study the estimation of three perceptual tempo classes ("Slow", "In Between", "Fast") using both GMM-Regression and SVM-Classification.