Fast and Accurate Song Recognition: an Approach Based on Multi-Index Hashing
Salvatore Serrano, M. Scarpa
2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), published 2022-09-22
DOI: 10.23919/softcom55329.2022.9911351 (https://doi.org/10.23919/softcom55329.2022.9911351)
Citations: 0
Abstract
Automatically recognizing short excerpts of broadcast or played commercial songs in real time is of wide interest to researchers and companies working in audio signal processing. It is difficult, however, to devise an approach that is both robust and fast enough to analyze several audio streams at the same time. In this paper, we compare a specific improvement of an algorithm we recently proposed against several baseline approaches. Specifically, we introduce an approach based on Multi-Index Hashing that noticeably speeds up fingerprint search, even on very large datasets. Experiments on the MTG-Jamendo dataset, which contains more than 50,000 songs, show that our approach outperforms the others when accuracy, precision, and query time are considered jointly.
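The abstract does not describe how the index is built, so the following is only a minimal sketch of the general multi-index hashing idea, not the authors' implementation. It assumes fingerprints are fixed-length binary codes compared by Hamming distance. The class name `MultiIndexHash`, the 32-bit code length, and the choice of 4 chunks are illustrative assumptions that do not come from the paper. Each code is split into disjoint chunks with one hash table per chunk; by the pigeonhole principle, any stored code within Hamming distance r of a query must agree with it on at least one chunk to within floor(r / chunks) bits, so only those buckets need to be probed before the full distance is verified.

```python
from collections import defaultdict
from itertools import combinations


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two non-negative integers."""
    return bin(a ^ b).count("1")


class MultiIndexHash:
    """Illustrative multi-index hash over fixed-length binary codes (hypothetical helper)."""

    def __init__(self, bits: int = 32, chunks: int = 4):
        if bits % chunks != 0:
            raise ValueError("bits must be divisible by chunks")
        self.bits = bits
        self.chunks = chunks
        self.width = bits // chunks              # bits per chunk
        self.mask = (1 << self.width) - 1
        # One hash table per chunk: chunk value -> list of stored code indices.
        self.tables = [defaultdict(list) for _ in range(chunks)]
        self.codes = []                          # list of (code, label)

    def _split(self, code: int):
        """Cut a code into `chunks` disjoint substrings of `width` bits."""
        return [(code >> (i * self.width)) & self.mask for i in range(self.chunks)]

    def add(self, code: int, label: str) -> None:
        idx = len(self.codes)
        self.codes.append((code, label))
        for table, sub in zip(self.tables, self._split(code)):
            table[sub].append(idx)

    def _neighbours(self, sub: int, radius: int):
        """Yield every chunk value within `radius` bit flips of `sub`."""
        for k in range(radius + 1):
            for positions in combinations(range(self.width), k):
                flipped = sub
                for p in positions:
                    flipped ^= 1 << p
                yield flipped

    def query(self, code: int, radius: int):
        """Return (label, distance) for all stored codes within `radius` bits."""
        sub_radius = radius // self.chunks       # pigeonhole bound per chunk
        candidates = set()
        for table, sub in zip(self.tables, self._split(code)):
            for probe in self._neighbours(sub, sub_radius):
                candidates.update(table.get(probe, ()))
        # Verify candidates with the full Hamming distance.
        results = []
        for idx in candidates:
            stored, label = self.codes[idx]
            dist = hamming(stored, code)
            if dist <= radius:
                results.append((label, dist))
        return sorted(results, key=lambda item: (item[1], item[0]))
```

In this sketch, a query with a small radius touches only a handful of buckets per table instead of scanning every stored code, which is the property that makes the technique attractive for large fingerprint collections. How fingerprints are extracted from audio, and how matches are aggregated into a song decision, is outside its scope.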