K. V. V. Girish, T. Ananthapadmanabha, A. Ramakrishnan
{"title":"Cosine similarity based dictionary learning and source recovery for classification of diverse audio sources","authors":"K. V. V. Girish, T. Ananthapadmanabha, A. Ramakrishnan","doi":"10.1109/INDICON.2016.7839032","DOIUrl":null,"url":null,"abstract":"A dictionary learning based audio source classification algorithm is proposed. Cosine similarity measure is used to select the atoms during dictionary learning. Three proposed objective measures, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights have been used for classification. A frame-wise source classification accuracy of 98.86% is obtained for twelve different sources using SDR measure and a secondary support vector machine classifier. 100% accuracy has been obtained using moving SDR accumulated over 14 successive frames. For ten of the audio sources tested, 100% accuracy requires accumulation of only 6 frames of a signal.","PeriodicalId":283953,"journal":{"name":"2016 IEEE Annual India Conference (INDICON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Annual India Conference (INDICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDICON.2016.7839032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
A dictionary learning based audio source classification algorithm is proposed. Cosine similarity measure is used to select the atoms during dictionary learning. Three proposed objective measures, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights have been used for classification. A frame-wise source classification accuracy of 98.86% is obtained for twelve different sources using SDR measure and a secondary support vector machine classifier. 100% accuracy has been obtained using moving SDR accumulated over 14 successive frames. For ten of the audio sources tested, 100% accuracy requires accumulation of only 6 frames of a signal.