K. V. V. Girish, T. Ananthapadmanabha, A. Ramakrishnan
{"title":"基于余弦相似度的字典学习和源恢复对不同音频源的分类","authors":"K. V. V. Girish, T. Ananthapadmanabha, A. Ramakrishnan","doi":"10.1109/INDICON.2016.7839032","DOIUrl":null,"url":null,"abstract":"A dictionary learning based audio source classification algorithm is proposed. Cosine similarity measure is used to select the atoms during dictionary learning. Three proposed objective measures, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights have been used for classification. A frame-wise source classification accuracy of 98.86% is obtained for twelve different sources using SDR measure and a secondary support vector machine classifier. 100% accuracy has been obtained using moving SDR accumulated over 14 successive frames. For ten of the audio sources tested, 100% accuracy requires accumulation of only 6 frames of a signal.","PeriodicalId":283953,"journal":{"name":"2016 IEEE Annual India Conference (INDICON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Cosine similarity based dictionary learning and source recovery for classification of diverse audio sources\",\"authors\":\"K. V. V. Girish, T. Ananthapadmanabha, A. Ramakrishnan\",\"doi\":\"10.1109/INDICON.2016.7839032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A dictionary learning based audio source classification algorithm is proposed. Cosine similarity measure is used to select the atoms during dictionary learning. Three proposed objective measures, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights have been used for classification. A frame-wise source classification accuracy of 98.86% is obtained for twelve different sources using SDR measure and a secondary support vector machine classifier. 100% accuracy has been obtained using moving SDR accumulated over 14 successive frames. For ten of the audio sources tested, 100% accuracy requires accumulation of only 6 frames of a signal.\",\"PeriodicalId\":283953,\"journal\":{\"name\":\"2016 IEEE Annual India Conference (INDICON)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE Annual India Conference (INDICON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDICON.2016.7839032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Annual India Conference (INDICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDICON.2016.7839032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cosine similarity based dictionary learning and source recovery for classification of diverse audio sources
A dictionary learning based audio source classification algorithm is proposed. Cosine similarity measure is used to select the atoms during dictionary learning. Three proposed objective measures, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights have been used for classification. A frame-wise source classification accuracy of 98.86% is obtained for twelve different sources using SDR measure and a secondary support vector machine classifier. 100% accuracy has been obtained using moving SDR accumulated over 14 successive frames. For ten of the audio sources tested, 100% accuracy requires accumulation of only 6 frames of a signal.