Vishnu Srinivasa Murthy Yarlagadda, T. K. R. Jeshventh, M. Zoeb, M. Saumyadip, S. Koolagudi
{"title":"Singer Identification from Smaller Snippets of Audio Clips Using Acoustic Features and DNNs","authors":"Vishnu Srinivasa Murthy Yarlagadda, T. K. R. Jeshventh, M. Zoeb, M. Saumyadip, S. Koolagudi","doi":"10.1109/IC3.2018.8530602","DOIUrl":null,"url":null,"abstract":"Singer identification (SID) is one of the crucial tasks of music information retrieval (MIR). The presence of background accompaniment makes the task little complicated. The performance of SID with the combination of the cepstral and chromagram features has been analyzed in this work. Mel-frequency cepstral coefficients (MFCCs) and linear prediction cepstral features (LPCCs) have been computed as cepstral features and added to 12-dimensional chroma vector which is obtained from chromagram. Two different datasets have been used for experimentation, of which one is standard artist-20 and the other one is Indian singers database, which is proposed by us, with 20 Indian singers. Two different classifiers, namely random forest (RF) and deep neural networks (DNNs) are considered based on their performance in estimating the singers. The proposed approach is found to be efficient even if the input clip is of length five seconds.","PeriodicalId":118388,"journal":{"name":"2018 Eleventh International Conference on Contemporary Computing (IC3)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Eleventh International Conference on Contemporary Computing (IC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3.2018.8530602","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Singer identification (SID) is one of the crucial tasks of music information retrieval (MIR). The presence of background accompaniment makes the task little complicated. The performance of SID with the combination of the cepstral and chromagram features has been analyzed in this work. Mel-frequency cepstral coefficients (MFCCs) and linear prediction cepstral features (LPCCs) have been computed as cepstral features and added to 12-dimensional chroma vector which is obtained from chromagram. Two different datasets have been used for experimentation, of which one is standard artist-20 and the other one is Indian singers database, which is proposed by us, with 20 Indian singers. Two different classifiers, namely random forest (RF) and deep neural networks (DNNs) are considered based on their performance in estimating the singers. The proposed approach is found to be efficient even if the input clip is of length five seconds.