Tai-Ming Chang, En-Ting Chen, Chia-Bin Hsieh, P. Chang
2013 IEEE 2nd Global Conference on Consumer Electronics (GCCE), published 2013-11-14. DOI: 10.1109/GCCE.2013.6664919
Cover song identification with direct chroma feature extraction from AAC files
This paper proposes a low-complexity, effective feature extraction method that operates directly on AAC files. Unlike traditional methods, which must fully decode the audio and then compute fast Fourier transform (FFT) coefficients, the proposed system maps the modified discrete cosine transform (MDCT) coefficients directly into a 12-dimensional chroma feature without fully decoding the file. To speed up matching, segmentation is applied to reduce the time dimension of the feature space. In addition, dynamic programming is used to match songs performed at different tempos. Experimental results show that the proposed system achieves a 62% accuracy rate, an improvement over the traditional FFT-based system, and reduces computational complexity by approximately 35%.
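
The three stages named in the abstract (MDCT-to-chroma mapping, segmentation, and dynamic-programming matching) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so everything here is an assumption made for illustration, including the function names, the 1024-coefficient frame at 44.1 kHz, the 100 Hz to 5 kHz band limits, segmentation by simple averaging, and the use of standard dynamic time warping (DTW) with a cosine distance as the dynamic-programming step. It also assumes the MDCT coefficients have already been recovered from the AAC bitstream and ignores key transposition between versions.

```python
import math

def bin_to_pitch_class(k, n_bins=1024, sample_rate=44100.0):
    """Pitch class (0 = C ... 11 = B) of MDCT bin k, from its center frequency."""
    # An MDCT with n_bins coefficients places bin k at (k + 0.5) * fs / (2 * n_bins).
    freq = (k + 0.5) * sample_rate / (2.0 * n_bins)
    midi = 69.0 + 12.0 * math.log2(freq / 440.0)  # MIDI note 69 is A4 = 440 Hz
    return int(round(midi)) % 12

def chroma_from_mdct(coeffs, sample_rate=44100.0, fmin=100.0, fmax=5000.0):
    """Fold the energy of one frame of MDCT coefficients into 12 pitch classes."""
    n_bins = len(coeffs)
    chroma = [0.0] * 12
    for k, c in enumerate(coeffs):
        freq = (k + 0.5) * sample_rate / (2.0 * n_bins)
        if fmin <= freq <= fmax:  # band limits are an assumption, not from the paper
            chroma[bin_to_pitch_class(k, n_bins, sample_rate)] += c * c
    total = sum(chroma)
    return [v / total for v in chroma] if total > 0 else chroma

def segment_average(frames, seg_len):
    """Average every seg_len consecutive chroma frames to shorten the time axis."""
    out = []
    for start in range(0, len(frames), seg_len):
        block = frames[start:start + seg_len]
        out.append([sum(f[i] for f in block) / len(block) for i in range(12)])
    return out

def cosine_distance(u, v):
    """1 - cosine similarity; returns 1.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0.0 or nv == 0.0:
        return 1.0
    return 1.0 - dot / (nu * nv)

def dtw_cost(seq_a, seq_b):
    """Accumulated DTW cost between two chroma sequences of possibly different length."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = cosine_distance(seq_a[i - 1], seq_b[j - 1])
            # Allow a frame to repeat in either sequence, which absorbs tempo changes.
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]
```

In this sketch, a query would be scored against each reference by running `chroma_from_mdct` per frame, shortening both sequences with `segment_average`, and ranking references by `dtw_cost`. Because DTW lets one frame align with several, a version played at half tempo can still align at low cost. Note that at this frame size each MDCT bin spans roughly 21.5 Hz, so bins at low frequencies cover more than one semitone, which is one reason the sketch discards content below `fmin`.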