Xinyu Yang, Jing Luo, Yinrui Wang, Xi Zhao, Juan Li
{"title":"Combining auditory perception and visual features for regional recognition of Chinese folk songs","authors":"Xinyu Yang, Jing Luo, Yinrui Wang, Xi Zhao, Juan Li","doi":"10.1145/3192975.3193006","DOIUrl":null,"url":null,"abstract":"The regional recognition of Chinese folk songs is not only conducive to discovering music characteristics and regional styles of specific geographical folk songs, but also has important research value in the existing music information retrieval system. In this paper, an effective and novel approach for regional recognition of Chinese folk songs is proposed, which is based on the fusion of auditory perception and visual features using an ensemble SVM classifier. When the auditory perception features are extracted, the temporal relation among the frame features is fully considered. For the visual features, the color time-frequency maps are used to replace the gray-scale images to capture more texture information, and in order to better characterize the image texture, the texture patterns and the corresponding intensity information are both extracted. Experimental results show that the recognition method combined with auditory perception and visual features can effectively identify Chinese folk songs of different regions with an accuracy rate of 89.29%, which outperforms other state-of-the-art approaches.","PeriodicalId":128533,"journal":{"name":"Proceedings of the 2018 10th International Conference on Computer and Automation Engineering","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 10th International Conference on Computer and Automation Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3192975.3193006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The regional recognition of Chinese folk songs is not only conducive to discovering music characteristics and regional styles of specific geographical folk songs, but also has important research value in the existing music information retrieval system. In this paper, an effective and novel approach for regional recognition of Chinese folk songs is proposed, which is based on the fusion of auditory perception and visual features using an ensemble SVM classifier. When the auditory perception features are extracted, the temporal relation among the frame features is fully considered. For the visual features, the color time-frequency maps are used to replace the gray-scale images to capture more texture information, and in order to better characterize the image texture, the texture patterns and the corresponding intensity information are both extracted. Experimental results show that the recognition method combined with auditory perception and visual features can effectively identify Chinese folk songs of different regions with an accuracy rate of 89.29%, which outperforms other state-of-the-art approaches.