Isolated Chinese lyrics with accompaniment recognition based on SVM
Juanjuan Cai, Na Li, Hui Wang, Bin Zhu
2016 International Conference on Audio, Language and Image Processing (ICALIP), published 2016-07-01
DOI: 10.1109/ICALIP.2016.7846536 (https://doi.org/10.1109/ICALIP.2016.7846536)
Speech recognition is an active research area in audio technology. Two methods are commonly used to recognize lyrics sung with accompaniment: one applies automatic speech recognition to singing, and the other treats the task as sound classification, extracting audio features and then classifying them with a pattern-matching classifier. In this paper we take the sound classification approach. We build our own experimental database of 4650 isolated Chinese lyric words in 31 classes, cut from different songs, and use these words as the recognition units. Because speaking and singing share a similar production mechanism, we extract 39-dimensional MFCC feature parameters, which are widely used in speech recognition. Using the training material, we adjust the kernel parameters and choose the functions used to train an SVM classifier. The trained SVM classification system is then used to recognize the lyrics, and it achieves an average recognition accuracy of 42.80%.
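The abstract does not say how the 39 MFCC dimensions are composed. A common convention in speech recognition, assumed here and not confirmed by the paper, is 13 static coefficients per frame plus their first-order (delta) and second-order (delta-delta) differences. The sketch below shows only that stacking step in plain Python. The names `delta` and `stack_39`, the regression window width of 2, and the input values are hypothetical and chosen for illustration.

```python
from typing import List

Frame = List[float]


def delta(frames: List[Frame], width: int = 2) -> List[Frame]:
    """Regression-based delta coefficients; edge frames are repeated."""
    denom = 2 * sum(n * n for n in range(1, width + 1))
    last = len(frames) - 1
    out: List[Frame] = []
    for t in range(len(frames)):
        row: Frame = []
        for k in range(len(frames[t])):
            acc = 0.0
            for n in range(1, width + 1):
                ahead = frames[min(t + n, last)][k]   # clamp at the last frame
                behind = frames[max(t - n, 0)][k]     # clamp at the first frame
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


def stack_39(static: List[Frame]) -> List[Frame]:
    """Append delta and delta-delta to each 13-dimensional static frame."""
    d1 = delta(static)
    d2 = delta(d1)
    return [s + a + b for s, a, b in zip(static, d1, d2)]


# Made-up input (assumption): 5 frames of 13 static coefficients each.
static = [[float(t * (k + 1)) for k in range(13)] for t in range(5)]
features = stack_39(static)
print(len(features), len(features[0]))  # prints: 5 39
```

In a full pipeline, the static coefficients would come from an MFCC front end applied to each isolated lyric word, and the resulting 39-dimensional vectors would be the input to the SVM classifier.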