{"title":"基于交错衍生模式和共现矩阵方法的音素识别","authors":"G. Liao, B. Ling, R. W. Lam","doi":"10.1109/ASSP54407.2021.00036","DOIUrl":null,"url":null,"abstract":"Accurate recognition of phonemes has always been a difficult point in speech recognition. This paper attempts to use the co-occurrence matrix method and interlaced derivative patterns method in image processing to complete the task of phoneme recognition. First, perform segmentation of speech based on the spectral energy and spectral centroid. Second, perform feature extraction, and calculate the four features of the co-occurrence matrix method, interlaced derivative pattern method, multi-dimensional voice program parameters and gammatone frequency cepstral coefficients of the phoneme segment. Among them, the co-occurrence matrix method and interlaced derivative pattern method are calculated based on the gammatone spectrum of the voice. Then, use random forest to calculate the importance of feature vectors, and select the top 30 features that random forest considers the most important as the features used in this article. Finally, random forest is used for classification. Note that the phoneme data sets used in this article are all downloaded from YouTube, and the classification we do is vowels, semivowels and consonants. To our best knowledge, this is the first paper that presents the phoneme recognition using interlaced derivative pattern and co-occurrence matrix method.","PeriodicalId":153782,"journal":{"name":"2021 2nd Asia Symposium on Signal Processing (ASSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Phoneme Recognition Using Interlaced Derivative Pattern and Co-occurrence Matrix Method\",\"authors\":\"G. Liao, B. Ling, R. W. Lam\",\"doi\":\"10.1109/ASSP54407.2021.00036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate recognition of phonemes has always been a difficult point in speech recognition. This paper attempts to use the co-occurrence matrix method and interlaced derivative patterns method in image processing to complete the task of phoneme recognition. First, perform segmentation of speech based on the spectral energy and spectral centroid. Second, perform feature extraction, and calculate the four features of the co-occurrence matrix method, interlaced derivative pattern method, multi-dimensional voice program parameters and gammatone frequency cepstral coefficients of the phoneme segment. Among them, the co-occurrence matrix method and interlaced derivative pattern method are calculated based on the gammatone spectrum of the voice. Then, use random forest to calculate the importance of feature vectors, and select the top 30 features that random forest considers the most important as the features used in this article. Finally, random forest is used for classification. Note that the phoneme data sets used in this article are all downloaded from YouTube, and the classification we do is vowels, semivowels and consonants. To our best knowledge, this is the first paper that presents the phoneme recognition using interlaced derivative pattern and co-occurrence matrix method.\",\"PeriodicalId\":153782,\"journal\":{\"name\":\"2021 2nd Asia Symposium on Signal Processing (ASSP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd Asia Symposium on Signal Processing (ASSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASSP54407.2021.00036\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd Asia Symposium on Signal Processing (ASSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASSP54407.2021.00036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Phoneme Recognition Using Interlaced Derivative Pattern and Co-occurrence Matrix Method
Accurate recognition of phonemes has always been a difficult point in speech recognition. This paper attempts to use the co-occurrence matrix method and interlaced derivative patterns method in image processing to complete the task of phoneme recognition. First, perform segmentation of speech based on the spectral energy and spectral centroid. Second, perform feature extraction, and calculate the four features of the co-occurrence matrix method, interlaced derivative pattern method, multi-dimensional voice program parameters and gammatone frequency cepstral coefficients of the phoneme segment. Among them, the co-occurrence matrix method and interlaced derivative pattern method are calculated based on the gammatone spectrum of the voice. Then, use random forest to calculate the importance of feature vectors, and select the top 30 features that random forest considers the most important as the features used in this article. Finally, random forest is used for classification. Note that the phoneme data sets used in this article are all downloaded from YouTube, and the classification we do is vowels, semivowels and consonants. To our best knowledge, this is the first paper that presents the phoneme recognition using interlaced derivative pattern and co-occurrence matrix method.