几种分类器在嘈杂普通话语音情感识别中的比较

Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007) Pub Date : 2007-11-26 DOI:10.1109/IIH-MSP.2007.368

T. Pao, Wen-Yuan Liao, Yu-Te Chen, Jun-Heng Yeh, Yun-Maw Cheng, Charles S. Chien

{"title":"几种分类器在嘈杂普通话语音情感识别中的比较","authors":"T. Pao, Wen-Yuan Liao, Yu-Te Chen, Jun-Heng Yeh, Yun-Maw Cheng, Charles S. Chien","doi":"10.1109/IIH-MSP.2007.368","DOIUrl":null,"url":null,"abstract":"Automatic recognition of emotions in speech aims at building classifiers for classifying emotions in test emotional speech. This paper presents an emotion recognition system to compare several classifiers from clean and noisy speech. Five emotions, including anger, happiness, sadness, neutral and boredom, from Mandarin emotional speech are investigated. The classifiers studied include KNN WCAP GMM HMM and W-DKNN. Feature selection with KNN was also included to compress acoustic features before classifying the emotional states of clean and noisy speech. Experimental results show that the proposed W-DKNN outperformed at every SNR speech among the three KNN-based classifiers and achieved highest accuracy from clean speech to 20dB noisy speech when compared with all the classifiers.","PeriodicalId":385132,"journal":{"name":"Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":"{\"title\":\"Comparison of Several Classifiers for Emotion Recognition from Noisy Mandarin Speech\",\"authors\":\"T. Pao, Wen-Yuan Liao, Yu-Te Chen, Jun-Heng Yeh, Yun-Maw Cheng, Charles S. Chien\",\"doi\":\"10.1109/IIH-MSP.2007.368\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic recognition of emotions in speech aims at building classifiers for classifying emotions in test emotional speech. This paper presents an emotion recognition system to compare several classifiers from clean and noisy speech. Five emotions, including anger, happiness, sadness, neutral and boredom, from Mandarin emotional speech are investigated. The classifiers studied include KNN WCAP GMM HMM and W-DKNN. Feature selection with KNN was also included to compress acoustic features before classifying the emotional states of clean and noisy speech. Experimental results show that the proposed W-DKNN outperformed at every SNR speech among the three KNN-based classifiers and achieved highest accuracy from clean speech to 20dB noisy speech when compared with all the classifiers.\",\"PeriodicalId\":385132,\"journal\":{\"name\":\"Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"23\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIH-MSP.2007.368\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIH-MSP.2007.368","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 23

摘要

语音情绪自动识别的目的是建立对测试情绪语音中的情绪进行分类的分类器。本文提出了一种情感识别系统，用于比较几种分类器对干净语音和有噪声语音的识别效果。研究了汉语情感言语中的五种情绪，包括愤怒、快乐、悲伤、中性和无聊。研究的分类器包括KNN、WCAP、GMM、HMM和W-DKNN。利用KNN进行特征选择，在对干净语音和嘈杂语音的情绪状态进行分类之前对声学特征进行压缩。实验结果表明，在三种基于knn的分类器中，所提出的W-DKNN在每个信噪比的语音上都表现优异，并且在从干净语音到20dB噪声语音的分类器中，与所有分类器相比，具有最高的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Comparison of Several Classifiers for Emotion Recognition from Noisy Mandarin Speech

Automatic recognition of emotions in speech aims at building classifiers for classifying emotions in test emotional speech. This paper presents an emotion recognition system to compare several classifiers from clean and noisy speech. Five emotions, including anger, happiness, sadness, neutral and boredom, from Mandarin emotional speech are investigated. The classifiers studied include KNN WCAP GMM HMM and W-DKNN. Feature selection with KNN was also included to compress acoustic features before classifying the emotional states of clean and noisy speech. Experimental results show that the proposed W-DKNN outperformed at every SNR speech among the three KNN-based classifiers and achieved highest accuracy from clean speech to 20dB noisy speech when compared with all the classifiers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007)

自引率

0.00%

发文量