Emotion recognition on the basis of audio signal using Naive Bayes classifier

2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI) Pub Date : 2016-09-01 DOI:10.1109/ICACCI.2016.7732408

Sagar K. Bhakre, Arti V. Bang

{"title":"Emotion recognition on the basis of audio signal using Naive Bayes classifier","authors":"Sagar K. Bhakre, Arti V. Bang","doi":"10.1109/ICACCI.2016.7732408","DOIUrl":null,"url":null,"abstract":"In this paper we have studied and implemented the classification of audio signal into four basic emotional state. For that we have considered different statistical features of pitch, energy, and ZCR (Zero Crossing Rate) MFCC (Mel frequency cepstral coefficient) from 2000 utterances of the created audio signal database. In that, Pitch feature is extracted by AMDF (average magnitude difference method) and energy is calculated by sum of square absolute value of magnitude spectrum. And MFCC is calculated by taking DCT (Discrete cosine transform) of its energies spectrum by keeping the DCT coefficients 1-14 and discarding the rest. In statistical modeling, regression analysis is a statistical process for calculating approximately the variables. It comprise many techniques for modeling and analyzing several variables. In this paper Naïve Bayes Classifier is used to classify the audio signal into four different emotions. Speech signal is random signal so we have to predict the future sample and Naïve Bayes Classifier is totally probability based classifier so in speech analysis for accurate prediction we are using Naïve Bayes classifier. In the speech signal for recognition of signal classifier require millions of dataset. The advantage of Naïve Bayes classifier is that it recognizes the signal with minimum dataset.","PeriodicalId":371328,"journal":{"name":"2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"32","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACCI.2016.7732408","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 32

Abstract

In this paper we have studied and implemented the classification of audio signal into four basic emotional state. For that we have considered different statistical features of pitch, energy, and ZCR (Zero Crossing Rate) MFCC (Mel frequency cepstral coefficient) from 2000 utterances of the created audio signal database. In that, Pitch feature is extracted by AMDF (average magnitude difference method) and energy is calculated by sum of square absolute value of magnitude spectrum. And MFCC is calculated by taking DCT (Discrete cosine transform) of its energies spectrum by keeping the DCT coefficients 1-14 and discarding the rest. In statistical modeling, regression analysis is a statistical process for calculating approximately the variables. It comprise many techniques for modeling and analyzing several variables. In this paper Naïve Bayes Classifier is used to classify the audio signal into four different emotions. Speech signal is random signal so we have to predict the future sample and Naïve Bayes Classifier is totally probability based classifier so in speech analysis for accurate prediction we are using Naïve Bayes classifier. In the speech signal for recognition of signal classifier require millions of dataset. The advantage of Naïve Bayes classifier is that it recognizes the signal with minimum dataset.

查看原文本刊更多论文

基于音频信号的朴素贝叶斯分类器情感识别

本文研究并实现了音频信号四种基本情绪状态的分类。为此，我们从创建的音频信号数据库的2000个语音中考虑了不同的音调、能量和ZCR(过零率)MFCC (Mel频率倒谱系数)的统计特征。其中，基音特征提取采用平均幅度差法(AMDF)，能量计算采用幅度谱绝对值平方和。MFCC通过对其能谱进行DCT (Discrete cosine transform，离散余弦变换)计算，保留DCT系数1-14，其余部分丢弃。在统计建模中，回归分析是对变量进行近似计算的统计过程。它包含许多建模和分析多个变量的技术。本文使用Naïve贝叶斯分类器将音频信号分为四种不同的情绪。语音信号是随机信号，所以我们必须预测未来的样本，Naïve贝叶斯分类器是完全基于概率的分类器，所以在语音分析中，为了准确预测，我们使用Naïve贝叶斯分类器。对语音信号进行识别的信号分类器需要数以百万计的数据集。Naïve贝叶斯分类器的优点是用最少的数据集识别信号。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

自引率

0.00%

发文量