{"title":"Affect Detection from Speech using Deep Convolutional Neural Network Architecture","authors":"Saikat Basu, Jaybrata Chakraborty, Md. Aftabuddin","doi":"10.1109/INDICON.2017.8487700","DOIUrl":null,"url":null,"abstract":"From last few years there are many research works have been done on the field of affect detection or emotion recognition from speech signal. Researchers has been directed to find out different emotional content from speech signals, they have tried to extract different features from speech and used different types of supervised or unsupervised learning methods to train a network such a way that a model can be developed which can identify emotion from speech signal successfully. The primary challenges of emotion recognition are choosing the emotional speech corpus (speech database), identification of different features related to speech and an appropriate choice of a classification model. In this work we have explored RML emotional speech corpus for our experiment purpose it is a collection of emotional audiovisual files of different languages. We have analyzed the performance of Deep Convolutional Neural Network with Mel-Spectrogram as features for recognition of emotion.","PeriodicalId":263943,"journal":{"name":"2017 14th IEEE India Council International Conference (INDICON)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th IEEE India Council International Conference (INDICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDICON.2017.8487700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
From last few years there are many research works have been done on the field of affect detection or emotion recognition from speech signal. Researchers has been directed to find out different emotional content from speech signals, they have tried to extract different features from speech and used different types of supervised or unsupervised learning methods to train a network such a way that a model can be developed which can identify emotion from speech signal successfully. The primary challenges of emotion recognition are choosing the emotional speech corpus (speech database), identification of different features related to speech and an appropriate choice of a classification model. In this work we have explored RML emotional speech corpus for our experiment purpose it is a collection of emotional audiovisual files of different languages. We have analyzed the performance of Deep Convolutional Neural Network with Mel-Spectrogram as features for recognition of emotion.