Speech Emotion Recognition using Convolution Neural Networks and Deep Stride Convolutional Neural Networks

T. Wani, T. Gunawan, Syed Asif Ahmad Qadri, H. Mansor, M. Kartiwi, N. Ismail
{"title":"Speech Emotion Recognition using Convolution Neural Networks and Deep Stride Convolutional Neural Networks","authors":"T. Wani, T. Gunawan, Syed Asif Ahmad Qadri, H. Mansor, M. Kartiwi, N. Ismail","doi":"10.1109/ICWT50448.2020.9243622","DOIUrl":null,"url":null,"abstract":"An assortment of techniques has been presented in the area of Speech Emotion Recognition (SER), where the main focus is to recognize the silent discriminants and useful features of speech signals. These features undergo the process of classification to recognize the specific emotion of a speaker. In recent times, deep learning techniques have emerged as a breakthrough in speech emotion recognition to detect and classify emotions. In this paper, we have modified a recently developed different network architecture of convolutional neural networks, i.e., Deep Stride Convolutional Neural Networks (DSCNN), by taking a smaller number of convolutional layers to increase the computational speed while still maintaining accuracy. Besides, we trained the state-of-art model of CNN and proposed DSCNN on spectrograms generated from the SAVEE speech emotion dataset. For the evaluation process, four emotions angry, happy, neutral, and sad, were considered. Evaluation results show that the proposed architecture DSCNN, with the prediction accuracy of 87.8%, outperforms CNN with 79.4% accuracy.","PeriodicalId":304605,"journal":{"name":"2020 6th International Conference on Wireless and Telematics (ICWT)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th International Conference on Wireless and Telematics (ICWT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWT50448.2020.9243622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16

Abstract

An assortment of techniques has been presented in the area of Speech Emotion Recognition (SER), where the main focus is to recognize the silent discriminants and useful features of speech signals. These features undergo the process of classification to recognize the specific emotion of a speaker. In recent times, deep learning techniques have emerged as a breakthrough in speech emotion recognition to detect and classify emotions. In this paper, we have modified a recently developed different network architecture of convolutional neural networks, i.e., Deep Stride Convolutional Neural Networks (DSCNN), by taking a smaller number of convolutional layers to increase the computational speed while still maintaining accuracy. Besides, we trained the state-of-art model of CNN and proposed DSCNN on spectrograms generated from the SAVEE speech emotion dataset. For the evaluation process, four emotions angry, happy, neutral, and sad, were considered. Evaluation results show that the proposed architecture DSCNN, with the prediction accuracy of 87.8%, outperforms CNN with 79.4% accuracy.
基于卷积神经网络和深度跨步卷积神经网络的语音情感识别
语音情感识别(SER)领域提出了各种各样的技术,其中主要的重点是识别语音信号的无声判别和有用特征。这些特征经过分类的过程,以识别说话人的特定情绪。近年来,深度学习技术已经成为语音情感识别的一个突破,用于检测和分类情绪。在本文中,我们修改了最近开发的卷积神经网络的不同网络架构,即深度跨步卷积神经网络(Deep Stride convolutional neural networks, DSCNN),通过采用更少的卷积层来提高计算速度,同时保持准确性。此外,我们训练了最先进的CNN模型,并在SAVEE语音情感数据集生成的频谱图上提出了DSCNN。在评估过程中,考虑了四种情绪:愤怒、快乐、中性和悲伤。评价结果表明,本文提出的结构DSCNN的预测准确率为87.8%,优于准确率为79.4%的CNN。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信