{"title":"Punjabi Speech Emotion Recognition using Prosodic, Spectral and Wavelet Features","authors":"Chaitanya Singla, Sukhdev Singh","doi":"10.1109/ICETET-SIP-2254415.2022.9791593","DOIUrl":null,"url":null,"abstract":"A rapid advancement can be seen in affective computing, speech emotion recognition (SER) and human computer interaction, SER has been getting more consideration. English, German and Indian are all popular languages that can be used for SER. Due to the lack of Punjabi speech emotion data, very few researchers have tried to implement Punjabi SER systems. Despite the fact that Punjabi is one of the most commonly spoken and understood languages over the web, there has not yet been a Punjabi speech emotion database. This work introduces a semi-natural Punjabi speech emotions database that was compiled from several TV series and Punjabi movies. This database contains utterances of male and female actors, and covers four emotions: happy, sad, neutral, and happy. The database is used to compute prosodic, wavelet and spectral features for emotion recognition. Wavelet parameters, in addition to the Mel frequency Cepstral (MFCC), which are widely used for SER, are also taken into account. The extracted features can also be used to classify emotions.","PeriodicalId":117229,"journal":{"name":"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)","volume":"139 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETET-SIP-2254415.2022.9791593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A rapid advancement can be seen in affective computing, speech emotion recognition (SER) and human computer interaction, SER has been getting more consideration. English, German and Indian are all popular languages that can be used for SER. Due to the lack of Punjabi speech emotion data, very few researchers have tried to implement Punjabi SER systems. Despite the fact that Punjabi is one of the most commonly spoken and understood languages over the web, there has not yet been a Punjabi speech emotion database. This work introduces a semi-natural Punjabi speech emotions database that was compiled from several TV series and Punjabi movies. This database contains utterances of male and female actors, and covers four emotions: happy, sad, neutral, and happy. The database is used to compute prosodic, wavelet and spectral features for emotion recognition. Wavelet parameters, in addition to the Mel frequency Cepstral (MFCC), which are widely used for SER, are also taken into account. The extracted features can also be used to classify emotions.