{"title":"基于韵律、谱和小波特征的旁遮普语语音情感识别","authors":"Chaitanya Singla, Sukhdev Singh","doi":"10.1109/ICETET-SIP-2254415.2022.9791593","DOIUrl":null,"url":null,"abstract":"A rapid advancement can be seen in affective computing, speech emotion recognition (SER) and human computer interaction, SER has been getting more consideration. English, German and Indian are all popular languages that can be used for SER. Due to the lack of Punjabi speech emotion data, very few researchers have tried to implement Punjabi SER systems. Despite the fact that Punjabi is one of the most commonly spoken and understood languages over the web, there has not yet been a Punjabi speech emotion database. This work introduces a semi-natural Punjabi speech emotions database that was compiled from several TV series and Punjabi movies. This database contains utterances of male and female actors, and covers four emotions: happy, sad, neutral, and happy. The database is used to compute prosodic, wavelet and spectral features for emotion recognition. Wavelet parameters, in addition to the Mel frequency Cepstral (MFCC), which are widely used for SER, are also taken into account. The extracted features can also be used to classify emotions.","PeriodicalId":117229,"journal":{"name":"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)","volume":"139 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Punjabi Speech Emotion Recognition using Prosodic, Spectral and Wavelet Features\",\"authors\":\"Chaitanya Singla, Sukhdev Singh\",\"doi\":\"10.1109/ICETET-SIP-2254415.2022.9791593\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A rapid advancement can be seen in affective computing, speech emotion recognition (SER) and human computer interaction, SER has been getting more consideration. English, German and Indian are all popular languages that can be used for SER. Due to the lack of Punjabi speech emotion data, very few researchers have tried to implement Punjabi SER systems. Despite the fact that Punjabi is one of the most commonly spoken and understood languages over the web, there has not yet been a Punjabi speech emotion database. This work introduces a semi-natural Punjabi speech emotions database that was compiled from several TV series and Punjabi movies. This database contains utterances of male and female actors, and covers four emotions: happy, sad, neutral, and happy. The database is used to compute prosodic, wavelet and spectral features for emotion recognition. Wavelet parameters, in addition to the Mel frequency Cepstral (MFCC), which are widely used for SER, are also taken into account. The extracted features can also be used to classify emotions.\",\"PeriodicalId\":117229,\"journal\":{\"name\":\"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)\",\"volume\":\"139 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICETET-SIP-2254415.2022.9791593\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETET-SIP-2254415.2022.9791593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Punjabi Speech Emotion Recognition using Prosodic, Spectral and Wavelet Features
A rapid advancement can be seen in affective computing, speech emotion recognition (SER) and human computer interaction, SER has been getting more consideration. English, German and Indian are all popular languages that can be used for SER. Due to the lack of Punjabi speech emotion data, very few researchers have tried to implement Punjabi SER systems. Despite the fact that Punjabi is one of the most commonly spoken and understood languages over the web, there has not yet been a Punjabi speech emotion database. This work introduces a semi-natural Punjabi speech emotions database that was compiled from several TV series and Punjabi movies. This database contains utterances of male and female actors, and covers four emotions: happy, sad, neutral, and happy. The database is used to compute prosodic, wavelet and spectral features for emotion recognition. Wavelet parameters, in addition to the Mel frequency Cepstral (MFCC), which are widely used for SER, are also taken into account. The extracted features can also be used to classify emotions.