{"title":"Arabic Speech Emotion Recognition Method Based On LPC And PPSD","authors":"O. A. Mohammad, M. Elhadef","doi":"10.1109/ICCAKM50778.2021.9357769","DOIUrl":null,"url":null,"abstract":"This research detects and recognize the emotions in Arabic speech audio files that contains records of human voices with different emotion classes (sad, happy, surprised, and questioning). In the area of emotion detection, when a person becomes emotional, his voice is adjusted based on the state of emotion. As the acoustic features like pressure, strength and loudness varies from a state of emotion to another. However, in the detection of feelings, the classification and modeling part of the features gets priority with the extracted features. Therefore, extracting the best features that describes the emotions stats is the most challenging task. This paper proposes an efficient approach to recognize the Arabic speech emotions. The presented method contains three main phases, signal preprocessing phase for noise removal and signal bandwidth reduction, feature extraction phase using a combination of Linear Predictive Codes (LPC) and the 10-degree polynomial Curve fitting Coefficients over the periodogram power spectral density function of the speech signal and machine learning phase using various machine learning algorithms (ANN, KNN, SVM, Decision Tree, Logistic Regression) and compare between their accuracy results to get the best accuracy.","PeriodicalId":165854,"journal":{"name":"2021 2nd International Conference on Computation, Automation and Knowledge Management (ICCAKM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Conference on Computation, Automation and Knowledge Management (ICCAKM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAKM50778.2021.9357769","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This research detects and recognize the emotions in Arabic speech audio files that contains records of human voices with different emotion classes (sad, happy, surprised, and questioning). In the area of emotion detection, when a person becomes emotional, his voice is adjusted based on the state of emotion. As the acoustic features like pressure, strength and loudness varies from a state of emotion to another. However, in the detection of feelings, the classification and modeling part of the features gets priority with the extracted features. Therefore, extracting the best features that describes the emotions stats is the most challenging task. This paper proposes an efficient approach to recognize the Arabic speech emotions. The presented method contains three main phases, signal preprocessing phase for noise removal and signal bandwidth reduction, feature extraction phase using a combination of Linear Predictive Codes (LPC) and the 10-degree polynomial Curve fitting Coefficients over the periodogram power spectral density function of the speech signal and machine learning phase using various machine learning algorithms (ANN, KNN, SVM, Decision Tree, Logistic Regression) and compare between their accuracy results to get the best accuracy.