Parameter changes across different emotions in human speech
Tao Li, Cheolwoo Jo
2005 IEEE International Conference on Information Acquisition
DOI: 10.1109/ICIA.2005.1635060
Citations: 2
Abstract
Voice quality is considered to play an important role in conveying emotion in human speech communication. In this paper, we explore the acoustic characteristics of voice quality in emotional speech signals using numerical parameters such as Jitter, RAP, Shimmer, APQ, NHR and SPI. We also examine the role of pitch, pitch range and normalized speech duration. A Korean emotional speech database was collected from a professional actor. Nine sentences with different content were each uttered with six emotions: neutral, happiness, anger, sadness, fear and boredom. Jitter, RAP, Shimmer, APQ, NHR and SPI were computed from the voiced segment containing the vowel /a/ extracted from each emotional sentence. Pitch, pitch range and normalized speech duration of each emotional speech signal were also measured or computed. A statistical analysis of the changes in these nine parameters was performed to characterize the voice quality of human emotional speech.
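The abstract does not state how the perturbation measures were computed, so the following is only a minimal sketch under common textbook definitions, not the authors' procedure. It assumes the glottal cycle periods and peak amplitudes of the voiced /a/ segment have already been extracted. Jitter and Shimmer are taken here as the mean absolute difference between consecutive cycles relative to the mean, RAP as a 3-cycle smoothed period perturbation, and APQ as an 11-cycle smoothed amplitude perturbation. All function names and the window sizes are assumptions made for illustration.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute difference between consecutive cycles,
    relative to the mean value, in percent."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(a - b) for a, b in zip(values, values[1:])]
    return 100.0 * mean(diffs) / mean(values)


def smoothed_perturbation(values, window):
    """Mean absolute deviation of each cycle from the average of a
    centered window of cycles, relative to the mean value, in percent."""
    if window % 2 == 0:
        raise ValueError("window must be odd")
    if len(values) < window:
        raise ValueError("need at least `window` cycles")
    half = window // 2
    devs = [
        abs(values[i] - mean(values[i - half:i + half + 1]))
        for i in range(half, len(values) - half)
    ]
    return 100.0 * mean(devs) / mean(values)


def jitter(periods):
    # Cycle-to-cycle period perturbation (assumed definition).
    return local_perturbation(periods)


def rap(periods):
    # Relative average perturbation with a 3-cycle window (assumed).
    return smoothed_perturbation(periods, 3)


def shimmer(amplitudes):
    # Cycle-to-cycle amplitude perturbation (assumed definition).
    return local_perturbation(amplitudes)


def apq(amplitudes, window=11):
    # Amplitude perturbation quotient; the 11-cycle window is an assumption.
    return smoothed_perturbation(amplitudes, window)
```

Under these assumptions, `jitter(periods)` and `rap(periods)` take a list of cycle durations in seconds, while `shimmer(amplitudes)` and `apq(amplitudes)` take the matching list of peak amplitudes. A perfectly periodic signal yields 0 for all four measures.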