A preliminary speech analysis for recognizing emotion
Authors: A. Razak, Mohamad Izani Zainal Abidin, R. Komiya
Published in: Proceedings. Student Conference on Research and Development, 2003. SCORED 2003.
Publication date: 2003-08-25
DOI: 10.1109/SCORED.2003.1459662 (https://doi.org/10.1109/SCORED.2003.1459662)
Citation count: 3
Abstract
This paper discusses speech analysis for extracting emotion from voice. An emotional Malay and English voice database has been developed, covering six basic emotions: happiness, sadness, disgust, fear, anger, and surprise. Because the target is content-independent emotion recognition, four short sentences that have the most natural meaning are adopted for illustration and analysis. Speech prosody is studied to identify the emotional features of voice. Variations in the samples' energy, duration, and pitch across emotions are compared. Spectrogram analysis is performed on some samples to observe the effect of formants. Duration, average energy, and pitch are found to provide some indication of the emotional content of speech, but they are not sufficient on their own to represent the emotions correctly. Even though the patterns for English and Malay samples differ slightly, it is still reasonable to assume that there are standard acoustic configurations for expressing particular emotions.
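The abstract names three prosodic measurements (duration, average energy, and pitch) without saying how they were computed. The following is only a minimal sketch of one common way to obtain such values from a mono waveform, not the authors' method. The function name `prosodic_features`, the mean-square definition of energy, the autocorrelation pitch estimator, and the 75 to 500 Hz search range are all assumptions made for illustration.

```python
import numpy as np


def prosodic_features(signal, sample_rate, fmin=75.0, fmax=500.0):
    """Return (duration in seconds, average energy, rough pitch in Hz).

    Hypothetical helper: treats the whole signal as one voiced segment,
    which is a simplification; real speech would be analysed frame by frame.
    """
    x = np.asarray(signal, dtype=float)

    # Duration is simply the number of samples divided by the sampling rate.
    duration = len(x) / sample_rate

    # Average energy taken here as the mean squared amplitude (an assumption).
    avg_energy = float(np.mean(x ** 2))

    # Rough pitch estimate: the lag of the strongest autocorrelation peak
    # within the plausible pitch-period range [1/fmax, 1/fmin].
    x0 = x - x.mean()
    ac = np.correlate(x0, x0, mode="full")[len(x0) - 1:]
    lag_min = int(sample_rate / fmax)
    lag_max = min(int(sample_rate / fmin), len(ac) - 1)
    lag = lag_min + int(np.argmax(ac[lag_min:lag_max + 1]))
    pitch = sample_rate / lag

    return duration, avg_energy, pitch
```

Applied to each recorded sentence, a helper like this would give one (duration, energy, pitch) triple per sample, which could then be compared across emotions and across the Malay and English recordings. Whole-utterance autocorrelation is used only to keep the sketch short.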