J. B. Alonso, Josue Cabrera, Miguel A. Ferrer, J. M. Canino, C. Travieso, M. Dutta, Anushikha Singh
{"title":"Emotional speech characterization for real time applications in real environments","authors":"J. B. Alonso, Josue Cabrera, Miguel A. Ferrer, J. M. Canino, C. Travieso, M. Dutta, Anushikha Singh","doi":"10.1109/MEDCOM.2014.7005994","DOIUrl":null,"url":null,"abstract":"A simple and effective method of automatic discrimination between emotional and unemotional speech is presented. Traditional methods of emotional discrimination use prosodic and paralinguistic features, which are determined by a linguistic segmentation of the locution. However, these methods are not appropriate in real time applications because of their high computational cost and the linguistic segmentation requirement by locutions. This letter proposes a new strategy based on a few prosodic and paralinguistic features set obtained from a temporal segmentation of the speech signal. This new strategy is robust to interfering noises that are present in real environments, offering a low computational cost and improving the performance of a segmentation based on linguistic aspects.","PeriodicalId":246177,"journal":{"name":"2014 International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom)","volume":"200 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MEDCOM.2014.7005994","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A simple and effective method of automatic discrimination between emotional and unemotional speech is presented. Traditional methods of emotional discrimination use prosodic and paralinguistic features, which are determined by a linguistic segmentation of the locution. However, these methods are not appropriate in real time applications because of their high computational cost and the linguistic segmentation requirement by locutions. This letter proposes a new strategy based on a few prosodic and paralinguistic features set obtained from a temporal segmentation of the speech signal. This new strategy is robust to interfering noises that are present in real environments, offering a low computational cost and improving the performance of a segmentation based on linguistic aspects.