Sentiment Recognition of Italian Elderly through Domain Adaptation on Cross-corpus Speech Dataset

F. Gasparini, A. Grossi
{"title":"Sentiment Recognition of Italian Elderly through Domain Adaptation on Cross-corpus Speech Dataset","authors":"F. Gasparini, A. Grossi","doi":"10.48550/arXiv.2211.07307","DOIUrl":null,"url":null,"abstract":"The aim of this work is to define a speech emotion recognition (SER) model able to recognize positive, neutral and negative emotions in natural conversations of Italian elderly people. Several datasets for SER are available in the literature. However most of them are in English or Chinese, have been recorded while actors and actresses pronounce short phrases and thus are not related to natural conversation. Moreover only few speeches among all the databases are related to elderly people. Therefore, in this work, a multi-language and multi-age corpus is considered merging a dataset in English, that includes also elderly people, with a dataset in Italian. A general model, trained on young and adult English actors and actresses is proposed, based on XGBoost. Then two strategies of domain adaptation are proposed to adapt the model either to elderly people and to Italian speakers. The results suggest that this approach increases the classification performance, underlining also that new datasets should be collected.","PeriodicalId":308455,"journal":{"name":"AIxAS@AI*IA","volume":"195 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AIxAS@AI*IA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2211.07307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The aim of this work is to define a speech emotion recognition (SER) model able to recognize positive, neutral and negative emotions in natural conversations of Italian elderly people. Several datasets for SER are available in the literature. However most of them are in English or Chinese, have been recorded while actors and actresses pronounce short phrases and thus are not related to natural conversation. Moreover only few speeches among all the databases are related to elderly people. Therefore, in this work, a multi-language and multi-age corpus is considered merging a dataset in English, that includes also elderly people, with a dataset in Italian. A general model, trained on young and adult English actors and actresses is proposed, based on XGBoost. Then two strategies of domain adaptation are proposed to adapt the model either to elderly people and to Italian speakers. The results suggest that this approach increases the classification performance, underlining also that new datasets should be collected.
基于跨语料库语音数据集的意大利老年人情感识别
这项工作的目的是定义一个语音情感识别(SER)模型,能够识别意大利老年人自然对话中的积极、中性和消极情绪。文献中有几个SER的数据集。然而,大多数都是用英语或汉语录制的,演员们都是用简短的短语发音,因此与自然对话无关。此外,所有数据库中与老年人有关的演讲很少。因此,在这项工作中,考虑将英语数据集(也包括老年人)与意大利语数据集合并为一个多语言和多年龄的语料库。提出了一种基于XGBoost的通用模型,对青年和成年英语男女演员进行训练。然后提出了两种领域适应策略来适应老年人和意大利语使用者。结果表明,这种方法提高了分类性能,也强调了应该收集新的数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信