Bimodal Emotion Recognition using Kernel Canonical Correlation Analysis and Multiple Kernel Learning

Jingjie Yan, Weigen Qiu
{"title":"Bimodal Emotion Recognition using Kernel Canonical Correlation Analysis and Multiple Kernel Learning","authors":"Jingjie Yan, Weigen Qiu","doi":"10.1109/CISP-BMEI53629.2021.9624428","DOIUrl":null,"url":null,"abstract":"Bimodal emotion recognition on account of kernel canonical correlation analysis (KCCA) and multiple kernel learning (MKL) is investigated and utilized to discover the befitting and effectual fusion pattern with respect to facial expression channel and body gesture channel in the form of video data. Firstly, to relieve calculated quantity of the posterior fusion and classification procedure, the two groups of quondam facial expression and body gesture video data are switched to be indicated as the form of lower dimensional histogram spatio-temporal emotion vectors respectively by Dollar's spatio-temporal feature. Then, KCCA-MKL in the form of multiple kernels is adopted to portray the nonlinear character of facial expression and body gesture video data, and simultaneously to search two modalities' conjunct nonlinear correlative structures by considering the disadvantage of the signal kernel used in KCCA. The rudimentary idea of the KCCA-MKL method is using multiple kernels with the combination of gaussian kernel and $\\chi^{2}$ kernel to substitute for the signal kernel in KCCA. In experiment step, some types of the combination of the gaussian kernel and the $\\chi^{2}$ kernel are implemented in KCCA-MKL. The test results display that the classification accuracy of the KCCA-MKL approach is 56.91% using the KNN classifier, and is better than two unimodal methods and signal kernel method. Consequently, KCCA-MKL is more unfailing and efficient for bimodal emotion recognition.","PeriodicalId":131256,"journal":{"name":"2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI53629.2021.9624428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Bimodal emotion recognition on account of kernel canonical correlation analysis (KCCA) and multiple kernel learning (MKL) is investigated and utilized to discover the befitting and effectual fusion pattern with respect to facial expression channel and body gesture channel in the form of video data. Firstly, to relieve calculated quantity of the posterior fusion and classification procedure, the two groups of quondam facial expression and body gesture video data are switched to be indicated as the form of lower dimensional histogram spatio-temporal emotion vectors respectively by Dollar's spatio-temporal feature. Then, KCCA-MKL in the form of multiple kernels is adopted to portray the nonlinear character of facial expression and body gesture video data, and simultaneously to search two modalities' conjunct nonlinear correlative structures by considering the disadvantage of the signal kernel used in KCCA. The rudimentary idea of the KCCA-MKL method is using multiple kernels with the combination of gaussian kernel and $\chi^{2}$ kernel to substitute for the signal kernel in KCCA. In experiment step, some types of the combination of the gaussian kernel and the $\chi^{2}$ kernel are implemented in KCCA-MKL. The test results display that the classification accuracy of the KCCA-MKL approach is 56.91% using the KNN classifier, and is better than two unimodal methods and signal kernel method. Consequently, KCCA-MKL is more unfailing and efficient for bimodal emotion recognition.
基于核典型相关分析和多核学习的双峰情绪识别
研究了基于核典型相关分析(KCCA)和多核学习(MKL)的双峰情感识别方法,并利用该方法发现了视频数据中面部表情通道和肢体动作通道的合适且有效的融合模式。首先,为了减轻后路融合和分类过程的计算量,利用Dollar时空特征将两组原始面部表情和肢体动作视频数据分别转换为低维直方图时空情感向量的形式。然后,采用多核形式的KCCA- mkl来刻画面部表情和肢体动作视频数据的非线性特征,同时考虑到KCCA信号核的缺点,搜索两模态的非线性相关结构。KCCA- mkl方法的基本思想是使用高斯核和$\chi^{2}$核组合的多核来代替KCCA中的信号核。在实验步骤中,在KCCA-MKL中实现了几种高斯核和$\chi^{2}$核的组合。测试结果表明,使用KNN分类器的KCCA-MKL方法的分类准确率为56.91%,优于两种单峰方法和信号核方法。因此,KCCA-MKL在双峰情绪识别中更加可靠和有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信