Bimodal Emotion Recognition using Kernel Canonical Correlation Analysis and Multiple Kernel Learning

2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) Pub Date : 2021-10-23 DOI:10.1109/CISP-BMEI53629.2021.9624428

Jingjie Yan, Weigen Qiu

{"title":"Bimodal Emotion Recognition using Kernel Canonical Correlation Analysis and Multiple Kernel Learning","authors":"Jingjie Yan, Weigen Qiu","doi":"10.1109/CISP-BMEI53629.2021.9624428","DOIUrl":null,"url":null,"abstract":"Bimodal emotion recognition on account of kernel canonical correlation analysis (KCCA) and multiple kernel learning (MKL) is investigated and utilized to discover the befitting and effectual fusion pattern with respect to facial expression channel and body gesture channel in the form of video data. Firstly, to relieve calculated quantity of the posterior fusion and classification procedure, the two groups of quondam facial expression and body gesture video data are switched to be indicated as the form of lower dimensional histogram spatio-temporal emotion vectors respectively by Dollar's spatio-temporal feature. Then, KCCA-MKL in the form of multiple kernels is adopted to portray the nonlinear character of facial expression and body gesture video data, and simultaneously to search two modalities' conjunct nonlinear correlative structures by considering the disadvantage of the signal kernel used in KCCA. The rudimentary idea of the KCCA-MKL method is using multiple kernels with the combination of gaussian kernel and $\\chi^{2}$ kernel to substitute for the signal kernel in KCCA. In experiment step, some types of the combination of the gaussian kernel and the $\\chi^{2}$ kernel are implemented in KCCA-MKL. The test results display that the classification accuracy of the KCCA-MKL approach is 56.91% using the KNN classifier, and is better than two unimodal methods and signal kernel method. Consequently, KCCA-MKL is more unfailing and efficient for bimodal emotion recognition.","PeriodicalId":131256,"journal":{"name":"2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI53629.2021.9624428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Bimodal emotion recognition on account of kernel canonical correlation analysis (KCCA) and multiple kernel learning (MKL) is investigated and utilized to discover the befitting and effectual fusion pattern with respect to facial expression channel and body gesture channel in the form of video data. Firstly, to relieve calculated quantity of the posterior fusion and classification procedure, the two groups of quondam facial expression and body gesture video data are switched to be indicated as the form of lower dimensional histogram spatio-temporal emotion vectors respectively by Dollar's spatio-temporal feature. Then, KCCA-MKL in the form of multiple kernels is adopted to portray the nonlinear character of facial expression and body gesture video data, and simultaneously to search two modalities' conjunct nonlinear correlative structures by considering the disadvantage of the signal kernel used in KCCA. The rudimentary idea of the KCCA-MKL method is using multiple kernels with the combination of gaussian kernel and $\chi^{2}$ kernel to substitute for the signal kernel in KCCA. In experiment step, some types of the combination of the gaussian kernel and the $\chi^{2}$ kernel are implemented in KCCA-MKL. The test results display that the classification accuracy of the KCCA-MKL approach is 56.91% using the KNN classifier, and is better than two unimodal methods and signal kernel method. Consequently, KCCA-MKL is more unfailing and efficient for bimodal emotion recognition.

查看原文本刊更多论文

基于核典型相关分析和多核学习的双峰情绪识别

研究了基于核典型相关分析(KCCA)和多核学习(MKL)的双峰情感识别方法，并利用该方法发现了视频数据中面部表情通道和肢体动作通道的合适且有效的融合模式。首先，为了减轻后路融合和分类过程的计算量，利用Dollar时空特征将两组原始面部表情和肢体动作视频数据分别转换为低维直方图时空情感向量的形式。然后，采用多核形式的KCCA- mkl来刻画面部表情和肢体动作视频数据的非线性特征，同时考虑到KCCA信号核的缺点，搜索两模态的非线性相关结构。KCCA- mkl方法的基本思想是使用高斯核和$\chi^{2}$核组合的多核来代替KCCA中的信号核。在实验步骤中，在KCCA-MKL中实现了几种高斯核和$\chi^{2}$核的组合。测试结果表明，使用KNN分类器的KCCA-MKL方法的分类准确率为56.91%，优于两种单峰方法和信号核方法。因此，KCCA-MKL在双峰情绪识别中更加可靠和有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

自引率

0.00%

发文量