视频流中的实时情感识别，使用B-CNN和F-CNN

2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) Pub Date : 2020-09-01 DOI:10.1109/ATSIP49331.2020.9231902

R. Guetari, A. Chetouani, Hedi Tabia, Nawrès Khlifa

{"title":"视频流中的实时情感识别，使用B-CNN和F-CNN","authors":"R. Guetari, A. Chetouani, Hedi Tabia, Nawrès Khlifa","doi":"10.1109/ATSIP49331.2020.9231902","DOIUrl":null,"url":null,"abstract":"Despite the diversity of methods developed in recent years, the implementation of an efficient system for automatic recognition of facial emotions remains a technological challenge that has not been fully resolved. Many problems have not yet been resolved. The occlusion problem remains a challenge today for the research community, certain features characterizing several different emotions may seem similar, etc. High performing and precise techniques are therefore necessary to perfectly distinguish between two different emotions, even though they might be difficult to distinguish. The objective of this work is the development of an automatic method for recognizing basic facial emotions (joy, anger, sadness, disgust, surprise, fear and neutral) in video streams. The method of deep learning, known for its great performance in image classification, becomes essential. In order to be able to benefit from several feature maps at the same time, we propose to use two techniques: bilinear pooling (B-CNN), and Fusion Feature Net (F-CNN). This technique is more efficient and more precise than conventional techniques, whether based on deep learning or not.","PeriodicalId":384018,"journal":{"name":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Real time emotion recognition in video stream, using B-CNN and F-CNN\",\"authors\":\"R. Guetari, A. Chetouani, Hedi Tabia, Nawrès Khlifa\",\"doi\":\"10.1109/ATSIP49331.2020.9231902\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite the diversity of methods developed in recent years, the implementation of an efficient system for automatic recognition of facial emotions remains a technological challenge that has not been fully resolved. Many problems have not yet been resolved. The occlusion problem remains a challenge today for the research community, certain features characterizing several different emotions may seem similar, etc. High performing and precise techniques are therefore necessary to perfectly distinguish between two different emotions, even though they might be difficult to distinguish. The objective of this work is the development of an automatic method for recognizing basic facial emotions (joy, anger, sadness, disgust, surprise, fear and neutral) in video streams. The method of deep learning, known for its great performance in image classification, becomes essential. In order to be able to benefit from several feature maps at the same time, we propose to use two techniques: bilinear pooling (B-CNN), and Fusion Feature Net (F-CNN). This technique is more efficient and more precise than conventional techniques, whether based on deep learning or not.\",\"PeriodicalId\":384018,\"journal\":{\"name\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ATSIP49331.2020.9231902\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP49331.2020.9231902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

尽管近年来开发了多种方法，但实现一个有效的面部情绪自动识别系统仍然是一个尚未完全解决的技术挑战。许多问题还没有解决。遮挡问题仍然是当今研究界的一个挑战，表征几种不同情绪的某些特征可能看起来相似，等等。因此，要完美地区分两种不同的情绪，高性能和精确的技术是必要的，尽管它们可能很难区分。这项工作的目标是开发一种自动识别视频流中基本面部情绪(喜悦、愤怒、悲伤、厌恶、惊讶、恐惧和中性)的方法。以其在图像分类方面的出色表现而闻名的深度学习方法变得至关重要。为了能够同时从多个特征映射中获益，我们建议使用两种技术:双线性池化(B-CNN)和融合特征网(F-CNN)。无论是否基于深度学习，这种技术都比传统技术更高效、更精确。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Real time emotion recognition in video stream, using B-CNN and F-CNN

Despite the diversity of methods developed in recent years, the implementation of an efficient system for automatic recognition of facial emotions remains a technological challenge that has not been fully resolved. Many problems have not yet been resolved. The occlusion problem remains a challenge today for the research community, certain features characterizing several different emotions may seem similar, etc. High performing and precise techniques are therefore necessary to perfectly distinguish between two different emotions, even though they might be difficult to distinguish. The objective of this work is the development of an automatic method for recognizing basic facial emotions (joy, anger, sadness, disgust, surprise, fear and neutral) in video streams. The method of deep learning, known for its great performance in image classification, becomes essential. In order to be able to benefit from several feature maps at the same time, we propose to use two techniques: bilinear pooling (B-CNN), and Fusion Feature Net (F-CNN). This technique is more efficient and more precise than conventional techniques, whether based on deep learning or not.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

自引率

0.00%

发文量