R. Guetari, A. Chetouani, Hedi Tabia, Nawrès Khlifa
{"title":"视频流中的实时情感识别,使用B-CNN和F-CNN","authors":"R. Guetari, A. Chetouani, Hedi Tabia, Nawrès Khlifa","doi":"10.1109/ATSIP49331.2020.9231902","DOIUrl":null,"url":null,"abstract":"Despite the diversity of methods developed in recent years, the implementation of an efficient system for automatic recognition of facial emotions remains a technological challenge that has not been fully resolved. Many problems have not yet been resolved. The occlusion problem remains a challenge today for the research community, certain features characterizing several different emotions may seem similar, etc. High performing and precise techniques are therefore necessary to perfectly distinguish between two different emotions, even though they might be difficult to distinguish. The objective of this work is the development of an automatic method for recognizing basic facial emotions (joy, anger, sadness, disgust, surprise, fear and neutral) in video streams. The method of deep learning, known for its great performance in image classification, becomes essential. In order to be able to benefit from several feature maps at the same time, we propose to use two techniques: bilinear pooling (B-CNN), and Fusion Feature Net (F-CNN). This technique is more efficient and more precise than conventional techniques, whether based on deep learning or not.","PeriodicalId":384018,"journal":{"name":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Real time emotion recognition in video stream, using B-CNN and F-CNN\",\"authors\":\"R. Guetari, A. Chetouani, Hedi Tabia, Nawrès Khlifa\",\"doi\":\"10.1109/ATSIP49331.2020.9231902\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite the diversity of methods developed in recent years, the implementation of an efficient system for automatic recognition of facial emotions remains a technological challenge that has not been fully resolved. Many problems have not yet been resolved. The occlusion problem remains a challenge today for the research community, certain features characterizing several different emotions may seem similar, etc. High performing and precise techniques are therefore necessary to perfectly distinguish between two different emotions, even though they might be difficult to distinguish. The objective of this work is the development of an automatic method for recognizing basic facial emotions (joy, anger, sadness, disgust, surprise, fear and neutral) in video streams. The method of deep learning, known for its great performance in image classification, becomes essential. In order to be able to benefit from several feature maps at the same time, we propose to use two techniques: bilinear pooling (B-CNN), and Fusion Feature Net (F-CNN). This technique is more efficient and more precise than conventional techniques, whether based on deep learning or not.\",\"PeriodicalId\":384018,\"journal\":{\"name\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ATSIP49331.2020.9231902\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP49331.2020.9231902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real time emotion recognition in video stream, using B-CNN and F-CNN
Despite the diversity of methods developed in recent years, the implementation of an efficient system for automatic recognition of facial emotions remains a technological challenge that has not been fully resolved. Many problems have not yet been resolved. The occlusion problem remains a challenge today for the research community, certain features characterizing several different emotions may seem similar, etc. High performing and precise techniques are therefore necessary to perfectly distinguish between two different emotions, even though they might be difficult to distinguish. The objective of this work is the development of an automatic method for recognizing basic facial emotions (joy, anger, sadness, disgust, surprise, fear and neutral) in video streams. The method of deep learning, known for its great performance in image classification, becomes essential. In order to be able to benefit from several feature maps at the same time, we propose to use two techniques: bilinear pooling (B-CNN), and Fusion Feature Net (F-CNN). This technique is more efficient and more precise than conventional techniques, whether based on deep learning or not.