Kaimin Yu, Zhiyong Wang, Genliang Guan, Qiuxia Wu, Z. Chi, D. Feng
{"title":"面部表情识别需要多少帧?","authors":"Kaimin Yu, Zhiyong Wang, Genliang Guan, Qiuxia Wu, Z. Chi, D. Feng","doi":"10.1109/ICMEW.2012.56","DOIUrl":null,"url":null,"abstract":"Facial expression analysis is essential to enable socially intelligent processing of multimedia video content. Most facial expression recognition algorithms generally analyze the whole image sequence of an expression to exploit its temporal characteristics. However, it is seldom studied whether it is necessary to utilize all the frames of a sequence, since human beings are able to capture the dynamics of facial expressions from very short sequences (even only one frame). In this paper, we investigate the impact of the number of frames in a facial expression sequence on facial expression recognition accuracy. In particular, we develop a key frame selection method through key point based frame representation. Experimental results on the popular CK facial expression dataset indicate that recognition accuracy achieved with half of the sequence frames is comparable to that of utilizing all the sequence frames. Our key frame selection method can further reduce the number of frames without clearly compromising recognition accuracy.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"How Many Frames Does Facial Expression Recognition Require?\",\"authors\":\"Kaimin Yu, Zhiyong Wang, Genliang Guan, Qiuxia Wu, Z. Chi, D. Feng\",\"doi\":\"10.1109/ICMEW.2012.56\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Facial expression analysis is essential to enable socially intelligent processing of multimedia video content. Most facial expression recognition algorithms generally analyze the whole image sequence of an expression to exploit its temporal characteristics. However, it is seldom studied whether it is necessary to utilize all the frames of a sequence, since human beings are able to capture the dynamics of facial expressions from very short sequences (even only one frame). In this paper, we investigate the impact of the number of frames in a facial expression sequence on facial expression recognition accuracy. In particular, we develop a key frame selection method through key point based frame representation. Experimental results on the popular CK facial expression dataset indicate that recognition accuracy achieved with half of the sequence frames is comparable to that of utilizing all the sequence frames. Our key frame selection method can further reduce the number of frames without clearly compromising recognition accuracy.\",\"PeriodicalId\":385797,\"journal\":{\"name\":\"2012 IEEE International Conference on Multimedia and Expo Workshops\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Multimedia and Expo Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMEW.2012.56\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Multimedia and Expo Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW.2012.56","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
How Many Frames Does Facial Expression Recognition Require?
Facial expression analysis is essential to enable socially intelligent processing of multimedia video content. Most facial expression recognition algorithms generally analyze the whole image sequence of an expression to exploit its temporal characteristics. However, it is seldom studied whether it is necessary to utilize all the frames of a sequence, since human beings are able to capture the dynamics of facial expressions from very short sequences (even only one frame). In this paper, we investigate the impact of the number of frames in a facial expression sequence on facial expression recognition accuracy. In particular, we develop a key frame selection method through key point based frame representation. Experimental results on the popular CK facial expression dataset indicate that recognition accuracy achieved with half of the sequence frames is comparable to that of utilizing all the sequence frames. Our key frame selection method can further reduce the number of frames without clearly compromising recognition accuracy.