Zhiyuan Li, Shizhong Han, Ahmed-Shehab Khan, Jie Cai, Zibo Meng, James O'Reilly, Yan Tong
{"title":"Pooling Map Adaptation in Convolutional Neural Network for Facial Expression Recognition","authors":"Zhiyuan Li, Shizhong Han, Ahmed-Shehab Khan, Jie Cai, Zibo Meng, James O'Reilly, Yan Tong","doi":"10.1109/ICME.2019.00194","DOIUrl":null,"url":null,"abstract":"In this work, we proposed adaptive pooling maps (APMs) for CNNs to aid facial expression recognition. Inspired by superpixels, which represent the image content more naturally, pooling maps consisting of irregular pooling regions are learned from training images as part of training a CNN model. The APMs preserve the local structural information and thus are more capable of capturing subtle facial appearance and geometrical changes caused by facial expression. Furthermore, we developed an efficient algorithm to learn the APMs efficiently. Experiments on three benchmark datasets have shown that the proposed APM-based CNN model outperforms the one with the standard pooling map and achieves state-of-the-art recognition performance for facial expression recognition in the wild.","PeriodicalId":106832,"journal":{"name":"2019 IEEE International Conference on Multimedia and Expo (ICME)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Multimedia and Expo (ICME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME.2019.00194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
Abstract
In this work, we proposed adaptive pooling maps (APMs) for CNNs to aid facial expression recognition. Inspired by superpixels, which represent the image content more naturally, pooling maps consisting of irregular pooling regions are learned from training images as part of training a CNN model. The APMs preserve the local structural information and thus are more capable of capturing subtle facial appearance and geometrical changes caused by facial expression. Furthermore, we developed an efficient algorithm to learn the APMs efficiently. Experiments on three benchmark datasets have shown that the proposed APM-based CNN model outperforms the one with the standard pooling map and achieves state-of-the-art recognition performance for facial expression recognition in the wild.