{"title":"Facial Expression Recognition and the Application of Supervised Contrastive Learning","authors":"Chenxin Yi","doi":"10.1109/ISAIEE57420.2022.00126","DOIUrl":null,"url":null,"abstract":"Facial expression is an essential part of communication in human life. In order to automating the process of facial expression recognition and improve the accuracy, in this paper, I explore the problem of facial expression recognition through the FER-2013 dataset with a new loss function—supervised contrastive learning. The goal is to classify similar features close together with the selection of different anchor points and positive and negative examples under the label of pre-classification. I address this task by Convolutional Neural Network, using both a shallow CNN model of my own as well as deeper models such as ResNet, VGG, and Inception. I fine-tuned these models and compared their performances on the dataset. Then I ensembled them to reach a better performance. As a result, I obtained a final model that reaches 70.24% accuracy on the test set, beating two baselines (human recognition rate and null model accuracy) proposed by previous works.","PeriodicalId":345703,"journal":{"name":"2022 International Symposium on Advances in Informatics, Electronics and Education (ISAIEE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Advances in Informatics, Electronics and Education (ISAIEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISAIEE57420.2022.00126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Facial expression is an essential part of communication in human life. In order to automating the process of facial expression recognition and improve the accuracy, in this paper, I explore the problem of facial expression recognition through the FER-2013 dataset with a new loss function—supervised contrastive learning. The goal is to classify similar features close together with the selection of different anchor points and positive and negative examples under the label of pre-classification. I address this task by Convolutional Neural Network, using both a shallow CNN model of my own as well as deeper models such as ResNet, VGG, and Inception. I fine-tuned these models and compared their performances on the dataset. Then I ensembled them to reach a better performance. As a result, I obtained a final model that reaches 70.24% accuracy on the test set, beating two baselines (human recognition rate and null model accuracy) proposed by previous works.