{"title":"Micro-expression Recognition Based on Attention-enhanced LSTM Neural Networks","authors":"Shiqi Xu, Fen Xu","doi":"10.1145/3529836.3529898","DOIUrl":null,"url":null,"abstract":"Micro-expression recognition is a difficult task in computer vision. Most existing micro-expression recognition methods extract facial features globally, leading to the inclusion of many irrelevant features and affecting the recognition accuracy in a negative way. In this paper, Long Short-Term Memory (LSTM) neural networks with spatial and temporal attention mechanisms are designed and employed to extract features selectively from the input sequences. Key frames are identified from the original micro-expression sequences at first. Then the VGG-Face model is used to extract the spatial features of those key frames. The spatial features of the micro-expression sequences are then fed into attention-enhanced long short-term memory neural networks, using a softmax function for the final classification. Our experiments with CASME II show that the attention-enhanced LSTM models improve the accuracy of micro-expression recognition significantly, compared to the results of several other leading methods.","PeriodicalId":285191,"journal":{"name":"2022 14th International Conference on Machine Learning and Computing (ICMLC)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Machine Learning and Computing (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3529836.3529898","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Micro-expression recognition is a difficult task in computer vision. Most existing micro-expression recognition methods extract facial features globally, leading to the inclusion of many irrelevant features and affecting the recognition accuracy in a negative way. In this paper, Long Short-Term Memory (LSTM) neural networks with spatial and temporal attention mechanisms are designed and employed to extract features selectively from the input sequences. Key frames are identified from the original micro-expression sequences at first. Then the VGG-Face model is used to extract the spatial features of those key frames. The spatial features of the micro-expression sequences are then fed into attention-enhanced long short-term memory neural networks, using a softmax function for the final classification. Our experiments with CASME II show that the attention-enhanced LSTM models improve the accuracy of micro-expression recognition significantly, compared to the results of several other leading methods.