Online facial expression recognition based on graph convolution and long short memory networks

IF 0.9 Q4 TELECOMMUNICATIONS

Internet Technology Letters, Pub Date: 2023-02-16, DOI: 10.1002/itl2.415

Chujie Xu, Wenjie Zheng, Yong Du, Tiejun Li, Zhansheng Yuan


Citations: 0

Abstract


Online facial expression recognition based on graph convolution and long short memory networks

Video-based facial expression recognition (FER) models have achieved higher accuracy at the cost of more computation, which makes them unsuitable for online deployment on mobile intelligent terminals. Facial landmarks can model changes in facial expression through their spatial locations rather than through texture features, but classical convolution cannot make full use of landmark information. To this end, we propose GELSTM, a novel long short-term memory (LSTM) network with embedded graph convolution, for online video-based FER on mobile intelligent terminals. Specifically, landmark-based face graph data are constructed on the client. On the server side, graph convolution mines the spatial dependencies in the landmark-based facial graph, and the extracted landmark features are then fed to the LSTM for temporal feature aggregation. Experiments on a facial expression dataset show that the proposed method achieves superior performance compared with other deep models.
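The pipeline described in the abstract (per-frame graph convolution over a landmark graph, followed by an LSTM over the frame sequence) can be illustrated with a minimal pure-Python sketch. This is not the authors' GELSTM implementation: the abstract gives no layer definitions, so the symmetric normalization D^-1/2 (A + I) D^-1/2, the mean pooling over landmarks, the bias-free LSTM cell, the toy 5-landmark chain graph, the 7 output classes, the random weights, and every function name below are assumptions chosen only for illustration.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def normalized_adjacency(num_nodes, edges):
    """Return D^-1/2 (A + I) D^-1/2 for an undirected graph (assumed normalization)."""
    a = [[1.0 if i == j else 0.0 for j in range(num_nodes)] for i in range(num_nodes)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(num_nodes)]
            for i in range(num_nodes)]

def graph_conv(a_hat, x, w):
    """One graph convolution layer: ReLU(A_hat @ X @ W)."""
    h = matmul(matmul(a_hat, x), w)
    return [[max(0.0, v) for v in row] for row in h]

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def lstm_step(x, h, c, w):
    """One LSTM step; w maps gate name -> weights over [x, h] (biases omitted)."""
    z = [x + h]  # concatenate input and previous hidden state into a 1-row matrix
    gates = {name: matmul(z, w[name])[0] for name in ("i", "f", "o", "g")}
    i = [sigmoid(v) for v in gates["i"]]    # input gate
    f = [sigmoid(v) for v in gates["f"]]    # forget gate
    o = [sigmoid(v) for v in gates["o"]]    # output gate
    g = [math.tanh(v) for v in gates["g"]]  # candidate cell state
    c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
    h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
    return h_new, c_new

def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def classify_clip(frames, edges, params):
    """Spatial features per frame via graph convolution, then temporal aggregation via LSTM."""
    num_nodes = len(frames[0])
    a_hat = normalized_adjacency(num_nodes, edges)
    hidden = len(params["lstm"]["i"][0])
    h, c = [0.0] * hidden, [0.0] * hidden
    for landmarks in frames:
        node_feats = graph_conv(a_hat, landmarks, params["gcn"])
        # Mean-pool node features into one vector per frame (assumed readout).
        frame_feat = [sum(col) / num_nodes for col in zip(*node_feats)]
        h, c = lstm_step(frame_feat, h, c, params["lstm"])
    logits = matmul([h], params["out"])[0]
    return max(range(len(logits)), key=logits.__getitem__), logits

# Toy setup: 5 landmarks with (x, y) coordinates, 6 frames, 7 hypothetical classes.
rng = random.Random(0)
NUM_NODES, GCN_DIM, HIDDEN, NUM_CLASSES = 5, 4, 3, 7
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]  # hypothetical chain of landmarks
params = {
    "gcn": random_matrix(2, GCN_DIM, rng),
    "lstm": {name: random_matrix(GCN_DIM + HIDDEN, HIDDEN, rng) for name in "ifog"},
    "out": random_matrix(HIDDEN, NUM_CLASSES, rng),
}
frames = [[[rng.random(), rng.random()] for _ in range(NUM_NODES)] for _ in range(6)]
label, logits = classify_clip(frames, edges, params)
print("predicted class index:", label)
```

In a client/server split like the one the abstract outlines, only `frames` (landmark coordinates per frame) would need to leave the device, while `classify_clip` would run on the server; the weights here are untrained, so the predicted index carries no meaning.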


Source journal

Internet Technology Letters

CiteScore

3.10

Self-citation rate

0.00%

Number of publications