{"title":"真实世界环境中的三属性感知器面部表情识别","authors":"Wei-Yen Hsu;Ting-Hsuan Chiang","doi":"10.1109/TCE.2024.3519514","DOIUrl":null,"url":null,"abstract":"Facial expression recognition (FER) has become a prominent research area due to its wide range of applications, such as human-robot interaction (HRI), driver state monitoring, and medical diagnosis. However, real-world environments pose significant challenges to FER, including occlusion, variations in lighting, and different angles. In this study, a novel triple-attribute perceptron network (TAPNet) is proposed to tackle the limited effectiveness of FER in real-world environments. TAPNet improves FER performance in real-world environments by effectively leveraging triple-attribute facial features from global, local, and critical subregions, thereby fully exploiting the diverse potential information provided by each facial attribute, similar to the human face perception mechanism that extracts both global and regional information. Specifically, the global facial perception (GFP) module emphasizes the most important facial features in the overall face by increasing the number of channels to preserve features and assigning weights to different channels. Additionally, the local facial perception (LFP) and critical facial perception (CFP) modules capture regional feature information from local and critical facial features, respectively, focusing on fine-grained regional features and minimizing interference from irrelevant regions during feature extraction. The experimental results indicate that the proposed TAPNet model achieves an accuracy of 90.77% on the RAF-DB dataset and 65.13% on the AffectNet-7 dataset. Moreover, this model also demonstrates promising FER performance compared to the state-of-the-art approaches on several real-world datasets.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"71 1","pages":"608-620"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Triple-Attribute Perceptron Facial Expression Recognition in Real-World Environments\",\"authors\":\"Wei-Yen Hsu;Ting-Hsuan Chiang\",\"doi\":\"10.1109/TCE.2024.3519514\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Facial expression recognition (FER) has become a prominent research area due to its wide range of applications, such as human-robot interaction (HRI), driver state monitoring, and medical diagnosis. However, real-world environments pose significant challenges to FER, including occlusion, variations in lighting, and different angles. In this study, a novel triple-attribute perceptron network (TAPNet) is proposed to tackle the limited effectiveness of FER in real-world environments. TAPNet improves FER performance in real-world environments by effectively leveraging triple-attribute facial features from global, local, and critical subregions, thereby fully exploiting the diverse potential information provided by each facial attribute, similar to the human face perception mechanism that extracts both global and regional information. Specifically, the global facial perception (GFP) module emphasizes the most important facial features in the overall face by increasing the number of channels to preserve features and assigning weights to different channels. Additionally, the local facial perception (LFP) and critical facial perception (CFP) modules capture regional feature information from local and critical facial features, respectively, focusing on fine-grained regional features and minimizing interference from irrelevant regions during feature extraction. The experimental results indicate that the proposed TAPNet model achieves an accuracy of 90.77% on the RAF-DB dataset and 65.13% on the AffectNet-7 dataset. Moreover, this model also demonstrates promising FER performance compared to the state-of-the-art approaches on several real-world datasets.\",\"PeriodicalId\":13208,\"journal\":{\"name\":\"IEEE Transactions on Consumer Electronics\",\"volume\":\"71 1\",\"pages\":\"608-620\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Consumer Electronics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10806792/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10806792/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Triple-Attribute Perceptron Facial Expression Recognition in Real-World Environments
Facial expression recognition (FER) has become a prominent research area due to its wide range of applications, such as human-robot interaction (HRI), driver state monitoring, and medical diagnosis. However, real-world environments pose significant challenges to FER, including occlusion, variations in lighting, and different angles. In this study, a novel triple-attribute perceptron network (TAPNet) is proposed to tackle the limited effectiveness of FER in real-world environments. TAPNet improves FER performance in real-world environments by effectively leveraging triple-attribute facial features from global, local, and critical subregions, thereby fully exploiting the diverse potential information provided by each facial attribute, similar to the human face perception mechanism that extracts both global and regional information. Specifically, the global facial perception (GFP) module emphasizes the most important facial features in the overall face by increasing the number of channels to preserve features and assigning weights to different channels. Additionally, the local facial perception (LFP) and critical facial perception (CFP) modules capture regional feature information from local and critical facial features, respectively, focusing on fine-grained regional features and minimizing interference from irrelevant regions during feature extraction. The experimental results indicate that the proposed TAPNet model achieves an accuracy of 90.77% on the RAF-DB dataset and 65.13% on the AffectNet-7 dataset. Moreover, this model also demonstrates promising FER performance compared to the state-of-the-art approaches on several real-world datasets.
期刊介绍:
The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.