Yuandong Min , Ruyi Xu , Jingying Chen , Yanfeng Ji , Xiaodi Liu
{"title":"鲁棒面部表情识别,同时处理硬和错误标记的样本","authors":"Yuandong Min , Ruyi Xu , Jingying Chen , Yanfeng Ji , Xiaodi Liu","doi":"10.1016/j.patcog.2025.112026","DOIUrl":null,"url":null,"abstract":"<div><div>Facial Expression Recognition (FER) in the wild is a challenging task, especially when training data contains numerous mislabeled samples and hard samples. Typically, FER models either overfit to the mislabeled samples or underfit to the hard samples, resulting in degraded performance. However, most existing methods fail to address both issues simultaneously. To overcome this limitation, this paper introduces a novel FER method called Noise-Hard robust Graph (NHG), which dynamically supervises the updating of the adjacency matrix in the Graph Convolutional Networks, striking a balance between suppressing the impacts of noisy labels and encouraging learning from hard samples. First, we map high-dimensional facial expression features onto low-dimensional manifolds to initialize the topological relationships between the samples, thus measuring the hard sample relationships more accurately. Second, we design a Label Consistency Mask (LCM) strategy to retain potential connections for hard sample learning. LCM could also potentially preserve correct connections while noisy labels exist, supporting noise-robust learning. Third, based on differences in trends of <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>-norm variation between mislabeled samples and hard samples, the <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>-norm regularization (L2R) suppresses the learning of mislabeled samples while preserving the learning potential of hard samples and suppressing the propagation of their features within the graph. Experimental results demonstrate that our method achieves competitive performance compared to state-of-the-art methods in scenarios with noisy labels and hard samples.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"170 ","pages":"Article 112026"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robust facial expression recognition by simultaneously addressing hard and mislabeled samples\",\"authors\":\"Yuandong Min , Ruyi Xu , Jingying Chen , Yanfeng Ji , Xiaodi Liu\",\"doi\":\"10.1016/j.patcog.2025.112026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Facial Expression Recognition (FER) in the wild is a challenging task, especially when training data contains numerous mislabeled samples and hard samples. Typically, FER models either overfit to the mislabeled samples or underfit to the hard samples, resulting in degraded performance. However, most existing methods fail to address both issues simultaneously. To overcome this limitation, this paper introduces a novel FER method called Noise-Hard robust Graph (NHG), which dynamically supervises the updating of the adjacency matrix in the Graph Convolutional Networks, striking a balance between suppressing the impacts of noisy labels and encouraging learning from hard samples. First, we map high-dimensional facial expression features onto low-dimensional manifolds to initialize the topological relationships between the samples, thus measuring the hard sample relationships more accurately. Second, we design a Label Consistency Mask (LCM) strategy to retain potential connections for hard sample learning. LCM could also potentially preserve correct connections while noisy labels exist, supporting noise-robust learning. Third, based on differences in trends of <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>-norm variation between mislabeled samples and hard samples, the <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>-norm regularization (L2R) suppresses the learning of mislabeled samples while preserving the learning potential of hard samples and suppressing the propagation of their features within the graph. Experimental results demonstrate that our method achieves competitive performance compared to state-of-the-art methods in scenarios with noisy labels and hard samples.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"170 \",\"pages\":\"Article 112026\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325006867\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325006867","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Robust facial expression recognition by simultaneously addressing hard and mislabeled samples
Facial Expression Recognition (FER) in the wild is a challenging task, especially when training data contains numerous mislabeled samples and hard samples. Typically, FER models either overfit to the mislabeled samples or underfit to the hard samples, resulting in degraded performance. However, most existing methods fail to address both issues simultaneously. To overcome this limitation, this paper introduces a novel FER method called Noise-Hard robust Graph (NHG), which dynamically supervises the updating of the adjacency matrix in the Graph Convolutional Networks, striking a balance between suppressing the impacts of noisy labels and encouraging learning from hard samples. First, we map high-dimensional facial expression features onto low-dimensional manifolds to initialize the topological relationships between the samples, thus measuring the hard sample relationships more accurately. Second, we design a Label Consistency Mask (LCM) strategy to retain potential connections for hard sample learning. LCM could also potentially preserve correct connections while noisy labels exist, supporting noise-robust learning. Third, based on differences in trends of -norm variation between mislabeled samples and hard samples, the -norm regularization (L2R) suppresses the learning of mislabeled samples while preserving the learning potential of hard samples and suppressing the propagation of their features within the graph. Experimental results demonstrate that our method achieves competitive performance compared to state-of-the-art methods in scenarios with noisy labels and hard samples.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.