{"title":"双分支噪声提取与抑制下的标签噪声面部表情识别","authors":"Yunfei Li;Hao Liu;Daihong Jiang;Jiuzhen Liang","doi":"10.1109/TAFFC.2024.3519359","DOIUrl":null,"url":null,"abstract":"Noisy labels in Facial Expression Recognition (FER) datasets severely affect the performance of FER models. We propose a novel dual-branch noise extraction and suppression method to address this issue. This algorithm reduces the model’s impact from noisy labels by decreasing the dataset noise ratio and suppressing label-noisy samples. The method comprises three primary stages: sample extraction, pseudo-label generation, and re-training. The approach initially extracts label-noisy samples from the dataset by computing an exponential moving average of the model predictions and the joint probability distribution matrix of noisy and actual labels. The remaining samples form a clean dataset. Next, the training weights of the clean dataset are utilized to assign appropriate pseudo-labels to the label-noisy samples. Subsequently, the noisy labels are replaced with pseudo-labels to create a corrected dataset. The corrected and clean datasets are combined to create the reconstructed dataset, reducing noisy labels within the dataset. Finally, the model is retrained using the reconstructed dataset. Furthermore, this study introduces a novel gradient suppression smoothing function specifically designed to mitigate the impact of label-noisy samples in the dataset during the re-training process. The proposed algorithm is robust, with accuracies of 91.17%, 91.56%, and 91.58% on the RAF-DB dataset with 10%, 20%, and 30% noisy labels, and accuracies of 89.91%, 90.17%, and 89.69% on the corresponding FERPlus.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 3","pages":"1514-1525"},"PeriodicalIF":9.8000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Facial Expression Recognition With Label-Noisy Under Dual-Branch Noise Extraction and Suppression\",\"authors\":\"Yunfei Li;Hao Liu;Daihong Jiang;Jiuzhen Liang\",\"doi\":\"10.1109/TAFFC.2024.3519359\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Noisy labels in Facial Expression Recognition (FER) datasets severely affect the performance of FER models. We propose a novel dual-branch noise extraction and suppression method to address this issue. This algorithm reduces the model’s impact from noisy labels by decreasing the dataset noise ratio and suppressing label-noisy samples. The method comprises three primary stages: sample extraction, pseudo-label generation, and re-training. The approach initially extracts label-noisy samples from the dataset by computing an exponential moving average of the model predictions and the joint probability distribution matrix of noisy and actual labels. The remaining samples form a clean dataset. Next, the training weights of the clean dataset are utilized to assign appropriate pseudo-labels to the label-noisy samples. Subsequently, the noisy labels are replaced with pseudo-labels to create a corrected dataset. The corrected and clean datasets are combined to create the reconstructed dataset, reducing noisy labels within the dataset. Finally, the model is retrained using the reconstructed dataset. Furthermore, this study introduces a novel gradient suppression smoothing function specifically designed to mitigate the impact of label-noisy samples in the dataset during the re-training process. The proposed algorithm is robust, with accuracies of 91.17%, 91.56%, and 91.58% on the RAF-DB dataset with 10%, 20%, and 30% noisy labels, and accuracies of 89.91%, 90.17%, and 89.69% on the corresponding FERPlus.\",\"PeriodicalId\":13131,\"journal\":{\"name\":\"IEEE Transactions on Affective Computing\",\"volume\":\"16 3\",\"pages\":\"1514-1525\"},\"PeriodicalIF\":9.8000,\"publicationDate\":\"2024-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Affective Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10806865/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10806865/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Facial Expression Recognition With Label-Noisy Under Dual-Branch Noise Extraction and Suppression
Noisy labels in Facial Expression Recognition (FER) datasets severely affect the performance of FER models. We propose a novel dual-branch noise extraction and suppression method to address this issue. This algorithm reduces the model’s impact from noisy labels by decreasing the dataset noise ratio and suppressing label-noisy samples. The method comprises three primary stages: sample extraction, pseudo-label generation, and re-training. The approach initially extracts label-noisy samples from the dataset by computing an exponential moving average of the model predictions and the joint probability distribution matrix of noisy and actual labels. The remaining samples form a clean dataset. Next, the training weights of the clean dataset are utilized to assign appropriate pseudo-labels to the label-noisy samples. Subsequently, the noisy labels are replaced with pseudo-labels to create a corrected dataset. The corrected and clean datasets are combined to create the reconstructed dataset, reducing noisy labels within the dataset. Finally, the model is retrained using the reconstructed dataset. Furthermore, this study introduces a novel gradient suppression smoothing function specifically designed to mitigate the impact of label-noisy samples in the dataset during the re-training process. The proposed algorithm is robust, with accuracies of 91.17%, 91.56%, and 91.58% on the RAF-DB dataset with 10%, 20%, and 30% noisy labels, and accuracies of 89.91%, 90.17%, and 89.69% on the corresponding FERPlus.
期刊介绍:
The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems that prioritize the consideration of affect in their usability. We also welcome surveys of existing work that provide new perspectives on the historical and future directions of this field.