{"title":"MUTFER2024: A new dataset for South African emotion recognition","authors":"Rogerant Tshibangu , Jules R Tapamo","doi":"10.1016/j.dib.2025.111592","DOIUrl":null,"url":null,"abstract":"<div><div>Facial Emotion Recognition (FER) plays a critical role in applications such as human-computer interaction, security, and healthcare. The effectiveness of FER systems largely depends on the quality and diversity of the datasets used for training and evaluation. However, existing FER datasets often lack adequate representation of African populations, leading to racial biases in recognizing emotions across diverse ethnic groups. This issue arises from the predominance of Western-centric datasets used in training FER systems, which results in inaccurate and biased outcomes when applied to African or non-Caucasian faces.</div><div>To address this limitation, we introduce MUTFER2024, a novel dataset developed at Mangosuthu University of Technology. MUTFER2024 aims to minimize racial bias in FER systems by providing an extensive collection of facial emotion images from African participants. The dataset comprises 13,032 images collected from 300 individuals, including students and staff members, and is categorized into seven emotion classes: happy, sad, angry, surprised, neutral, disgusted, and fearful.</div><div>This paper details the methodology employed in data collection, segmentation, and categorization. Facial emotion images were gathered through structured submission protocols to ensure diversity in expressions. Subsequently, the images were meticulously segmented and categorized into the specified emotion classes. Data were collected under real-world conditions using mobile and computer cameras. The dataset is hosted on GitHub and can be used to train emotion recognition models for underrepresented African populations.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111592"},"PeriodicalIF":1.0000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925003245","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Facial Emotion Recognition (FER) plays a critical role in applications such as human-computer interaction, security, and healthcare. The effectiveness of FER systems largely depends on the quality and diversity of the datasets used for training and evaluation. However, existing FER datasets often lack adequate representation of African populations, leading to racial biases in recognizing emotions across diverse ethnic groups. This issue arises from the predominance of Western-centric datasets used in training FER systems, which results in inaccurate and biased outcomes when applied to African or non-Caucasian faces.
To address this limitation, we introduce MUTFER2024, a novel dataset developed at Mangosuthu University of Technology. MUTFER2024 aims to minimize racial bias in FER systems by providing an extensive collection of facial emotion images from African participants. The dataset comprises 13,032 images collected from 300 individuals, including students and staff members, and is categorized into seven emotion classes: happy, sad, angry, surprised, neutral, disgusted, and fearful.
This paper details the methodology employed in data collection, segmentation, and categorization. Facial emotion images were gathered through structured submission protocols to ensure diversity in expressions. Subsequently, the images were meticulously segmented and categorized into the specified emotion classes. Data were collected under real-world conditions using mobile and computer cameras. The dataset is hosted on GitHub and can be used to train emotion recognition models for underrepresented African populations.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.