Joyce Nakatumba-Nabende, Sulaiman Kagumire, Caroline Kantono, Peter Nabende
ACM Computing Surveys. Published 2025-09-20. DOI: 10.1145/3769089 (https://doi.org/10.1145/3769089)
A Systematic Literature Review on Bias Evaluation and Mitigation in Automatic Speech Recognition Models for Low-Resource African Languages
With recent advancements in speech recognition, it is crucial to ensure that automatic speech recognition (ASR) systems do not exhibit systematic biases, such as those related to gender, age, accent, and dialect. Although these biases have been examined extensively for high-resource languages, research on low-resource African languages remains limited. This systematic literature review synthesizes evidence on bias evaluation and mitigation in ASR models for African languages, adhering to the PRISMA reporting guidelines. Our analysis reveals that most biases stem from data imbalances and limited linguistic diversity in training datasets, resulting in disproportionately high error rates for underrepresented speaker groups. Mitigation strategies in African contexts have primarily focused on data-centric methods, including dataset expansion, augmentation, and transfer learning. In contrast, more advanced approaches, including fairness-aware modeling, bias-aware loss functions, adversarial debiasing, and speaker-adaptive techniques, are rarely applied. Gender, accent, and dialect biases dominate the few African studies available, while age and racial biases are almost absent. The limited number of African languages covered highlights the urgent need for more representative and inclusive research. Addressing these gaps will support the development of fairer and more robust ASR technologies across the continent.
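The abstract's point about disproportionately high error rates for underrepresented speaker groups can be made concrete with a small sketch. The code below is an illustration only, not a method taken from the review: it assumes word error rate (WER) as the metric, and the function names (`edit_distance`, `wer_by_group`, `wer_gap`), the group labels, and the toy transcripts are all hypothetical.

```python
from collections import defaultdict


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer_by_group(samples):
    """Compute WER per speaker group.

    samples: iterable of (group, reference, hypothesis) strings.
    Returns {group: total word errors / total reference words}.
    """
    errors = defaultdict(int)
    words = defaultdict(int)
    for group, ref, hyp in samples:
        ref_tokens, hyp_tokens = ref.split(), hyp.split()
        errors[group] += edit_distance(ref_tokens, hyp_tokens)
        words[group] += len(ref_tokens)
    return {g: errors[g] / words[g] for g in words if words[g] > 0}


def wer_gap(group_wer):
    """Absolute gap between the worst and best group WER."""
    return max(group_wer.values()) - min(group_wer.values())


# Hypothetical toy data: (speaker group, reference, ASR output).
samples = [
    ("group_a", "the cat sat on the mat", "the cat sat on the mat"),
    ("group_a", "good morning to you", "good morning to you"),
    ("group_b", "the cat sat on the mat", "the cat sat on a mat"),
    ("group_b", "good morning to you", "good morning you"),
]
rates = wer_by_group(samples)
print(rates, wer_gap(rates))
```

A gap near zero would suggest similar performance across groups on this metric. In practice the groups would need enough utterances each for the comparison to be meaningful, and a single gap figure is only one of several possible ways to summarize disparity.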
About the journal:
ACM Computing Surveys (CSUR) is an academic journal that publishes surveys and tutorials on various areas of computing research and practice. The journal aims to provide comprehensive and easily understandable articles that guide readers through the literature and help them understand topics outside their specialties. CSUR had a 2022 Impact Factor of 16.6 and is ranked 3rd out of 111 journals in the field of Computer Science Theory & Methods.
ACM Computing Surveys is indexed and abstracted in various services, including AI2 Semantic Scholar, Baidu, Clarivate/ISI: JCR, CNKI, DeepDyve, DTU, EBSCO: EDS/HOST, and IET Inspec, among others.