Toward a fair, gender-debiased classifier for the diagnosis of attention deficit/hyperactivity disorder- a Machine-Learning based classification study.
Susanne Neufang, Feifei Li, Atae Akhrif, Oya D Beyan
{"title":"Toward a fair, gender-debiased classifier for the diagnosis of attention deficit/hyperactivity disorder- a Machine-Learning based classification study.","authors":"Susanne Neufang, Feifei Li, Atae Akhrif, Oya D Beyan","doi":"10.1186/s12911-025-03126-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Attention deficit/hyperactivity disorder (ADHD) is the most common neurodevelopmental disorder. Gender disparities in the diagnosis of ADHD have been reported, suggesting that females tend to be diagnosed later in life than males are. The delayed diagnosis in females has been attributed to an inequality in the diagnostic criteria, failing to focus on the gender differences regarding symptomatology, comorbidity, and societal factors contributing to this disparity.</p><p><strong>Methods: </strong>In this study, we introduced debiased classifiers for the diagnosis of ADHD via different bias mitigation algorithms of the AI Fairness 360 toolbox on a training dataset of 400 children and adolescents with and without ADHD (98 females, 25 ADHD patients, 73 typically developing females), a subsample of the Child Mind Institute dataset. Test data were acquired in an earlier study. Two datasets were used, one including personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and wavelet variance coefficients as quantifiers of neural dynamics (fMRI), a second dataset included personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and radiomic features of neural structure (sMRI).</p><p><strong>Results: </strong>We found that the reweighed XGBoost model achieved the best accuracy and highest fairness in both datasets. Using model explanation, we showed how reweighing influenced feature importance at the global and local levels.</p><p><strong>Conclusion: </strong>Based on methodological characteristics and insights from global and local model explana-tion, we discuss the reasons of these findings and conclude, that using the combination of bias mitigation and model explanation, improved classification models can be achieved.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"290"},"PeriodicalIF":3.8000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12326834/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03126-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Attention deficit/hyperactivity disorder (ADHD) is the most common neurodevelopmental disorder. Gender disparities in the diagnosis of ADHD have been reported, suggesting that females tend to be diagnosed later in life than males are. The delayed diagnosis in females has been attributed to an inequality in the diagnostic criteria, failing to focus on the gender differences regarding symptomatology, comorbidity, and societal factors contributing to this disparity.
Methods: In this study, we introduced debiased classifiers for the diagnosis of ADHD via different bias mitigation algorithms of the AI Fairness 360 toolbox on a training dataset of 400 children and adolescents with and without ADHD (98 females, 25 ADHD patients, 73 typically developing females), a subsample of the Child Mind Institute dataset. Test data were acquired in an earlier study. Two datasets were used, one including personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and wavelet variance coefficients as quantifiers of neural dynamics (fMRI), a second dataset included personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and radiomic features of neural structure (sMRI).
Results: We found that the reweighed XGBoost model achieved the best accuracy and highest fairness in both datasets. Using model explanation, we showed how reweighing influenced feature importance at the global and local levels.
Conclusion: Based on methodological characteristics and insights from global and local model explana-tion, we discuss the reasons of these findings and conclude, that using the combination of bias mitigation and model explanation, improved classification models can be achieved.
期刊介绍:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.