Toward a fair, gender-debiased classifier for the diagnosis of attention deficit/hyperactivity disorder- a Machine-Learning based classification study.

IF 3.8 3区医学 Q2 MEDICAL INFORMATICS

BMC Medical Informatics and Decision Making Pub Date : 2025-08-05 DOI:10.1186/s12911-025-03126-0

Susanne Neufang, Feifei Li, Atae Akhrif, Oya D Beyan

{"title":"Toward a fair, gender-debiased classifier for the diagnosis of attention deficit/hyperactivity disorder- a Machine-Learning based classification study.","authors":"Susanne Neufang, Feifei Li, Atae Akhrif, Oya D Beyan","doi":"10.1186/s12911-025-03126-0","DOIUrl":null,"url":null,"abstract":"Background: Attention deficit/hyperactivity disorder (ADHD) is the most common neurodevelopmental disorder. Gender disparities in the diagnosis of ADHD have been reported, suggesting that females tend to be diagnosed later in life than males are. The delayed diagnosis in females has been attributed to an inequality in the diagnostic criteria, failing to focus on the gender differences regarding symptomatology, comorbidity, and societal factors contributing to this disparity.Methods: In this study, we introduced debiased classifiers for the diagnosis of ADHD via different bias mitigation algorithms of the AI Fairness 360 toolbox on a training dataset of 400 children and adolescents with and without ADHD (98 females, 25 ADHD patients, 73 typically developing females), a subsample of the Child Mind Institute dataset. Test data were acquired in an earlier study. Two datasets were used, one including personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and wavelet variance coefficients as quantifiers of neural dynamics (fMRI), a second dataset included personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and radiomic features of neural structure (sMRI).Results: We found that the reweighed XGBoost model achieved the best accuracy and highest fairness in both datasets. Using model explanation, we showed how reweighing influenced feature importance at the global and local levels.Conclusion: Based on methodological characteristics and insights from global and local model explana-tion, we discuss the reasons of these findings and conclude, that using the combination of bias mitigation and model explanation, improved classification models can be achieved.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"290"},"PeriodicalIF":3.8000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12326834/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03126-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Attention deficit/hyperactivity disorder (ADHD) is the most common neurodevelopmental disorder. Gender disparities in the diagnosis of ADHD have been reported, suggesting that females tend to be diagnosed later in life than males are. The delayed diagnosis in females has been attributed to an inequality in the diagnostic criteria, failing to focus on the gender differences regarding symptomatology, comorbidity, and societal factors contributing to this disparity.

Methods: In this study, we introduced debiased classifiers for the diagnosis of ADHD via different bias mitigation algorithms of the AI Fairness 360 toolbox on a training dataset of 400 children and adolescents with and without ADHD (98 females, 25 ADHD patients, 73 typically developing females), a subsample of the Child Mind Institute dataset. Test data were acquired in an earlier study. Two datasets were used, one including personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and wavelet variance coefficients as quantifiers of neural dynamics (fMRI), a second dataset included personal characteristic features, scores of the clinical questionnaire Child Behavior Checklist, and radiomic features of neural structure (sMRI).

Results: We found that the reweighed XGBoost model achieved the best accuracy and highest fairness in both datasets. Using model explanation, we showed how reweighing influenced feature importance at the global and local levels.

Conclusion: Based on methodological characteristics and insights from global and local model explana-tion, we discuss the reasons of these findings and conclude, that using the combination of bias mitigation and model explanation, improved classification models can be achieved.

Abstract Image

查看原文本刊更多论文

为诊断注意缺陷/多动障碍建立一个公平、无性别偏见的分类器——一项基于机器学习的分类研究。

背景：注意缺陷/多动障碍（ADHD）是最常见的神经发育障碍。据报道，ADHD诊断中的性别差异表明，女性往往比男性更晚被诊断出来。女性的延迟诊断归因于诊断标准的不平等，未能关注导致这种差异的症状学、合并症和社会因素方面的性别差异。方法：在本研究中，我们在儿童心理研究所数据集的一个子样本，400名患有和不患有多动症的儿童和青少年（98名女性，25名ADHD患者，73名正常发育的女性）的训练数据集上，通过AI公平性360工具箱的不同偏见缓解算法，引入了去偏见分类器来诊断多动症。测试数据是在早期的研究中获得的。使用两个数据集，一个数据集包括个人特征、临床问卷儿童行为检查表得分和小波方差系数作为神经动力学（fMRI）的量化指标，另一个数据集包括个人特征、临床问卷儿童行为检查表得分和神经结构放射学特征（sMRI）。结果：我们发现重新加权的XGBoost模型在两个数据集上都获得了最好的准确性和最高的公平性。通过模型解释，我们展示了在全局和局部水平上重新权衡如何影响特征的重要性。结论：基于方法学特点和全球和局部模型解释的见解，我们讨论了这些发现的原因，并得出结论，使用偏差缓解和模型解释相结合，可以实现改进的分类模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Informatics and Decision Making 医学-医学：信息

CiteScore

7.20

自引率

5.70%

发文量

297

审稿时长

1 months

期刊介绍： BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.