On the interpretability of the SVM model for predicting infant mortality in Bangladesh.

IF 2.4 3区医学 Q3 ENVIRONMENTAL SCIENCES

Journal of Health, Population, and Nutrition Pub Date : 2024-10-27 DOI:10.1186/s41043-024-00646-9

Md Abu Sayeed, Azizur Rahman, Atikur Rahman, Rumana Rois

{"title":"On the interpretability of the SVM model for predicting infant mortality in Bangladesh.","authors":"Md Abu Sayeed, Azizur Rahman, Atikur Rahman, Rumana Rois","doi":"10.1186/s41043-024-00646-9","DOIUrl":null,"url":null,"abstract":"Background: Although machine learning (ML) models are well-liked for their outperformance in prediction, greatly avoided due to the lack of intuition and explanation of their predictions. Interpretable ML is, therefore, an emerging research field that combines the performance and interpretability of ML models to create comprehensive solutions for complex decision-making analysis. Conversely, infant mortality is a global public health concern affecting health, social well-being, socio-economic development, and healthcare services. The study employs advanced interpretable ML techniques to anticipate and understand the factors affecting infant mortality in Bangladesh, overcoming the shortcomings of the conventional logistic regression (LR) model.Methods: By utilizing the global surrogate model and local individual conditional expectation (ICE) interpretability technique, the interpretable support vector machine (SVM) has been used in this study to reveal significant characteristics of infant mortality using data from the Bangladesh Demographic and Health Survey (BDHS) 2017-18. To investigate intricate decision-making analysis of infant mortality, we adapted SVM and LR techniques with the hyperparameter tuning parameters. These models' performances were initially assessed using the receiver operating characteristics (ROC) curve, run-time, and confusion matrix parameters with 100 permutations. Afterward, the SVM model's model-agnostic explanation and the LR model's interpretation were compared to enhance advanced comprehension for further insights.Results: The results of the 100 permutations demonstrated that the LR model (Average: accuracy = 0.9105, precision = NaN, sensitivity = 0, specificity = 1, F1-score = 0, area under the ROC curve (AUC) = 0.6780, run-time = 0.0832) outperformed the SVM model (Average: accuracy = 0.8470, precision = 0.1062, sensitivity = 0.0949, specificity = 0.9209, F1-score = 0.1000, AUC = 0.5632, run-time = 0.0254) in predicting infant mortality, but the LR model had a slower run-time and it was unable to predict any positive cases. The interpretation of LR analysis revealed that infant mortality rates decrease when mothers give birth after over two years, with higher educational attainment, overweight or obese mothers, working mothers, and families with polluted cooking fuel having lower rates. The local ICE interpretability technique, which depicts individual influences on the average likelihood of dying within the first birthday, explores the interpretable SVM model that mothers with normal BMIs, giving birth within two years, using less polluted cooking fuel, working mothers, and having male infant were more likely to experience infant death. The interpretable SVM model based on the global surrogate model also reveals that working mothers who used polluted cooking fuel at home and working women who used less polluted cooking fuel but had a longer period between pregnancies than two years would have higher infant death rates. Even among non-working mothers who used polluted cooking fuel and gave birth within two years of the preceding one, infant death rates were higher.Conclusions: The interpretable SVM model reveals global interpretations help clinicians understand the entire conditional distribution, while local interpretations focus on specific instances, providing different insights into model behavior. Interpretable ML models aid policymakers, stakeholders, and families in understanding and preventing infant deaths by improving policy-making strategies and establishing effective family counseling services.","PeriodicalId":15969,"journal":{"name":"Journal of Health, Population, and Nutrition","volume":"43 1","pages":"170"},"PeriodicalIF":2.4000,"publicationDate":"2024-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11520049/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Health, Population, and Nutrition","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s41043-024-00646-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Although machine learning (ML) models are well-liked for their outperformance in prediction, greatly avoided due to the lack of intuition and explanation of their predictions. Interpretable ML is, therefore, an emerging research field that combines the performance and interpretability of ML models to create comprehensive solutions for complex decision-making analysis. Conversely, infant mortality is a global public health concern affecting health, social well-being, socio-economic development, and healthcare services. The study employs advanced interpretable ML techniques to anticipate and understand the factors affecting infant mortality in Bangladesh, overcoming the shortcomings of the conventional logistic regression (LR) model.

Methods: By utilizing the global surrogate model and local individual conditional expectation (ICE) interpretability technique, the interpretable support vector machine (SVM) has been used in this study to reveal significant characteristics of infant mortality using data from the Bangladesh Demographic and Health Survey (BDHS) 2017-18. To investigate intricate decision-making analysis of infant mortality, we adapted SVM and LR techniques with the hyperparameter tuning parameters. These models' performances were initially assessed using the receiver operating characteristics (ROC) curve, run-time, and confusion matrix parameters with 100 permutations. Afterward, the SVM model's model-agnostic explanation and the LR model's interpretation were compared to enhance advanced comprehension for further insights.

Results: The results of the 100 permutations demonstrated that the LR model (Average: accuracy = 0.9105, precision = NaN, sensitivity = 0, specificity = 1, F1-score = 0, area under the ROC curve (AUC) = 0.6780, run-time = 0.0832) outperformed the SVM model (Average: accuracy = 0.8470, precision = 0.1062, sensitivity = 0.0949, specificity = 0.9209, F1-score = 0.1000, AUC = 0.5632, run-time = 0.0254) in predicting infant mortality, but the LR model had a slower run-time and it was unable to predict any positive cases. The interpretation of LR analysis revealed that infant mortality rates decrease when mothers give birth after over two years, with higher educational attainment, overweight or obese mothers, working mothers, and families with polluted cooking fuel having lower rates. The local ICE interpretability technique, which depicts individual influences on the average likelihood of dying within the first birthday, explores the interpretable SVM model that mothers with normal BMIs, giving birth within two years, using less polluted cooking fuel, working mothers, and having male infant were more likely to experience infant death. The interpretable SVM model based on the global surrogate model also reveals that working mothers who used polluted cooking fuel at home and working women who used less polluted cooking fuel but had a longer period between pregnancies than two years would have higher infant death rates. Even among non-working mothers who used polluted cooking fuel and gave birth within two years of the preceding one, infant death rates were higher.

Conclusions: The interpretable SVM model reveals global interpretations help clinicians understand the entire conditional distribution, while local interpretations focus on specific instances, providing different insights into model behavior. Interpretable ML models aid policymakers, stakeholders, and families in understanding and preventing infant deaths by improving policy-making strategies and establishing effective family counseling services.

查看原文本刊更多论文

关于预测孟加拉国婴儿死亡率的 SVM 模型的可解释性。

背景：尽管机器学习（ML）模型因其出色的预测性能而备受青睐，但由于其预测结果缺乏直观性和解释性，人们对其避而远之。因此，可解释的 ML 是一个新兴的研究领域，它结合了 ML 模型的性能和可解释性，为复杂的决策分析提供了全面的解决方案。相反，婴儿死亡率是一个全球性的公共卫生问题，影响着健康、社会福利、社会经济发展和医疗保健服务。本研究采用先进的可解释 ML 技术来预测和了解影响孟加拉国婴儿死亡率的因素，克服了传统逻辑回归（LR）模型的缺点：通过利用全局代用模型和局部个体条件期望（ICE）可解释性技术，本研究使用了可解释支持向量机（SVM），利用 2017-18 年孟加拉国人口与健康调查（BDHS）的数据揭示了婴儿死亡率的重要特征。为了研究婴儿死亡率的复杂决策分析，我们调整了 SVM 和 LR 技术的超参数调整参数。这些模型的性能最初是通过接收者操作特征曲线（ROC）、运行时间和混淆矩阵参数（100 次排列）进行评估的。随后，对 SVM 模型的模型无关性解释和 LR 模型的解释进行了比较，以提高高级理解能力，从而获得更深入的见解：100 次排列的结果表明，LR 模型（平均：准确度 = 0.9105，精确度 = NaN，灵敏度 = 0，特异性 = 1，F1-分数 = 0，ROC 曲线下面积 (AUC) = 0.6780，运行时间 = 0.0832）优于 SVM 模型（平均：准确度 = 0.8470，精确度 = 0.1070，F1-分数 = 0，ROC 曲线下面积 (AUC) = 0.6780，运行时间 = 0.0832）。8470, precision = 0.1062, sensitivity = 0.0949, specificity = 0.9209, F1-score = 0.1000, AUC = 0.5632, run-time = 0.0254），但 LR 模型的运行时间较慢，且无法预测任何阳性病例。LR 分析的解释结果表明，当母亲生育超过两年后，婴儿死亡率会下降，教育程度较高、超重或肥胖的母亲、有工作的母亲以及使用污染烹饪燃料的家庭的婴儿死亡率较低。本地 ICE 可解释性技术描绘了对第一个生日内死亡的平均可能性的个体影响，它探索了可解释 SVM 模型，即体重指数正常、在两年内分娩、使用污染较少的烹饪燃料、职业母亲和有男婴的母亲更有可能经历婴儿死亡。基于全球代孕模型的可解释 SVM 模型还显示，在家使用污染烹饪燃料的职业母亲和使用污染较少的烹饪燃料但两次怀孕间隔时间超过两年的职业妇女的婴儿死亡率较高。即使在使用污染烹饪燃料且在前一胎后两年内分娩的非职业母亲中，婴儿死亡率也较高：可解释 SVM 模型揭示的全局解释有助于临床医生理解整个条件分布，而局部解释则侧重于特定实例，为模型行为提供了不同的见解。可解释的 ML 模型有助于政策制定者、利益相关者和家庭了解并通过改进政策制定策略和建立有效的家庭咨询服务来预防婴儿死亡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Health, Population, and Nutrition 医学-公共卫生、环境卫生与职业卫生

CiteScore

2.20

自引率

0.00%

发文量

审稿时长

6 months

期刊介绍： Journal of Health, Population and Nutrition brings together research on all aspects of issues related to population, nutrition and health. The journal publishes articles across a broad range of topics including global health, maternal and child health, nutrition, common illnesses and determinants of population health.