Ensemble Machine Learning Algorithm for Diabetes Prediction in Maiduguri, Borno State

Emmanuel Gbenga Dada, Aishatu Ibrahim Birma, Abdulkarim Abbas Gora
{"title":"Ensemble Machine Learning Algorithm for Diabetes Prediction in Maiduguri, Borno State","authors":"Emmanuel Gbenga Dada, Aishatu Ibrahim Birma, Abdulkarim Abbas Gora","doi":"10.58578/mjms.v2i2.2875","DOIUrl":null,"url":null,"abstract":"Diabetes mellitus (DM) is a metabolic disease characterised by high levels of glucose in the blood, known as hyperglycemia, that can result in multiple problems within the body. The World Health Organisation (WHO) data for 2021 reveals a substantial increase in the prevalence of diabetes mellitus (DM), with the number of cases rising from 108 million in 1980 to 422 million in 2014. Between 2000 and 2019, there was a 3% increase in mortality rates associated with diabetes, categorised by age. In 2019, DM caused the deaths of more than 2 million people. These concerning figures clearly necessitate an immediate response. An alarming incidence of diabetes among the population of Maiduguri and Borno State inspired this investigation. This research proposed stacking ensemble learning approach to predict the rate of occurrence of diabetes cases in Maiduguri. The paper used different types of regression models to predict the occurrences of diabetes cases in Maiduguri over time. These models included adaptive boosting regression (Adaboost), gradient boosting regression (GBOOST), random forest regression (RFR), ordinary least square regression (OLS), least absolute shrinkage selection operator regression (LASSO), and ridge regression (RIDGE). The performance indicators studied in this work are root mean square (RMSE), mean absolute error (MAE), and mean square error (MSE). These metrics were used to assess the effectiveness of both the machine learning and proposed Stacking Ensemble Learning (SEL) approaches. Performance metrics considered in this study are root mean square (RMSE), mean absolute error (MAE), and mean square error (MSE), which were used to evaluate the performance of the machine learning and the proposed Stacking Ensemble Learning (SEL) technique. Experimental results revealed that SEL is a better predictor compared to other machine learning approaches considered in this work with an RMSE of 0.0493; a MSE of 0.0024; and a MAE of 0.0349. It is hoped that this research will help government officials understand the threat of diabetes and take the necessary mitigation actions.","PeriodicalId":515236,"journal":{"name":"Mikailalsys Journal of Mathematics and Statistics","volume":"108 50","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mikailalsys Journal of Mathematics and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.58578/mjms.v2i2.2875","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Diabetes mellitus (DM) is a metabolic disease characterised by high levels of glucose in the blood, known as hyperglycemia, that can result in multiple problems within the body. The World Health Organisation (WHO) data for 2021 reveals a substantial increase in the prevalence of diabetes mellitus (DM), with the number of cases rising from 108 million in 1980 to 422 million in 2014. Between 2000 and 2019, there was a 3% increase in mortality rates associated with diabetes, categorised by age. In 2019, DM caused the deaths of more than 2 million people. These concerning figures clearly necessitate an immediate response. An alarming incidence of diabetes among the population of Maiduguri and Borno State inspired this investigation. This research proposed stacking ensemble learning approach to predict the rate of occurrence of diabetes cases in Maiduguri. The paper used different types of regression models to predict the occurrences of diabetes cases in Maiduguri over time. These models included adaptive boosting regression (Adaboost), gradient boosting regression (GBOOST), random forest regression (RFR), ordinary least square regression (OLS), least absolute shrinkage selection operator regression (LASSO), and ridge regression (RIDGE). The performance indicators studied in this work are root mean square (RMSE), mean absolute error (MAE), and mean square error (MSE). These metrics were used to assess the effectiveness of both the machine learning and proposed Stacking Ensemble Learning (SEL) approaches. Performance metrics considered in this study are root mean square (RMSE), mean absolute error (MAE), and mean square error (MSE), which were used to evaluate the performance of the machine learning and the proposed Stacking Ensemble Learning (SEL) technique. Experimental results revealed that SEL is a better predictor compared to other machine learning approaches considered in this work with an RMSE of 0.0493; a MSE of 0.0024; and a MAE of 0.0349. It is hoped that this research will help government officials understand the threat of diabetes and take the necessary mitigation actions.
用于博尔诺州迈杜古里市糖尿病预测的集合机器学习算法
糖尿病(DM)是一种代谢性疾病,其特征是血液中葡萄糖含量过高,即所谓的高血糖症,可导致身体出现多种问题。世界卫生组织(WHO)2021 年的数据显示,糖尿病(DM)患病率大幅上升,病例数从 1980 年的 1.08 亿增加到 2014 年的 4.22 亿。从2000年到2019年,按年龄分类,与糖尿病相关的死亡率增加了3%。2019 年,糖尿病导致 200 多万人死亡。这些令人担忧的数字显然需要立即采取应对措施。迈杜古里和博尔诺州人口中惊人的糖尿病发病率激发了这项调查。这项研究提出了堆叠集合学习方法来预测迈杜古里糖尿病病例的发生率。论文使用了不同类型的回归模型来预测迈杜古里糖尿病病例随着时间推移的发生率。这些模型包括自适应提升回归(Adaboost)、梯度提升回归(GBOOST)、随机森林回归(RFR)、普通最小平方回归(OLS)、最小绝对收缩选择算子回归(LASSO)和脊回归(RIDGE)。本文研究的性能指标包括均方根(RMSE)、平均绝对误差(MAE)和均方误差(MSE)。这些指标用于评估机器学习方法和建议的堆叠集合学习(SEL)方法的有效性。本研究考虑的性能指标是均方根(RMSE)、平均绝对误差(MAE)和均方误差(MSE),这些指标用于评估机器学习和拟议的堆积集合学习(SEL)技术的性能。实验结果表明,与本研究中考虑的其他机器学习方法相比,SEL 是一种更好的预测方法,其 RMSE 为 0.0493;MSE 为 0.0024;MAE 为 0.0349。希望这项研究能帮助政府官员了解糖尿病的威胁,并采取必要的缓解措施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信