A machine learning classifier-based approach for diabetes mellitus risk prediction
Authors: Jai Kumar B, Mohanasundaram Ranganathan
Journal: Biomedical Physics & Engineering Express (Impact Factor 1.3; JCR Q3, Radiology, Nuclear Medicine & Medical Imaging)
Publication type: Journal Article
Publication date: 2024-10-10
DOI: 10.1088/2057-1976/ad857b (https://doi.org/10.1088/2057-1976/ad857b)
Citations: 0
Abstract
Diabetes Mellitus (DM) can be life-threatening, driven in part by individuals' dietary habits and lifestyle choices. Diabetes is characterised by elevated levels of glucose in the blood and an excess of protein in the blood. Poor eating habits and lifestyles are largely responsible for the rise in overweight, obesity, and various related conditions. This study investigated a range of diabetes risk prediction techniques and algorithms. Eight machine learning (ML) algorithms were tested on a diabetes dataset: Support Vector Classifier, gradient boosting, multilayer perceptron, random forest, K-nearest neighbors, logistic regression, extreme gradient boosting, and decision tree. To improve the model's ability to predict diabetes, we propose using Feature Engineering (FE) and feature scaling. We used the Mendeley diabetes dataset to assess the model's predictive capacity and implemented the eight classifiers in Python. Random Forest reached 99.21% and Gradient Boosting 99.61%, while Extreme Gradient Boosting and Decision Tree achieved the highest scores of all classification approaches: F1 score 99.81%, accuracy 99.80%, precision 99.81%, and recall 99.81%.
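The abstract names feature scaling and four evaluation metrics without defining them. The following is a minimal sketch of the standard definitions, not the authors' code: it assumes z-score scaling (the abstract does not say which scaler was used) and a binary diabetic/non-diabetic labelling. The helper names `standardize` and `binary_metrics`, the glucose values, and the labels are all hypothetical and chosen only for illustration.

```python
from math import sqrt


def standardize(column):
    """Z-score scaling: subtract the mean, divide by the population std.

    Assumption: the abstract only says "feature scaling"; z-scoring is one
    common choice, not necessarily the one used in the paper.
    """
    mean = sum(column) / len(column)
    variance = sum((x - mean) ** 2 for x in column) / len(column)
    std = sqrt(variance)
    if std == 0:
        # A constant feature carries no information; map it to zeros.
        return [0.0 for _ in column]
    return [(x - mean) / std for x in column]


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 with respect to one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up data, not taken from the Mendeley dataset.
scaled = standardize([90.0, 110.0, 130.0])   # hypothetical glucose readings
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]      # hypothetical ground truth
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]      # hypothetical classifier output
metrics = binary_metrics(y_true, y_pred)
```

Reporting all four metrics together, as the abstract does, guards against a classifier that looks accurate only because one class dominates the data: accuracy alone would not reveal missed positive cases, whereas recall and F1 would.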
About the journal:
BPEX is an inclusive, international, multidisciplinary journal devoted to publishing new research on any application of physics and/or engineering in medicine and/or biology. Characterized by a broad geographical coverage and a fast-track peer-review process, relevant topics include all aspects of biophysics, medical physics and biomedical engineering. Papers that are almost entirely clinical or biological in their focus are not suitable. The journal has an emphasis on publishing interdisciplinary work and bringing research fields together, encompassing experimental, theoretical and computational work.