{"title":"基于机器学习的可信糖尿病预测模型","authors":"Aruna Devi B, Karthik N","doi":"10.1109/COMSNETS59351.2024.10427272","DOIUrl":null,"url":null,"abstract":"Diabetes is a condition that develops when the blood sugar is above average level. Diabetes shoots up the risk of eye, kidney, nerve, and heart problems. By adopting suitable measures to prevent or manage blood sugar level may reduce the chances of getting diabetes-related health issues. Medical data collection is not done on a regular basis but is determined by the patient's condition and other clinical or administrative factors. A trustworthy model is free from missing values, faulty data and makes reliable predictions. Factors like data quality, anomalies, and model selection play a significant role in building a trustworthy model. Hence, this work concentrates on the prediction of diabetes by considering the following issues: 1) missing values in the dataset, 2) an imbalanced dataset and the presence of anomalies, and 3) diabetes prediction using Machine learning (ML) algorithm. The main objective of this work is to propose a trustworthy diabetes prediction model for an incomplete and imbalanced dataset using ML-based imputation, techniques for balancing datasets, and anomaly detection.","PeriodicalId":518748,"journal":{"name":"2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)","volume":"22 1","pages":"400-402"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning-Based Trustworthy Diabetes Prediction Model\",\"authors\":\"Aruna Devi B, Karthik N\",\"doi\":\"10.1109/COMSNETS59351.2024.10427272\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diabetes is a condition that develops when the blood sugar is above average level. Diabetes shoots up the risk of eye, kidney, nerve, and heart problems. By adopting suitable measures to prevent or manage blood sugar level may reduce the chances of getting diabetes-related health issues. Medical data collection is not done on a regular basis but is determined by the patient's condition and other clinical or administrative factors. A trustworthy model is free from missing values, faulty data and makes reliable predictions. Factors like data quality, anomalies, and model selection play a significant role in building a trustworthy model. Hence, this work concentrates on the prediction of diabetes by considering the following issues: 1) missing values in the dataset, 2) an imbalanced dataset and the presence of anomalies, and 3) diabetes prediction using Machine learning (ML) algorithm. The main objective of this work is to propose a trustworthy diabetes prediction model for an incomplete and imbalanced dataset using ML-based imputation, techniques for balancing datasets, and anomaly detection.\",\"PeriodicalId\":518748,\"journal\":{\"name\":\"2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)\",\"volume\":\"22 1\",\"pages\":\"400-402\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COMSNETS59351.2024.10427272\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COMSNETS59351.2024.10427272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
糖尿病是血糖超过平均水平时出现的一种病症。糖尿病会增加眼睛、肾脏、神经和心脏出现问题的风险。通过采取适当的措施预防或控制血糖水平,可以降低出现糖尿病相关健康问题的几率。医疗数据的收集不是定期进行的,而是根据病人的病情和其他临床或行政因素决定的。值得信赖的模型不存在缺失值和错误数据,并能做出可靠的预测。数据质量、异常情况和模型选择等因素在建立可信模型的过程中发挥着重要作用。因此,这项工作主要通过考虑以下问题来预测糖尿病:1)数据集中的缺失值;2)不平衡数据集和异常的存在;3)使用机器学习(ML)算法预测糖尿病。这项工作的主要目的是利用基于 ML 的估算、平衡数据集的技术和异常检测,为不完整和不平衡的数据集提出一个值得信赖的糖尿病预测模型。
Machine Learning-Based Trustworthy Diabetes Prediction Model
Diabetes is a condition that develops when the blood sugar is above average level. Diabetes shoots up the risk of eye, kidney, nerve, and heart problems. By adopting suitable measures to prevent or manage blood sugar level may reduce the chances of getting diabetes-related health issues. Medical data collection is not done on a regular basis but is determined by the patient's condition and other clinical or administrative factors. A trustworthy model is free from missing values, faulty data and makes reliable predictions. Factors like data quality, anomalies, and model selection play a significant role in building a trustworthy model. Hence, this work concentrates on the prediction of diabetes by considering the following issues: 1) missing values in the dataset, 2) an imbalanced dataset and the presence of anomalies, and 3) diabetes prediction using Machine learning (ML) algorithm. The main objective of this work is to propose a trustworthy diabetes prediction model for an incomplete and imbalanced dataset using ML-based imputation, techniques for balancing datasets, and anomaly detection.