{"title":"Diabetes Prediction using Machine Learning","authors":"G. Parimala, R. Kayalvizhi, S. Nithiya","doi":"10.1109/ICCCI56745.2023.10128216","DOIUrl":null,"url":null,"abstract":"Diabetes is considered to be one of the worst illnesses in the world. Diabetes is caused by a combination of variables, including obesity, excessive blood glucose levels, and other causes. It does this by altering the insulin hormone, which in turn causes an irregular metabolism in the crab and raises its blood sugar levels. This program’s primary objective is to lessen the risk that people may acquire diabetes by making forecasts for them and urging them to take more care of their diet and lifestyle in the years to come. The key goals of this research were to develop and execute a method for predicting diabetes using machine learning techniques, as well as investigate the strategies that would be used to achieve success in this Endeavour. The suggested technique makes use of a wide variety of classification and ensemble learning algorithms, some examples of which include Knn, Label Encoder, and train test split. The results of the research may provide information that will help medical professionals make more accurate early predictions and judgments in order to better manage diabetes and save lives. The method first extracts information from a dataset, such as certain symptoms that may be utilized to gain further knowledge about diabetes, and then validates that information using other data. This paper objective was to build classification models for the diabetes data set, develop models that can determine whether or not a person is sick, and get the greatest possible validation scores in the models that were developed. Massive datasets may be found in the healthcare business. By investigating enormous datasets in this manner, we may uncover previously unknown information and trends, which will enable us to draw conclusions based on the data and make accurate forecasts. We categorize the dataset using random techniques since our major goal in doing this research is to determine the method that is the most accurate for predicting diabetes. This will be accomplished by integrating machine learning, data visualization, and data interpretation. The use of machine learning, which is becoming more important in the modern healthcare sector, will be the focus of this research. Massive datasets may be found in the healthcare business.","PeriodicalId":205683,"journal":{"name":"2023 International Conference on Computer Communication and Informatics (ICCCI)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Communication and Informatics (ICCCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCI56745.2023.10128216","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Diabetes is considered to be one of the worst illnesses in the world. Diabetes is caused by a combination of variables, including obesity, excessive blood glucose levels, and other causes. It does this by altering the insulin hormone, which in turn causes an irregular metabolism in the crab and raises its blood sugar levels. This program’s primary objective is to lessen the risk that people may acquire diabetes by making forecasts for them and urging them to take more care of their diet and lifestyle in the years to come. The key goals of this research were to develop and execute a method for predicting diabetes using machine learning techniques, as well as investigate the strategies that would be used to achieve success in this Endeavour. The suggested technique makes use of a wide variety of classification and ensemble learning algorithms, some examples of which include Knn, Label Encoder, and train test split. The results of the research may provide information that will help medical professionals make more accurate early predictions and judgments in order to better manage diabetes and save lives. The method first extracts information from a dataset, such as certain symptoms that may be utilized to gain further knowledge about diabetes, and then validates that information using other data. This paper objective was to build classification models for the diabetes data set, develop models that can determine whether or not a person is sick, and get the greatest possible validation scores in the models that were developed. Massive datasets may be found in the healthcare business. By investigating enormous datasets in this manner, we may uncover previously unknown information and trends, which will enable us to draw conclusions based on the data and make accurate forecasts. We categorize the dataset using random techniques since our major goal in doing this research is to determine the method that is the most accurate for predicting diabetes. This will be accomplished by integrating machine learning, data visualization, and data interpretation. The use of machine learning, which is becoming more important in the modern healthcare sector, will be the focus of this research. Massive datasets may be found in the healthcare business.
糖尿病被认为是世界上最严重的疾病之一。糖尿病是由多种因素引起的,包括肥胖、血糖水平过高和其他原因。它通过改变胰岛素激素来做到这一点,而胰岛素激素反过来又会导致螃蟹的新陈代谢不规律,并提高血糖水平。这个项目的主要目标是通过对人们进行预测,并敦促他们在未来几年更加注意自己的饮食和生活方式,从而降低人们患糖尿病的风险。这项研究的主要目标是开发和执行一种使用机器学习技术预测糖尿病的方法,以及研究将用于在这一努力中取得成功的策略。建议的技术使用了各种各样的分类和集成学习算法,其中一些例子包括Knn、Label Encoder和train test split。研究结果可能会提供信息,帮助医疗专业人员做出更准确的早期预测和判断,以便更好地管理糖尿病和挽救生命。该方法首先从数据集中提取信息,例如可用于进一步了解糖尿病的某些症状,然后使用其他数据验证该信息。本文的目标是为糖尿病数据集建立分类模型,开发可以确定一个人是否生病的模型,并在所开发的模型中获得尽可能高的验证分数。在医疗保健业务中可能会发现大量数据集。通过这种方式调查大量的数据集,我们可以发现以前未知的信息和趋势,这将使我们能够根据数据得出结论并做出准确的预测。我们使用随机技术对数据集进行分类,因为我们做这项研究的主要目标是确定预测糖尿病最准确的方法。这将通过整合机器学习、数据可视化和数据解释来实现。机器学习的使用在现代医疗保健领域变得越来越重要,这将是本研究的重点。在医疗保健业务中可能会发现大量数据集。