Predicting Diabetes using SVM Implemented by Machine Learning
Srikar Sistla
International Journal of Soft Computing and Engineering, volume 12 1
Publication date: 2022-05-30
DOI: https://doi.org/10.35940/ijsce.b3557.0512222
Citations: 0
Abstract
Age, BMI, and insulin level are among the factors that can be used to identify diabetes, a chronic disease. They are important because they are not constant and do not follow any specific pattern. Besides these factors, a few additional ones are examined in later sections of this study. After the data are cleaned, a support vector machine (SVM) algorithm, implemented with the pandas, NumPy, and scikit-learn libraries, is used to predict each patient's diagnosis. The output is one of two classes: DIABETIC or NON-DIABETIC. With the available dataset, the accuracy score was 77.5 percent on the training data and 80.5 percent on the test data.
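The workflow summarized in the abstract can be sketched roughly as follows. This is a minimal illustration, not the author's code: the data are synthetic (generated with scikit-learn's `make_classification`), and the feature count, linear kernel, 80/20 split, feature scaling, and the helper name `diagnose` are all assumptions. The accuracies it prints will therefore not match the figures reported above.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for a cleaned patient table (assumption: 8 numeric
# features such as age, BMI, and insulin level, plus a binary outcome).
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=5, random_state=0
)

# Hold out 20% of the records for testing (assumed split ratio).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Standardize the features, then fit an SVM (linear kernel is an assumption).
model = make_pipeline(StandardScaler(), SVC(kernel="linear"))
model.fit(X_train, y_train)

# Accuracy on the training data and on the held-out test data.
train_accuracy = accuracy_score(y_train, model.predict(X_train))
test_accuracy = accuracy_score(y_test, model.predict(X_test))
print(f"Training accuracy: {train_accuracy:.3f}")
print(f"Test accuracy: {test_accuracy:.3f}")

# Map the numeric class to the two output labels named in the abstract.
LABELS = {0: "NON-DIABETIC", 1: "DIABETIC"}


def diagnose(features):
    """Hypothetical helper: classify one record given as a 1-D feature vector."""
    row = np.asarray(features, dtype=float).reshape(1, -1)
    return LABELS[int(model.predict(row)[0])]


print(diagnose(X_test[0]))
```

Placing the scaler and the classifier in one pipeline means the scaling statistics are learned from the training split only, so the test accuracy is not influenced by the held-out records.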