基于级联聚类和分类的糖尿病预测新方法

2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT'12) Pub Date : 2012-07-26 DOI:10.1109/ICCCNT.2012.6396069

P. Hemant, T. Pushpavathi

{"title":"基于级联聚类和分类的糖尿病预测新方法","authors":"P. Hemant, T. Pushpavathi","doi":"10.1109/ICCCNT.2012.6396069","DOIUrl":null,"url":null,"abstract":"Knowledge of incidence and prevalence of a disease is vital in Community Medicine to control a disease. It is important in Internal Medicine for clinical diagnosis and presumptive treatment on a probability model. Prevalence informs the total case load at a given time. Incidence yields a pointer to extent of attention required and choice of measures. Initially K-means clustering is used to group the disease related data into clusters and assigns classes to clusters. Subsequently multiple different classification algorithms are trained on the result set to build the final classifier model based on K-fold cross validation method. This methodology is evaluated using 768 raw diabetes data obtained from a city hospital. The best accuracy for the given dataset is achieved in bagging algorithm compared to other classifiers. The proposed approach helps doctors in their diagnosis decisions and also in their treatment planning procedures for different categories.","PeriodicalId":364589,"journal":{"name":"2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT'12)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":"{\"title\":\"A novel approach to predict diabetes by Cascading Clustering and Classification\",\"authors\":\"P. Hemant, T. Pushpavathi\",\"doi\":\"10.1109/ICCCNT.2012.6396069\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Knowledge of incidence and prevalence of a disease is vital in Community Medicine to control a disease. It is important in Internal Medicine for clinical diagnosis and presumptive treatment on a probability model. Prevalence informs the total case load at a given time. Incidence yields a pointer to extent of attention required and choice of measures. Initially K-means clustering is used to group the disease related data into clusters and assigns classes to clusters. Subsequently multiple different classification algorithms are trained on the result set to build the final classifier model based on K-fold cross validation method. This methodology is evaluated using 768 raw diabetes data obtained from a city hospital. The best accuracy for the given dataset is achieved in bagging algorithm compared to other classifiers. The proposed approach helps doctors in their diagnosis decisions and also in their treatment planning procedures for different categories.\",\"PeriodicalId\":364589,\"journal\":{\"name\":\"2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT'12)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"23\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT'12)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCNT.2012.6396069\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT'12)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCNT.2012.6396069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 23

摘要

了解疾病的发病率和流行程度对社区医学控制疾病至关重要。概率模型对内科临床诊断和推测治疗具有重要意义。流行率反映了某一特定时间的总病例量。发生率表明需要注意的程度和措施的选择。最初使用K-means聚类将疾病相关数据分组，并为聚类分配类。然后在结果集上训练多个不同的分类算法，建立基于K-fold交叉验证方法的最终分类器模型。使用从一家城市医院获得的768份原始糖尿病数据对该方法进行了评估。与其他分类器相比，套袋算法对给定数据集的准确率最高。提出的方法有助于医生在他们的诊断决策，也在他们的治疗计划程序的不同类别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A novel approach to predict diabetes by Cascading Clustering and Classification

Knowledge of incidence and prevalence of a disease is vital in Community Medicine to control a disease. It is important in Internal Medicine for clinical diagnosis and presumptive treatment on a probability model. Prevalence informs the total case load at a given time. Incidence yields a pointer to extent of attention required and choice of measures. Initially K-means clustering is used to group the disease related data into clusters and assigns classes to clusters. Subsequently multiple different classification algorithms are trained on the result set to build the final classifier model based on K-fold cross validation method. This methodology is evaluated using 768 raw diabetes data obtained from a city hospital. The best accuracy for the given dataset is achieved in bagging algorithm compared to other classifiers. The proposed approach helps doctors in their diagnosis decisions and also in their treatment planning procedures for different categories.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT'12)

自引率

0.00%

发文量