Hybrid Multistage Fuzzy Clustering System for Medical Data Classification

2018 International Conference on Computing Sciences and Engineering (ICCSE) Pub Date : 2018-04-10 DOI:10.29007/B9QT

Maryam Abdullah, Fawaz S. Al-Anzi, S. Al-Sharhan

{"title":"Hybrid Multistage Fuzzy Clustering System for Medical Data Classification","authors":"Maryam Abdullah, Fawaz S. Al-Anzi, S. Al-Sharhan","doi":"10.29007/B9QT","DOIUrl":null,"url":null,"abstract":"Due to the rapid development in technology nowadays, massive amount of data are available. In medicine, decision making is entirely based on the hidden information in these massive data. For that reason, data mining and machine learning technologies provide powerful tools for knowledge discovery within data. Two main techniques are used interchangeably: clustering and classification. In machine learning, clustering is an unsupervised learning technique while classification is a supervised learning method. These techniques are capable of extracting useful patterns and information which aid the process of data analysis and clinical decisions. This research presents a recent study of these techniques in the medical field during the past five years. Moreover, this paper proposes a hybrid multistage fuzzy clustering system applied to medical data classification. In the proposed system, two fuzzy clustering algorithms specifically FCM and GK were initially employed to obtain the membership values. These weights are then used in the second stage of the system as additional informative features to improve the classification process completed by SVM algorithm. Wisconsin Breast Cancer dataset, real-world application, obtained from UCI were used in the experiments. The results of the experiments show that the additional weights further improve the classification accuracy with 99.06% and 100% sensitivity.","PeriodicalId":383579,"journal":{"name":"2018 International Conference on Computing Sciences and Engineering (ICCSE)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Computing Sciences and Engineering (ICCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29007/B9QT","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Due to the rapid development in technology nowadays, massive amount of data are available. In medicine, decision making is entirely based on the hidden information in these massive data. For that reason, data mining and machine learning technologies provide powerful tools for knowledge discovery within data. Two main techniques are used interchangeably: clustering and classification. In machine learning, clustering is an unsupervised learning technique while classification is a supervised learning method. These techniques are capable of extracting useful patterns and information which aid the process of data analysis and clinical decisions. This research presents a recent study of these techniques in the medical field during the past five years. Moreover, this paper proposes a hybrid multistage fuzzy clustering system applied to medical data classification. In the proposed system, two fuzzy clustering algorithms specifically FCM and GK were initially employed to obtain the membership values. These weights are then used in the second stage of the system as additional informative features to improve the classification process completed by SVM algorithm. Wisconsin Breast Cancer dataset, real-world application, obtained from UCI were used in the experiments. The results of the experiments show that the additional weights further improve the classification accuracy with 99.06% and 100% sensitivity.

查看原文本刊更多论文

混合多级模糊聚类系统在医疗数据分类中的应用

由于当今科技的飞速发展，产生了海量的数据。在医学上，决策完全是基于这些海量数据中隐藏的信息。因此，数据挖掘和机器学习技术为数据中的知识发现提供了强大的工具。两种主要技术可以互换使用:聚类和分类。在机器学习中，聚类是一种无监督学习技术，而分类是一种有监督学习方法。这些技术能够提取有用的模式和信息，帮助数据分析和临床决策的过程。本研究介绍了在过去五年中这些技术在医学领域的最新研究。此外，本文还提出了一种应用于医疗数据分类的混合多级模糊聚类系统。在该系统中，首先采用FCM和GK两种模糊聚类算法来获取隶属度值。然后在系统的第二阶段使用这些权重作为附加的信息特征来改进SVM算法完成的分类过程。威斯康星乳腺癌数据集，实际应用，从UCI获得用于实验。实验结果表明，增加的权重进一步提高了分类准确率，达到99.06%，灵敏度达到100%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 International Conference on Computing Sciences and Engineering (ICCSE)

自引率

0.00%

发文量