Hybrid Multistage Fuzzy Clustering System for Medical Data Classification

Maryam Abdullah, Fawaz S. Al-Anzi, S. Al-Sharhan
{"title":"Hybrid Multistage Fuzzy Clustering System for Medical Data Classification","authors":"Maryam Abdullah, Fawaz S. Al-Anzi, S. Al-Sharhan","doi":"10.29007/B9QT","DOIUrl":null,"url":null,"abstract":"Due to the rapid development in technology nowadays, massive amount of data are available. In medicine, decision making is entirely based on the hidden information in these massive data. For that reason, data mining and machine learning technologies provide powerful tools for knowledge discovery within data. Two main techniques are used interchangeably: clustering and classification. In machine learning, clustering is an unsupervised learning technique while classification is a supervised learning method. These techniques are capable of extracting useful patterns and information which aid the process of data analysis and clinical decisions. This research presents a recent study of these techniques in the medical field during the past five years. Moreover, this paper proposes a hybrid multistage fuzzy clustering system applied to medical data classification. In the proposed system, two fuzzy clustering algorithms specifically FCM and GK were initially employed to obtain the membership values. These weights are then used in the second stage of the system as additional informative features to improve the classification process completed by SVM algorithm. Wisconsin Breast Cancer dataset, real-world application, obtained from UCI were used in the experiments. The results of the experiments show that the additional weights further improve the classification accuracy with 99.06% and 100% sensitivity.","PeriodicalId":383579,"journal":{"name":"2018 International Conference on Computing Sciences and Engineering (ICCSE)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Computing Sciences and Engineering (ICCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29007/B9QT","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Due to the rapid development in technology nowadays, massive amount of data are available. In medicine, decision making is entirely based on the hidden information in these massive data. For that reason, data mining and machine learning technologies provide powerful tools for knowledge discovery within data. Two main techniques are used interchangeably: clustering and classification. In machine learning, clustering is an unsupervised learning technique while classification is a supervised learning method. These techniques are capable of extracting useful patterns and information which aid the process of data analysis and clinical decisions. This research presents a recent study of these techniques in the medical field during the past five years. Moreover, this paper proposes a hybrid multistage fuzzy clustering system applied to medical data classification. In the proposed system, two fuzzy clustering algorithms specifically FCM and GK were initially employed to obtain the membership values. These weights are then used in the second stage of the system as additional informative features to improve the classification process completed by SVM algorithm. Wisconsin Breast Cancer dataset, real-world application, obtained from UCI were used in the experiments. The results of the experiments show that the additional weights further improve the classification accuracy with 99.06% and 100% sensitivity.
混合多级模糊聚类系统在医疗数据分类中的应用
由于当今科技的飞速发展,产生了海量的数据。在医学上,决策完全是基于这些海量数据中隐藏的信息。因此,数据挖掘和机器学习技术为数据中的知识发现提供了强大的工具。两种主要技术可以互换使用:聚类和分类。在机器学习中,聚类是一种无监督学习技术,而分类是一种有监督学习方法。这些技术能够提取有用的模式和信息,帮助数据分析和临床决策的过程。本研究介绍了在过去五年中这些技术在医学领域的最新研究。此外,本文还提出了一种应用于医疗数据分类的混合多级模糊聚类系统。在该系统中,首先采用FCM和GK两种模糊聚类算法来获取隶属度值。然后在系统的第二阶段使用这些权重作为附加的信息特征来改进SVM算法完成的分类过程。威斯康星乳腺癌数据集,实际应用,从UCI获得用于实验。实验结果表明,增加的权重进一步提高了分类准确率,达到99.06%,灵敏度达到100%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信