A comparative study to classify big data using fuzzy techniques

Soha Safwat Labib
{"title":"A comparative study to classify big data using fuzzy techniques","authors":"Soha Safwat Labib","doi":"10.1109/ICEDSA.2016.7818508","DOIUrl":null,"url":null,"abstract":"It is very difficult to implement an efficient analysis by using the customary techniques currently available; this is due to the fact that the data size has had a huge increase. Many complications were faced because of the numerous characteristics of big data; some of them include complexity, value, variability, variety, velocity, and volume. The objective of this paper is to implement classification techniques using the map reduce framework using fuzzy and crisp methods, also to arrange for a study that can compare and contrast the outcomes of the suggested systems against the methods appraised in the documented works. For this research the applied method for the fuzzy technique is the fuzzy k-nearest neighbor, and for the non-fuzzy techniques both the support vector machine and the k-nearest neighbor are used. The use of the map reduce paradigm is applied to be able to process big data. We also implemented an integrated system using the Support Vector Machine with the fuzzy soft label and Gaussian fuzzy membership. Results show that fuzzy k-nearest neighbor classifier gives higher accuracy but it takes a lot of time in classification compared to the other techniques. But the outcomes when projected onto other data sets demonstrate that the suggested method that used fuzzy logic in the Reducer function gives higher accuracy and lower time than the new suggested methods and the methods revised in the paper.","PeriodicalId":247318,"journal":{"name":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEDSA.2016.7818508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

It is very difficult to implement an efficient analysis by using the customary techniques currently available; this is due to the fact that the data size has had a huge increase. Many complications were faced because of the numerous characteristics of big data; some of them include complexity, value, variability, variety, velocity, and volume. The objective of this paper is to implement classification techniques using the map reduce framework using fuzzy and crisp methods, also to arrange for a study that can compare and contrast the outcomes of the suggested systems against the methods appraised in the documented works. For this research the applied method for the fuzzy technique is the fuzzy k-nearest neighbor, and for the non-fuzzy techniques both the support vector machine and the k-nearest neighbor are used. The use of the map reduce paradigm is applied to be able to process big data. We also implemented an integrated system using the Support Vector Machine with the fuzzy soft label and Gaussian fuzzy membership. Results show that fuzzy k-nearest neighbor classifier gives higher accuracy but it takes a lot of time in classification compared to the other techniques. But the outcomes when projected onto other data sets demonstrate that the suggested method that used fuzzy logic in the Reducer function gives higher accuracy and lower time than the new suggested methods and the methods revised in the paper.
利用模糊技术对大数据进行分类的比较研究
使用现有的习惯技术很难实现有效的分析;这是由于数据大小有了巨大的增长。由于大数据的众多特征,面临着许多复杂的问题;其中包括复杂性、价值、可变性、多样性、速度和数量。本文的目的是利用模糊和清晰的方法使用地图约简框架来实现分类技术,并安排一项研究,可以将建议系统的结果与文献中评价的方法进行比较和对比。在本研究中,模糊技术的应用方法是模糊k近邻,非模糊技术的应用方法是支持向量机和k近邻。使用map reduce范式来处理大数据。我们还利用模糊软标签和高斯模糊隶属度的支持向量机实现了一个集成系统。结果表明,模糊k近邻分类器具有较高的分类精度,但与其他分类方法相比,其分类时间较长。但在其他数据集上的投影结果表明,在Reducer函数中使用模糊逻辑的建议方法比新的建议方法和本文修正的方法具有更高的精度和更短的时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信