A comparative study to classify big data using fuzzy techniques

2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA) Pub Date : 2016-12-01 DOI:10.1109/ICEDSA.2016.7818508

Soha Safwat Labib

{"title":"A comparative study to classify big data using fuzzy techniques","authors":"Soha Safwat Labib","doi":"10.1109/ICEDSA.2016.7818508","DOIUrl":null,"url":null,"abstract":"It is very difficult to implement an efficient analysis by using the customary techniques currently available; this is due to the fact that the data size has had a huge increase. Many complications were faced because of the numerous characteristics of big data; some of them include complexity, value, variability, variety, velocity, and volume. The objective of this paper is to implement classification techniques using the map reduce framework using fuzzy and crisp methods, also to arrange for a study that can compare and contrast the outcomes of the suggested systems against the methods appraised in the documented works. For this research the applied method for the fuzzy technique is the fuzzy k-nearest neighbor, and for the non-fuzzy techniques both the support vector machine and the k-nearest neighbor are used. The use of the map reduce paradigm is applied to be able to process big data. We also implemented an integrated system using the Support Vector Machine with the fuzzy soft label and Gaussian fuzzy membership. Results show that fuzzy k-nearest neighbor classifier gives higher accuracy but it takes a lot of time in classification compared to the other techniques. But the outcomes when projected onto other data sets demonstrate that the suggested method that used fuzzy logic in the Reducer function gives higher accuracy and lower time than the new suggested methods and the methods revised in the paper.","PeriodicalId":247318,"journal":{"name":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEDSA.2016.7818508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

It is very difficult to implement an efficient analysis by using the customary techniques currently available; this is due to the fact that the data size has had a huge increase. Many complications were faced because of the numerous characteristics of big data; some of them include complexity, value, variability, variety, velocity, and volume. The objective of this paper is to implement classification techniques using the map reduce framework using fuzzy and crisp methods, also to arrange for a study that can compare and contrast the outcomes of the suggested systems against the methods appraised in the documented works. For this research the applied method for the fuzzy technique is the fuzzy k-nearest neighbor, and for the non-fuzzy techniques both the support vector machine and the k-nearest neighbor are used. The use of the map reduce paradigm is applied to be able to process big data. We also implemented an integrated system using the Support Vector Machine with the fuzzy soft label and Gaussian fuzzy membership. Results show that fuzzy k-nearest neighbor classifier gives higher accuracy but it takes a lot of time in classification compared to the other techniques. But the outcomes when projected onto other data sets demonstrate that the suggested method that used fuzzy logic in the Reducer function gives higher accuracy and lower time than the new suggested methods and the methods revised in the paper.

查看原文本刊更多论文

利用模糊技术对大数据进行分类的比较研究

使用现有的习惯技术很难实现有效的分析;这是由于数据大小有了巨大的增长。由于大数据的众多特征，面临着许多复杂的问题;其中包括复杂性、价值、可变性、多样性、速度和数量。本文的目的是利用模糊和清晰的方法使用地图约简框架来实现分类技术，并安排一项研究，可以将建议系统的结果与文献中评价的方法进行比较和对比。在本研究中，模糊技术的应用方法是模糊k近邻，非模糊技术的应用方法是支持向量机和k近邻。使用map reduce范式来处理大数据。我们还利用模糊软标签和高斯模糊隶属度的支持向量机实现了一个集成系统。结果表明，模糊k近邻分类器具有较高的分类精度，但与其他分类方法相比，其分类时间较长。但在其他数据集上的投影结果表明，在Reducer函数中使用模糊逻辑的建议方法比新的建议方法和本文修正的方法具有更高的精度和更短的时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)

自引率

0.00%

发文量