Describing Fuzzy Membership Function and Detecting the Outlier by Using Five Number Summary of Data

Md. Farooq Hasan, Md. Abdus Sobhan
{"title":"Describing Fuzzy Membership Function and Detecting the Outlier by Using Five Number Summary of Data","authors":"Md. Farooq Hasan, Md. Abdus Sobhan","doi":"10.4236/ajcm.2020.103022","DOIUrl":null,"url":null,"abstract":"One of the most important activities in data science is defining a membership function in fuzzy system. Although there are few ways to describe membership function like artificial neural networks, genetic algorithms etc.; they are very complex and time consuming. On the other hand, the presence of outlier in a data set produces deceptive results in the modeling. So it is important to detect and eliminate them to prevent their negative effect on the modeling. This paper describes a new and simple way of constructing fuzzy membership function by using five-number summary of a data set. Five states membership function can be created in this new method. At the same time, if there is any outlier in the data set, it can be detected with the help of this method. Usually box plot is used to identify the outliers of a data set. So along with the new approach, the box plot has also been drawn so that it is understood that the results obtained in the new method are accurate. Several real life examples and their analysis have been discussed with graph to demonstrate the potential of the proposed method. The results obtained show that the proposed method has given good results. In the case of outlier, the proposed method and the box plot method have shown similar results. Primary advantage of this new procedure is that it is not as expensive as neural networks, and genetic algorithms.","PeriodicalId":64456,"journal":{"name":"美国计算数学期刊(英文)","volume":"10 1","pages":"410-424"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"美国计算数学期刊(英文)","FirstCategoryId":"1089","ListUrlMain":"https://doi.org/10.4236/ajcm.2020.103022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

One of the most important activities in data science is defining a membership function in fuzzy system. Although there are few ways to describe membership function like artificial neural networks, genetic algorithms etc.; they are very complex and time consuming. On the other hand, the presence of outlier in a data set produces deceptive results in the modeling. So it is important to detect and eliminate them to prevent their negative effect on the modeling. This paper describes a new and simple way of constructing fuzzy membership function by using five-number summary of a data set. Five states membership function can be created in this new method. At the same time, if there is any outlier in the data set, it can be detected with the help of this method. Usually box plot is used to identify the outliers of a data set. So along with the new approach, the box plot has also been drawn so that it is understood that the results obtained in the new method are accurate. Several real life examples and their analysis have been discussed with graph to demonstrate the potential of the proposed method. The results obtained show that the proposed method has given good results. In the case of outlier, the proposed method and the box plot method have shown similar results. Primary advantage of this new procedure is that it is not as expensive as neural networks, and genetic algorithms.
模糊隶属函数描述及数据五数汇总的离群值检测
数据科学中最重要的活动之一是定义模糊系统中的隶属函数。虽然描述隶属函数的方法很少,如人工神经网络、遗传算法等。;它们非常复杂且耗时。另一方面,数据集中异常值的存在会在建模中产生欺骗性的结果。因此,检测和消除它们以防止它们对建模的负面影响是很重要的。本文描述了一种利用数据集的五个数摘要构造模糊隶属函数的新的简单方法。这种新方法可以创建五态隶属函数。同时,如果数据集中有任何异常值,可以借助该方法进行检测。通常,方框图用于识别数据集的异常值。因此,除了新方法外,还绘制了箱形图,以便理解新方法中获得的结果是准确的。通过图形讨论了几个现实生活中的例子及其分析,证明了所提出方法的潜力。结果表明,该方法具有良好的效果。在异常值的情况下,所提出的方法和盒图方法显示出相似的结果。这种新程序的主要优点是它不像神经网络和遗传算法那样昂贵。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
348
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信