用鲁棒聚类方法识别线性函数关系模型中的多个异常值

IF 0.7 4区 综合性期刊 Q3 MULTIDISCIPLINARY SCIENCES
Adilah Abdul Ghapor, Y. Zubairi, Sayed Md. Al Mamun, Siti Fatimah Hassan, E. Aruchunan, N. A. Mokhtar
{"title":"用鲁棒聚类方法识别线性函数关系模型中的多个异常值","authors":"Adilah Abdul Ghapor, Y. Zubairi, Sayed Md. Al Mamun, Siti Fatimah Hassan, E. Aruchunan, N. A. Mokhtar","doi":"10.17576/jsm-2023-5205-20","DOIUrl":null,"url":null,"abstract":"Outliers are some observation points outside the usual pattern of the other observations. It is essential to detect outliers as anomalous observations can affect the inference made in the analysis. In this study, we propose an efficient clustering procedure to identify multiple outliers in the linear functional relationship model using the single linkage algorithm with the Euclidean distance as the similarity measure. A new robust cut-off point using the median and median absolute deviation for the tree heights to classify the potential outliers are proposed in this study. Experimental results from the simulation study suggest our proposed method is able to identify the presence of multiple outliers with very small probability of swamping and masking. Application in real data also shows that the proposed clustering method for this linear functional relationship model successfully detects the outliers, thus suggesting the method's practicality in real-world problems.","PeriodicalId":21366,"journal":{"name":"Sains Malaysiana","volume":" ","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying Multiple Outliers in Linear Functional Relationship Model using a Robust Clustering Method\",\"authors\":\"Adilah Abdul Ghapor, Y. Zubairi, Sayed Md. Al Mamun, Siti Fatimah Hassan, E. Aruchunan, N. A. Mokhtar\",\"doi\":\"10.17576/jsm-2023-5205-20\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Outliers are some observation points outside the usual pattern of the other observations. It is essential to detect outliers as anomalous observations can affect the inference made in the analysis. In this study, we propose an efficient clustering procedure to identify multiple outliers in the linear functional relationship model using the single linkage algorithm with the Euclidean distance as the similarity measure. A new robust cut-off point using the median and median absolute deviation for the tree heights to classify the potential outliers are proposed in this study. Experimental results from the simulation study suggest our proposed method is able to identify the presence of multiple outliers with very small probability of swamping and masking. Application in real data also shows that the proposed clustering method for this linear functional relationship model successfully detects the outliers, thus suggesting the method's practicality in real-world problems.\",\"PeriodicalId\":21366,\"journal\":{\"name\":\"Sains Malaysiana\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sains Malaysiana\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.17576/jsm-2023-5205-20\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sains Malaysiana","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.17576/jsm-2023-5205-20","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

异常值是在其他观测的通常模式之外的一些观测点。检测异常值是至关重要的,因为异常观测可能会影响分析中的推断。在这项研究中,我们提出了一种有效的聚类程序,使用以欧几里得距离为相似度量的单链算法来识别线性函数关系模型中的多个异常值。本研究提出了一种新的鲁棒截止点,使用树高的中值和中值绝对偏差来对潜在的异常值进行分类。模拟研究的实验结果表明,我们提出的方法能够识别多个异常值的存在,而淹没和掩蔽的概率非常小。在实际数据中的应用也表明,该线性函数关系模型的聚类方法成功地检测到了异常值,从而表明了该方法在实际问题中的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identifying Multiple Outliers in Linear Functional Relationship Model using a Robust Clustering Method
Outliers are some observation points outside the usual pattern of the other observations. It is essential to detect outliers as anomalous observations can affect the inference made in the analysis. In this study, we propose an efficient clustering procedure to identify multiple outliers in the linear functional relationship model using the single linkage algorithm with the Euclidean distance as the similarity measure. A new robust cut-off point using the median and median absolute deviation for the tree heights to classify the potential outliers are proposed in this study. Experimental results from the simulation study suggest our proposed method is able to identify the presence of multiple outliers with very small probability of swamping and masking. Application in real data also shows that the proposed clustering method for this linear functional relationship model successfully detects the outliers, thus suggesting the method's practicality in real-world problems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Sains Malaysiana
Sains Malaysiana MULTIDISCIPLINARY SCIENCES-
CiteScore
1.60
自引率
12.50%
发文量
196
审稿时长
3-6 weeks
期刊介绍: Sains Malaysiana is a refereed journal committed to the advancement of scholarly knowledge and research findings of the several branches of science and technology. It contains articles on Earth Sciences, Health Sciences, Life Sciences, Mathematical Sciences and Physical Sciences. The journal publishes articles, reviews, and research notes whose content and approach are of interest to a wide range of scholars. Sains Malaysiana is published by the UKM Press an its autonomous Editorial Board are drawn from the Faculty of Science and Technology, Universiti Kebangsaan Malaysia. In addition, distinguished scholars from local and foreign universities are appointed to serve as advisory board members and referees.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信