AN ENHANCED CLASSIFICATION SYSTEM BASED ON KERNEL PRINCIPAL COMPONENT ANALYSIS AND DATA COMPLEXITY MEASURES

IF 0.3 Q4 MULTIDISCIPLINARY SCIENCES
Fatih Sağlam, E. Dünder, M. Cengiz
{"title":"AN ENHANCED CLASSIFICATION SYSTEM BASED ON KERNEL PRINCIPAL COMPONENT ANALYSIS AND DATA COMPLEXITY MEASURES","authors":"Fatih Sağlam, E. Dünder, M. Cengiz","doi":"10.46939/j.sci.arts-23.2-a12","DOIUrl":null,"url":null,"abstract":"Principal component analysis is commonly used as a pre-step before employing a classifier to avoid the negative effect of the dimensionality and multicollinearity. The performance of a classifier is severely affected by the deviations from the linearity of the data structure and noisy samples. In this paper, we propose a new classification system that overcomes the drawback of these crucial problems, simultaneously. Our proposal is relying on the kernel principal component analysis with a proper parameter selection approach with data complexity measures. According to the empirical results, F1, T2 and T3 in AUC, T3 in GMEAN and T2 and T3 in MCC performed better than classical and other complexity measures. Comparison of classifiers showed that Radial SVM performs better in AUC, and KNN performs better in GMEAN and MCC using KPCA with complexity measures. As a result, our proposed system produces better results in various classification algorithms with respect to classical approach.","PeriodicalId":54169,"journal":{"name":"Journal of Science and Arts","volume":" ","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Science and Arts","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46939/j.sci.arts-23.2-a12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Principal component analysis is commonly used as a pre-step before employing a classifier to avoid the negative effect of the dimensionality and multicollinearity. The performance of a classifier is severely affected by the deviations from the linearity of the data structure and noisy samples. In this paper, we propose a new classification system that overcomes the drawback of these crucial problems, simultaneously. Our proposal is relying on the kernel principal component analysis with a proper parameter selection approach with data complexity measures. According to the empirical results, F1, T2 and T3 in AUC, T3 in GMEAN and T2 and T3 in MCC performed better than classical and other complexity measures. Comparison of classifiers showed that Radial SVM performs better in AUC, and KNN performs better in GMEAN and MCC using KPCA with complexity measures. As a result, our proposed system produces better results in various classification algorithms with respect to classical approach.
基于核主成分分析和数据复杂度度量的增强型分类系统
在使用分类器之前,通常使用主成分分析作为前置步骤,以避免维数和多重共线性的负面影响。分类器的性能受到数据结构的线性偏差和噪声样本的严重影响。在本文中,我们提出了一个新的分类系统,同时克服了这些关键问题的缺点。我们的建议是依靠核主成分分析,采用适当的参数选择方法和数据复杂性度量。从实证结果来看,AUC的F1、T2和T3、GMEAN的T3和MCC的T2和T3均优于经典和其他复杂性测度。两种分类器的比较表明,径向支持向量机(Radial SVM)在AUC中表现较好,KNN在GMEAN和MCC中表现较好。结果表明,相对于经典方法,我们提出的系统在各种分类算法中产生了更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Science and Arts
Journal of Science and Arts MULTIDISCIPLINARY SCIENCES-
自引率
25.00%
发文量
57
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信