基于DENCLUE的原型选择改进多标签k近邻算法

Monia, Himanshu Suyal, Aditya Gupta
{"title":"基于DENCLUE的原型选择改进多标签k近邻算法","authors":"Monia, Himanshu Suyal, Aditya Gupta","doi":"10.1109/icrito51393.2021.9596427","DOIUrl":null,"url":null,"abstract":"Prototype Selection (PS) techniques can be implemented very efficiently to allow the nearest neighbor classification to be faster by using a simple method to preserve the most prominent data for the Multi-Label k-Nearest Neighbour (ML-kNN) training. The most often used algorithm for classifying multi-labeled data is ML-kNN data which is adopted through the use of the well-known kNN algorithm. The use of Prototype Selection often minimizes the accuracy, and hence the performance of the algorithm. To solve the problem of reducing the accuracy of the ML-kNN, we proposed a novel solution that reduces data by using the well-known clustering algorithm-DENCLUE. Data that belongs to one of the clusters is considered prominent, while data that does not belong to any of the clusters is considered noisy and will not be included in the training process. In terms of noisy data reduction, the findings demonstrate a significant improvement over the ML-kNN.","PeriodicalId":259978,"journal":{"name":"2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Improved Multi-label k-Nearest Neighbour Algorithm with Prototype Selection using DENCLUE\",\"authors\":\"Monia, Himanshu Suyal, Aditya Gupta\",\"doi\":\"10.1109/icrito51393.2021.9596427\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Prototype Selection (PS) techniques can be implemented very efficiently to allow the nearest neighbor classification to be faster by using a simple method to preserve the most prominent data for the Multi-Label k-Nearest Neighbour (ML-kNN) training. The most often used algorithm for classifying multi-labeled data is ML-kNN data which is adopted through the use of the well-known kNN algorithm. The use of Prototype Selection often minimizes the accuracy, and hence the performance of the algorithm. To solve the problem of reducing the accuracy of the ML-kNN, we proposed a novel solution that reduces data by using the well-known clustering algorithm-DENCLUE. Data that belongs to one of the clusters is considered prominent, while data that does not belong to any of the clusters is considered noisy and will not be included in the training process. In terms of noisy data reduction, the findings demonstrate a significant improvement over the ML-kNN.\",\"PeriodicalId\":259978,\"journal\":{\"name\":\"2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icrito51393.2021.9596427\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icrito51393.2021.9596427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

原型选择(PS)技术可以非常有效地实现,通过使用一种简单的方法来为多标签k-近邻(ML-kNN)训练保留最突出的数据,从而使最近邻分类更快。多标签数据分类最常用的算法是ML-kNN数据,它是通过使用众所周知的kNN算法来实现的。使用原型选择通常会降低算法的准确性,从而降低算法的性能。为了解决ML-kNN的精度降低问题,我们提出了一种新的解决方案,即使用著名的聚类算法denclue来减少数据。属于其中一个聚类的数据被认为是突出的,而不属于任何一个聚类的数据被认为是嘈杂的,不会被包括在训练过程中。在噪声数据减少方面,研究结果显示了ML-kNN的显著改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Improved Multi-label k-Nearest Neighbour Algorithm with Prototype Selection using DENCLUE
Prototype Selection (PS) techniques can be implemented very efficiently to allow the nearest neighbor classification to be faster by using a simple method to preserve the most prominent data for the Multi-Label k-Nearest Neighbour (ML-kNN) training. The most often used algorithm for classifying multi-labeled data is ML-kNN data which is adopted through the use of the well-known kNN algorithm. The use of Prototype Selection often minimizes the accuracy, and hence the performance of the algorithm. To solve the problem of reducing the accuracy of the ML-kNN, we proposed a novel solution that reduces data by using the well-known clustering algorithm-DENCLUE. Data that belongs to one of the clusters is considered prominent, while data that does not belong to any of the clusters is considered noisy and will not be included in the training process. In terms of noisy data reduction, the findings demonstrate a significant improvement over the ML-kNN.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信