{"title":"改进分类性能的新模糊k近邻算法","authors":"Hassan I. Abdalla , Ali A. Amer , Mohammad Nassef","doi":"10.1016/j.future.2025.108139","DOIUrl":null,"url":null,"abstract":"<div><div>In fuzzy k-nearest neighbor, smooth class boundaries are provided by each instance’s fuzzy degree of membership. However, there are additional costs associated with calculating the memberships due to memory limitations and runtime overhead. Furthermore, in the presence of class imbalance and outliers, the effectiveness and efficiency of the most advanced fuzzy kNNs continue to decline. Thus, new fuzzy kNNs with straightforward designs are developed in this study to substantially lessen the influence of these problems and improve overall performance. The local mean vectors with the single linkage and the cumulative means of neighbors are combined, establishing these models, which are referred to as LMSL-FkNN and CMDW-FkNN, respectively. A comprehensive evaluation study spanning five experimental stages is carried out against six cutting-edge kNN competitors utilizing fifty-four real-world (balanced, imbalanced, noisy, and time series) datasets in order to illustrate the competitiveness of the established models. With CMDW-FkNN comfortably dominating the competition across the vast majority of datasets (specifically UCI, highly-Imbalanced, and Time Series datasets), the results supported by statistical tests, across three assessment metrics-accuracy, F-measure, and ROC-show that both models have significantly more promise than their rivals.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"176 ","pages":"Article 108139"},"PeriodicalIF":6.2000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"New fuzzy K-nearest neighbor algorithms for classification performance improvement\",\"authors\":\"Hassan I. Abdalla , Ali A. Amer , Mohammad Nassef\",\"doi\":\"10.1016/j.future.2025.108139\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In fuzzy k-nearest neighbor, smooth class boundaries are provided by each instance’s fuzzy degree of membership. However, there are additional costs associated with calculating the memberships due to memory limitations and runtime overhead. Furthermore, in the presence of class imbalance and outliers, the effectiveness and efficiency of the most advanced fuzzy kNNs continue to decline. Thus, new fuzzy kNNs with straightforward designs are developed in this study to substantially lessen the influence of these problems and improve overall performance. The local mean vectors with the single linkage and the cumulative means of neighbors are combined, establishing these models, which are referred to as LMSL-FkNN and CMDW-FkNN, respectively. A comprehensive evaluation study spanning five experimental stages is carried out against six cutting-edge kNN competitors utilizing fifty-four real-world (balanced, imbalanced, noisy, and time series) datasets in order to illustrate the competitiveness of the established models. With CMDW-FkNN comfortably dominating the competition across the vast majority of datasets (specifically UCI, highly-Imbalanced, and Time Series datasets), the results supported by statistical tests, across three assessment metrics-accuracy, F-measure, and ROC-show that both models have significantly more promise than their rivals.</div></div>\",\"PeriodicalId\":55132,\"journal\":{\"name\":\"Future Generation Computer Systems-The International Journal of Escience\",\"volume\":\"176 \",\"pages\":\"Article 108139\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Future Generation Computer Systems-The International Journal of Escience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167739X25004339\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25004339","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
New fuzzy K-nearest neighbor algorithms for classification performance improvement
In fuzzy k-nearest neighbor, smooth class boundaries are provided by each instance’s fuzzy degree of membership. However, there are additional costs associated with calculating the memberships due to memory limitations and runtime overhead. Furthermore, in the presence of class imbalance and outliers, the effectiveness and efficiency of the most advanced fuzzy kNNs continue to decline. Thus, new fuzzy kNNs with straightforward designs are developed in this study to substantially lessen the influence of these problems and improve overall performance. The local mean vectors with the single linkage and the cumulative means of neighbors are combined, establishing these models, which are referred to as LMSL-FkNN and CMDW-FkNN, respectively. A comprehensive evaluation study spanning five experimental stages is carried out against six cutting-edge kNN competitors utilizing fifty-four real-world (balanced, imbalanced, noisy, and time series) datasets in order to illustrate the competitiveness of the established models. With CMDW-FkNN comfortably dominating the competition across the vast majority of datasets (specifically UCI, highly-Imbalanced, and Time Series datasets), the results supported by statistical tests, across three assessment metrics-accuracy, F-measure, and ROC-show that both models have significantly more promise than their rivals.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.