Rajib Kumar Halder, Mohammed Nasir Uddin, Md Ashraf Uddin
{"title":"A novel region based neighbors searching classification algorithm for big data","authors":"Rajib Kumar Halder, Mohammed Nasir Uddin, Md Ashraf Uddin","doi":"10.1016/j.ijcce.2025.05.002","DOIUrl":null,"url":null,"abstract":"<div><div>The K-Nearest Neighbors (KNN) algorithm remains a cornerstone of machine learning due to its intuitive design and effectiveness in classification tasks. However, its performance often suffers from critical limitations, such as sensitivity to the choice of the parameter K and an inability to effectively capture complex relationships among neighboring instances. To overcome these challenges, we propose the <strong>Region-Based Neighbors Searching Classification Algorithm (RNSCA)</strong>—a novel, adaptive framework that significantly enhances the scalability, flexibility, and accuracy of traditional KNN, especially in high-dimensional and large-scale datasets. RNSCA leverages dynamic, region-based partitioning for more focused and efficient neighbor searches and incorporates a weighted activation function to prioritize the most relevant data points during classification. Additionally, ensemble learning techniques are integrated to strengthen model robustness and improve generalization. The proposed algorithm is extensively validated on benchmark datasets including Iris, Crop Recommendation, Breast Cancer, Diabetes, and Chronic Kidney Disease (CKD). Experimental results consistently demonstrate RNSCA’s superior performance in modeling nuanced local structures and mitigating the core limitations of conventional KNN. This research presents a compelling advancement in classification algorithms, with practical implications across domains such as healthcare, agriculture, and environmental intelligence.</div></div>","PeriodicalId":100694,"journal":{"name":"International Journal of Cognitive Computing in Engineering","volume":"6 ","pages":"Pages 516-536"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Computing in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666307425000269","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The K-Nearest Neighbors (KNN) algorithm remains a cornerstone of machine learning due to its intuitive design and effectiveness in classification tasks. However, its performance often suffers from critical limitations, such as sensitivity to the choice of the parameter K and an inability to effectively capture complex relationships among neighboring instances. To overcome these challenges, we propose the Region-Based Neighbors Searching Classification Algorithm (RNSCA)—a novel, adaptive framework that significantly enhances the scalability, flexibility, and accuracy of traditional KNN, especially in high-dimensional and large-scale datasets. RNSCA leverages dynamic, region-based partitioning for more focused and efficient neighbor searches and incorporates a weighted activation function to prioritize the most relevant data points during classification. Additionally, ensemble learning techniques are integrated to strengthen model robustness and improve generalization. The proposed algorithm is extensively validated on benchmark datasets including Iris, Crop Recommendation, Breast Cancer, Diabetes, and Chronic Kidney Disease (CKD). Experimental results consistently demonstrate RNSCA’s superior performance in modeling nuanced local structures and mitigating the core limitations of conventional KNN. This research presents a compelling advancement in classification algorithms, with practical implications across domains such as healthcare, agriculture, and environmental intelligence.