{"title":"Performance analysis of k-Nearest Neighbors classification on Reuters news article datasets","authors":"Qian Yang","doi":"10.54254/2755-2721/55/20241444","DOIUrl":null,"url":null,"abstract":"The k-Nearest Neighbors (k-NN) algorithm is a fundamental and widely-used classification technique that has found applications in various domains, including text classification. In this paper, we present a comprehensive analysis of the k-NN classification algorithm applied to the Reuters news article dataset. Our study includes the data, implementation k-NN classification with different parameters, performance evaluation, and statistical analysis to draw meaningful conclusions. In a comprehensive analysis of the k-NN classification algorithm used for the Reuters news article data-set. A variety of metrics is used to evaluate the performance of the k-NN algorithm, such as accuracy, precision, recall, and F1 scores. These metrics provide a comprehensive view of how well the algorithm classifies news articles. Our statistical analysis reveals significant performance differences between various k-NN configurations. This can help researchers and practitioners make informed decisions when choosing the best parameters for their specific text classification tasks. In conclusion, our study provides valuable insights into the application of k-NN classification algorithms to textual data, highlighting the importance of parameter tuning and rigorous evaluation. These findings can guide practitioners to effectively use k-NN for text classification tasks and inspire further research in the field.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"11 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied and Computational Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54254/2755-2721/55/20241444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The k-Nearest Neighbors (k-NN) algorithm is a fundamental and widely-used classification technique that has found applications in various domains, including text classification. In this paper, we present a comprehensive analysis of the k-NN classification algorithm applied to the Reuters news article dataset. Our study includes the data, implementation k-NN classification with different parameters, performance evaluation, and statistical analysis to draw meaningful conclusions. In a comprehensive analysis of the k-NN classification algorithm used for the Reuters news article data-set. A variety of metrics is used to evaluate the performance of the k-NN algorithm, such as accuracy, precision, recall, and F1 scores. These metrics provide a comprehensive view of how well the algorithm classifies news articles. Our statistical analysis reveals significant performance differences between various k-NN configurations. This can help researchers and practitioners make informed decisions when choosing the best parameters for their specific text classification tasks. In conclusion, our study provides valuable insights into the application of k-NN classification algorithms to textual data, highlighting the importance of parameter tuning and rigorous evaluation. These findings can guide practitioners to effectively use k-NN for text classification tasks and inspire further research in the field.