Regional distance-based k-NN classification
Swe Swe Aung, I. Nagayama, S. Tamaki
2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), 2017-11-01
DOI: 10.1109/ICIIBMS.2017.8279719 (https://doi.org/10.1109/ICIIBMS.2017.8279719)
k-Nearest Neighbor (k-NN) is a simple and powerful approach to approximating real-valued or discrete-valued target functions. Many researchers have recently shown that k-NN achieves high prediction accuracy on a variety of real-world systems across many different types of datasets. However, k-NN is a lazy learning algorithm: it has to compare each observed instance against every stored training example. Moreover, the prediction accuracy of k-NN depends on the value of K; in our experiments, higher K values mostly yielded lower prediction accuracy. To address these issues, this paper pursues two goals: improving classification accuracy by introducing Regional Distance-based k-NN (RD-kNN), and reducing the processing time of k-NN by applying a multi-threading approach. For the experiments, we used the real datasets wine, iris, heart statlog, breast cancer, and breast tissue from the UCI Machine Learning Repository. Our test cases and simulations experimentally confirmed that the new approach, RD-kNN, performs better than classical k-NN.
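
For readers unfamiliar with the baseline, the following is a minimal sketch of classical k-NN only, not of RD-kNN, whose regional distance is not specified in the abstract. It assumes Euclidean distance and majority voting, and it spreads queries over a generic thread pool, which may differ from the authors' multi-threading scheme. The function names, the `workers` parameter, and the toy data are all hypothetical.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
import math


def knn_predict(train_X, train_y, query, k):
    """Classify one query by majority vote among its k nearest neighbors."""
    # Lazy learning: the query is compared against every stored example.
    # Assumption: Euclidean distance (the abstract does not name a metric).
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


def knn_predict_batch(train_X, train_y, queries, k, workers=4):
    """Classify several queries, one task per query, on a thread pool.

    Assumption: a generic per-query split, not the paper's scheme.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(
            pool.map(lambda q: knn_predict(train_X, train_y, q, k), queries)
        )


# Hypothetical toy data: two "a" points near (1, 1), four "b" points near (5, 5).
train_X = [(1.0, 1.0), (1.2, 0.8), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1), (5.1, 5.2)]
train_y = ["a", "a", "b", "b", "b", "b"]
query = (1.1, 1.0)

for k in (1, 3, 6):
    print(k, knn_predict(train_X, train_y, query, k))
```

On this toy data the query sits next to the two "a" points, so K = 1 and K = 3 return "a", while K = 6 draws in all four distant "b" points and returns "b". This is one simple way a large K can lower accuracy. In standard CPython the global interpreter lock limits how much a thread pool speeds up pure-Python distance loops, so the batch function illustrates the structure rather than a measured speedup.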