{"title":"Gene Function Classification Using Fuzzy K-Nearest Neighbor Approach","authors":"Dan Li, J. Deogun, Kefei Wang","doi":"10.1109/GrC.2007.99","DOIUrl":null,"url":null,"abstract":"Prediction of gene function is a classification problem. Given its simplicity and relatively high accuracy, K-Nearest Neighbor (KNN) classification has become a popular choice for many real life applications. However, traditional KNN approach has two drawbacks. First, it cannot identify classes that do not exist in the training data sets. Second, it treats all K neighbors in a similar way without consideration of the distance differences between the test instance and its neighbors. In this paper, exploiting the potential of fuzzy set theory to handle uncertainty in data sets, we develop a fuzzy KNN approach for gene function classification. Experiments show that integrating fuzzy set theory into original KNN approach improves the overall performance of the classification model.","PeriodicalId":259430,"journal":{"name":"2007 IEEE International Conference on Granular Computing (GRC 2007)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Conference on Granular Computing (GRC 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GrC.2007.99","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
Prediction of gene function is a classification problem. Given its simplicity and relatively high accuracy, K-Nearest Neighbor (KNN) classification has become a popular choice for many real life applications. However, traditional KNN approach has two drawbacks. First, it cannot identify classes that do not exist in the training data sets. Second, it treats all K neighbors in a similar way without consideration of the distance differences between the test instance and its neighbors. In this paper, exploiting the potential of fuzzy set theory to handle uncertainty in data sets, we develop a fuzzy KNN approach for gene function classification. Experiments show that integrating fuzzy set theory into original KNN approach improves the overall performance of the classification model.