{"title":"High dimensional document classification using novel similarity function","authors":"K. Kumar, R. Srinivasan, Elijah Blessing Singh","doi":"10.1145/3330431.3330462","DOIUrl":null,"url":null,"abstract":"Document dimensionality is a major concern and worrying factor when high dimensionality documents are used for classification. Reducing the dimensionality can have both positive and negative effects. If dimensionality reduction is not appropriate then the classification performed using the reduced dimensionality documents may not give good classification results. Our previous research was focused on addressing dimensionality reduction using novel similarity function, but it did not address text classification. This paper addresses the classification task performed by applying the proposed similarity function. Experiment results prove the classifier performance with dimensionality reduction is better to the performance without dimensionality reduction.","PeriodicalId":196960,"journal":{"name":"Proceedings of the 5th International Conference on Engineering and MIS","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Engineering and MIS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3330431.3330462","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Document dimensionality is a major concern and worrying factor when high dimensionality documents are used for classification. Reducing the dimensionality can have both positive and negative effects. If dimensionality reduction is not appropriate then the classification performed using the reduced dimensionality documents may not give good classification results. Our previous research was focused on addressing dimensionality reduction using novel similarity function, but it did not address text classification. This paper addresses the classification task performed by applying the proposed similarity function. Experiment results prove the classifier performance with dimensionality reduction is better to the performance without dimensionality reduction.