{"title":"Use of Different Variants of Item Response Theory-Based Feature Selection Method for Text Categorization","authors":"Onder Coban","doi":"10.1109/ICTACSE50438.2022.10009653","DOIUrl":null,"url":null,"abstract":"In this study, we investigate the performance of the item response theory (IRT)-based feature selection (FS) approach on eight text datasets considering different feature sets and weighting schemes. We also employ its recently introduced variants in our evaluation. The results of our extensive experiments show that the IRT-based FS approach often reaches or improves the classification f-score by selecting a higher number of features compared to their well-known peers. Recently introduced variants, on the other hand, often fall behind the IRT1 and IRT2 for the task of text categorization.","PeriodicalId":301767,"journal":{"name":"2022 International Conference on Theoretical and Applied Computer Science and Engineering (ICTASCE)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Theoretical and Applied Computer Science and Engineering (ICTASCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTACSE50438.2022.10009653","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this study, we investigate the performance of the item response theory (IRT)-based feature selection (FS) approach on eight text datasets considering different feature sets and weighting schemes. We also employ its recently introduced variants in our evaluation. The results of our extensive experiments show that the IRT-based FS approach often reaches or improves the classification f-score by selecting a higher number of features compared to their well-known peers. Recently introduced variants, on the other hand, often fall behind the IRT1 and IRT2 for the task of text categorization.