Joshua Adams, Henry Williams, J. Carter, G. Dozier
{"title":"Genetic Heuristic Development: Feature selection for author identification","authors":"Joshua Adams, Henry Williams, J. Carter, G. Dozier","doi":"10.1109/CIBIM.2013.6607911","DOIUrl":null,"url":null,"abstract":"Author identification is the process of recognizing an author based on a sample of text. Feature selection is the process of selecting the most salient features required for recognition. In many cases, this results in an increase in recognition accuracy. In this paper, we apply Genetic and Evolutionary Feature Selection with Machine Learning (GEFeSML) to author identification. We then introduce Genetic Heuristic Development (GHD), a process to improve the matching process. GHD uses subsets of features found by GEFeSML to create a high performing heuristic for feature selection. This technique successfully increases recognition accuracy while significantly reducing the number of features required for recognition.","PeriodicalId":286155,"journal":{"name":"2013 IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (CIBIM)","volume":"52 5","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (CIBIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBIM.2013.6607911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Author identification is the process of recognizing an author based on a sample of text. Feature selection is the process of selecting the most salient features required for recognition. In many cases, this results in an increase in recognition accuracy. In this paper, we apply Genetic and Evolutionary Feature Selection with Machine Learning (GEFeSML) to author identification. We then introduce Genetic Heuristic Development (GHD), a process to improve the matching process. GHD uses subsets of features found by GEFeSML to create a high performing heuristic for feature selection. This technique successfully increases recognition accuracy while significantly reducing the number of features required for recognition.