A novel attribute weighting method with genetic algorithm for document classification
S. Ay, Yavuz Selim Dogan, Seyfullah Alver, Cetin Kaya
2016 24th Signal Processing and Communication Application Conference (SIU), 2016-06-23
DOI: 10.1109/SIU.2016.7495943
Citations: 2
Abstract
Thanks to the proliferation of the Internet, large amounts of data are produced by both websites and individual users. Documents must be classified by content so that the needed information can be retrieved quickly and accurately from this data. One of the biggest difficulties in document classification systems is identifying the attributes that best represent the classes. In this research, a new attribute weighting method based on a Genetic Algorithm is presented for the document classification problem. The proposed method is tested on 450 documents from 6 different categories, collected from an online news portal. According to the experimental results, the proposed method achieves a success rate of 93%.
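The abstract does not specify how the weights are encoded, how fitness is measured, or which classifier is used, so the following is only a minimal sketch of the general idea of evolving attribute weights with a genetic algorithm. Everything in it is an assumption made for illustration: the nearest-centroid classifier, training accuracy as the fitness, truncation selection, one-point crossover, the function names, and the toy term-count data with its two category labels. It is not the authors' method.

```python
import random

def class_centroids(docs, labels):
    """Average term-count vector per class (assumed classifier model)."""
    sums, counts = {}, {}
    for doc, y in zip(docs, labels):
        acc = sums.setdefault(y, [0.0] * len(doc))
        for i, v in enumerate(doc):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}

def classify(doc, centroids, weights):
    """Pick the class whose centroid has the largest weighted dot product."""
    def score(label):
        return sum(w * d * c for w, d, c in zip(weights, doc, centroids[label]))
    return max(centroids, key=score)

def fitness(weights, docs, labels, centroids):
    """Fraction of documents classified correctly with these weights."""
    hits = sum(classify(d, centroids, weights) == y for d, y in zip(docs, labels))
    return hits / len(docs)

def evolve_weights(docs, labels, centroids, pop_size=20, generations=30,
                   mutation_rate=0.1, seed=0):
    """Evolve one weight in [0, 1] per attribute; returns the fittest vector."""
    rng = random.Random(seed)
    n = len(docs[0])
    population = [[rng.random() for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population,
                        key=lambda w: fitness(w, docs, labels, centroids),
                        reverse=True)
        parents = ranked[:pop_size // 2]      # truncation selection, keeps the best
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)         # one-point crossover
            child = a[:cut] + b[cut:]
            # Mutation: occasionally replace a gene with a fresh random weight.
            child = [rng.random() if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        population = parents + children
    return max(population, key=lambda w: fitness(w, docs, labels, centroids))

# Hypothetical toy data: rows are documents, columns are term counts.
docs = [[3, 0, 1, 2], [2, 1, 0, 2], [0, 3, 1, 2], [1, 2, 0, 2]]
labels = ["sports", "sports", "economy", "economy"]
centroids = class_centroids(docs, labels)
best = evolve_weights(docs, labels, centroids)
print(best, fitness(best, docs, labels, centroids))
```

In a realistic setting the fitness would be measured on held-out documents rather than on the training set, since accuracy on the same documents used to build the centroids tends to overstate how well the weights generalize.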