Graph-Based Web Query Classification
Chunwei Xia, Xin Wang
2015 12th Web Information System and Application Conference (WISA), published 2015-09-11
DOI: 10.1109/WISA.2015.68 (https://doi.org/10.1109/WISA.2015.68)
Citations: 5
Abstract
Understanding the search intent that Web users express through their queries is essential for a search engine to return appropriate answers. Web query classification (QC) algorithms have been widely studied to improve accuracy and meet users' demands. Some QC algorithms convert queries into vectors and use an SVM or CRF model as the classifier. However, as the volume of data grows, the time these approaches consume increases significantly. In this paper, we propose a method that splits queries into words and converts the queries into a graph, and then adopts a linear equation as the classifier. Experimental results show that our method achieves similar accuracy to existing methods with higher efficiency. Our method decreases training time by 10% compared with the SVM algorithm, and it also outperforms the CRF model.
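The abstract gives no detail on how the graph is built or what form the linear classifier takes, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a weighted bipartite graph between word nodes and category nodes, and a linear score that sums normalised edge weights over the words in a query. The function names `build_graph` and `classify`, the whitespace tokenisation, and the toy training data are all hypothetical.

```python
from collections import defaultdict


def build_graph(labeled_queries):
    """Build a weighted bipartite word-category graph from (query, label) pairs.

    Assumption: an edge weight counts how often a word appears in queries
    of a given category. The paper's actual graph may differ.
    """
    edges = defaultdict(lambda: defaultdict(float))  # word -> category -> weight
    for query, label in labeled_queries:
        for word in query.lower().split():  # split the query into words
            edges[word][label] += 1.0
    return edges


def classify(edges, query):
    """Score each category as a linear sum over the query's word nodes."""
    scores = defaultdict(float)
    for word in query.lower().split():
        neighbours = edges.get(word, {})  # unseen words contribute nothing
        total = sum(neighbours.values())
        for label, weight in neighbours.items():
            scores[label] += weight / total  # normalised edge weight
    # Return the best-scoring category, or None if no word was seen in training.
    return max(scores, key=scores.get) if scores else None


# Hypothetical toy data, for illustration only.
training = [
    ("cheap flights to paris", "travel"),
    ("hotel deals in rome", "travel"),
    ("python list comprehension", "programming"),
    ("java string split", "programming"),
]
graph = build_graph(training)
prediction = classify(graph, "flights to rome")
unknown = classify(graph, "weather tomorrow")
```

In this sketch, training is a single counting pass over the queries, which is one plausible reason a graph plus a linear rule could train faster than an SVM or CRF; the abstract itself does not state the cause of the speed-up.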