{"title":"Beyond Bag of Words: A New Model for XML Keyword Query","authors":"Xiping Liu, Changxuan Wan","doi":"10.1109/ICMECG.2014.59","DOIUrl":null,"url":null,"abstract":"Keyword search is an effective paradigm for information discovery and has been introduced recently to query XML documents. Effective keyword search of XML documents needs full understanding of the keyword query. Traditional bag of-words model cannot differentiate the roles of keywords as well as the relationship between keywords, thus is not proper for XML keyword queries. In this paper, we present a novel model specially designed for XML keyword query. The model takes a very different point of view on a keyword query: a keyword query is interpreted as a composition of several query units, each representing a query condition. We believe that this viewpoint captures the semantics of the query. To get an objective measure of the relevances of results with respect to the query, we devise a scoring method based on the proposed model that caters for query semantics as well as the structural properties of XML documents. Experimental results verify the effectiveness of our methods.","PeriodicalId":413431,"journal":{"name":"2014 International Conference on Management of e-Commerce and e-Government","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Management of e-Commerce and e-Government","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMECG.2014.59","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Keyword search is an effective paradigm for information discovery and has been introduced recently to query XML documents. Effective keyword search of XML documents needs full understanding of the keyword query. Traditional bag of-words model cannot differentiate the roles of keywords as well as the relationship between keywords, thus is not proper for XML keyword queries. In this paper, we present a novel model specially designed for XML keyword query. The model takes a very different point of view on a keyword query: a keyword query is interpreted as a composition of several query units, each representing a query condition. We believe that this viewpoint captures the semantics of the query. To get an objective measure of the relevances of results with respect to the query, we devise a scoring method based on the proposed model that caters for query semantics as well as the structural properties of XML documents. Experimental results verify the effectiveness of our methods.