{"title":"A Query Expansion Algorithm Based on Phrases Semantic Similarity","authors":"Yongli Liu, C. Li, Pingan Zhang, Z. Xiong","doi":"10.1109/ISIP.2008.57","DOIUrl":null,"url":null,"abstract":"During the indexing process of traditional search engine, web pages become a list of terms, but single term cannot represent the rich content of web pages, which makes information retrieval methods mainly based on terms matching often result in depressed precision. This paper proposes a novel query expansion technique that has phrases as its expansion unit. Phrases typically have a higher information content and a smaller degree of ambiguity than their constituent words, and therefore represent the concepts expressed in text more accurately than single terms. This method extracts key phrases from original results, and calculates the semantic similarity between the query phrase and each phrase extracted using the semantic similarity algorithm based on WordNet, and then expands the query with the most similar phrases to search again. Experimental results show that the proposed algorithm can provide more precision than the traditional query expansion methods.","PeriodicalId":103284,"journal":{"name":"2008 International Symposiums on Information Processing","volume":"323 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"30","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Symposiums on Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISIP.2008.57","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 30
Abstract
During the indexing process of traditional search engine, web pages become a list of terms, but single term cannot represent the rich content of web pages, which makes information retrieval methods mainly based on terms matching often result in depressed precision. This paper proposes a novel query expansion technique that has phrases as its expansion unit. Phrases typically have a higher information content and a smaller degree of ambiguity than their constituent words, and therefore represent the concepts expressed in text more accurately than single terms. This method extracts key phrases from original results, and calculates the semantic similarity between the query phrase and each phrase extracted using the semantic similarity algorithm based on WordNet, and then expands the query with the most similar phrases to search again. Experimental results show that the proposed algorithm can provide more precision than the traditional query expansion methods.