Finding effective query strings from results of primary search

2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)
Pub Date: 2014-08-01
DOI: 10.1109/ICAICTA.2014.7005959

Ryota Teshima, Masayuki Okabe, Kyoji Umemura

Abstract

This paper proposes a method for finding query strings suitable for successive searches from the results of a primary search. The method can be regarded as a novel kind of keyword extraction for information retrieval, in which the strings are extracted from the primary search results. Strings are selected according to three conditions: effectiveness, prevalence, and uniqueness. The method uses no dictionary of any kind, not even a Japanese morphological analyzer. The procedure consists of two parts. The first part selects the initial candidates, which are all of the keywords in the primary search results. The second part narrows down the candidates so that they form reasonable clusters of the primary search results. Our main concern is whether the selected strings are meaningful or understandable to people. We found that more than 90% of the strings that satisfy the conditions above are meaningful and correct Japanese words.
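
The two-part procedure described in the abstract can be illustrated with a minimal sketch. The abstract does not define effectiveness, prevalence, or uniqueness, so the criteria below are assumptions made only for illustration: a string is kept if it appears in at least `min_docs` results but in no more than `max_fraction` of them, and among strings that select exactly the same set of results only the longest is kept. The function names, parameters, and thresholds are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def candidate_substrings(docs, max_len=6):
    """Part 1 (assumed form): map every character substring of length
    1..max_len to the set of result indices that contain it."""
    occurrences = defaultdict(set)
    for idx, text in enumerate(docs):
        for start in range(len(text)):
            for length in range(1, max_len + 1):
                if start + length > len(text):
                    break
                occurrences[text[start:start + length]].add(idx)
    return occurrences


def select_query_strings(docs, min_docs=2, max_fraction=0.5, max_len=6):
    """Part 2 (assumed criteria, not the paper's definitions):
    narrow the candidates to strings that pick out a non-trivial
    cluster of the primary search results."""
    occurrences = candidate_substrings(docs, max_len)
    n = len(docs)

    # Keep strings that occur in several results but not in most of them.
    kept = {s: ids for s, ids in occurrences.items()
            if min_docs <= len(ids) <= max_fraction * n}

    # Among strings that select the same results, keep the longest one
    # (ties broken lexicographically so the output is deterministic).
    best = {}
    for s, ids in kept.items():
        key = frozenset(ids)
        if key not in best or (len(s), s) > (len(best[key]), best[key]):
            best[key] = s
    return sorted(best.values())


if __name__ == "__main__":
    results = ["abc1", "abc2", "xyz3", "xyz4"]
    print(select_query_strings(results))
```

Because the sketch works on raw character substrings, it needs no tokenizer or dictionary, which matches the dictionary-free setting the abstract describes. Whether the surviving strings are meaningful words is exactly the question the paper evaluates, and this sketch makes no claim about that.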

