{"title":"Finding effective query strings from results of primary search","authors":"Ryota Teshima, Masayuki Okabe, Kyoji Umemura","doi":"10.1109/ICAICTA.2014.7005959","DOIUrl":null,"url":null,"abstract":"This paper proposes a method to find query strings suitable for successive searches from primary search results. This method may be regarded as a novel kind of keyword extraction for information retrieval, where these strings are extracted from primary search results. These strings are selected depending on the following conditions: effectiveness, prevalence, and uniqueness. In addition, this method does not use any kind of dictionary, not even a Japanese morphological analyzer. The proposed procedure consists of two parts. The first part is selecting the first candidates, which are all of the keywords in primary search results. The second part is narrowing down the candidates so that the candidates form reasonable cluster of primary search results. Our main concern is whether the selected string is meaningful or understandable for people. We have found that more than 90% of the strings that satisfy the conditions above are meaningful and correct Japanese words.","PeriodicalId":173600,"journal":{"name":"2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)","volume":"273 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICTA.2014.7005959","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes a method to find query strings suitable for successive searches from primary search results. This method may be regarded as a novel kind of keyword extraction for information retrieval, where these strings are extracted from primary search results. These strings are selected depending on the following conditions: effectiveness, prevalence, and uniqueness. In addition, this method does not use any kind of dictionary, not even a Japanese morphological analyzer. The proposed procedure consists of two parts. The first part is selecting the first candidates, which are all of the keywords in primary search results. The second part is narrowing down the candidates so that the candidates form reasonable cluster of primary search results. Our main concern is whether the selected string is meaningful or understandable for people. We have found that more than 90% of the strings that satisfy the conditions above are meaningful and correct Japanese words.