Finding effective query strings from results of primary search

Ryota Teshima, Masayuki Okabe, Kyoji Umemura
DOI: 10.1109/ICAICTA.2014.7005959
Published in: 2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)
Publication date: 2014-08-01
URL: https://doi.org/10.1109/ICAICTA.2014.7005959
Citations: 0

Abstract

This paper proposes a method for finding query strings suitable for successive searches from primary search results. The method may be regarded as a novel kind of keyword extraction for information retrieval, in which the strings are extracted from the primary search results themselves. Strings are selected according to three conditions: effectiveness, prevalence, and uniqueness. The method does not use any kind of dictionary, not even a Japanese morphological analyzer. The proposed procedure consists of two parts. The first part selects the initial candidates, which are all of the keywords in the primary search results. The second part narrows down the candidates so that they form reasonable clusters of the primary search results. Our main concern is whether the selected strings are meaningful or understandable to people. We have found that more than 90% of the strings that satisfy the conditions above are meaningful and correct Japanese words.
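The two-part procedure can be illustrated with a minimal sketch. The abstract does not define effectiveness, prevalence, or uniqueness, so the criteria below are stand-in assumptions, not the authors' definitions: a string counts as prevalent if it appears in at least `min_docs` results, as effective if it does not appear in nearly all of them (so it still narrows the result set), and as unique if it is the longest string among those matching exactly the same set of results. The function names, parameters, and thresholds are hypothetical. The sketch works on raw character substrings, so it needs no dictionary or morphological analyzer.

```python
from collections import defaultdict


def candidate_substrings(docs, min_len=2, max_len=6):
    """Part 1: map every character substring to the set of documents containing it."""
    postings = defaultdict(set)
    for i, doc in enumerate(docs):
        for n in range(min_len, max_len + 1):
            for start in range(len(doc) - n + 1):
                postings[doc[start:start + n]].add(i)
    return postings


def select_query_strings(docs, min_docs=2, max_fraction=0.8, min_len=2, max_len=6):
    """Part 2: narrow the candidates using assumed stand-in criteria."""
    postings = candidate_substrings(docs, min_len, max_len)
    limit = max_fraction * len(docs)

    # Assumed "prevalence": the string occurs in at least min_docs results.
    # Assumed "effectiveness": it does not occur in almost every result.
    kept = {s: d for s, d in postings.items() if min_docs <= len(d) <= limit}

    # Assumed "uniqueness": among strings that select the same cluster of
    # results, keep only the longest (ties broken alphabetically).
    best = {}
    for s, d in kept.items():
        key = frozenset(d)
        current = best.get(key)
        if current is None or (len(s), current) > (len(current), s):
            best[key] = s
    return sorted(best.values())


if __name__ == "__main__":
    results = ["abcx", "abcy", "zzzq"]
    print(select_query_strings(results))
```

Each surviving string corresponds to one cluster of primary results (the documents that contain it), which loosely mirrors the abstract's description of candidates forming clusters of the primary search results.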