语义扩展,提高查询公式的多样性

Elliot Ide, C. Olivares-Rodríguez
{"title":"语义扩展,提高查询公式的多样性","authors":"Elliot Ide, C. Olivares-Rodríguez","doi":"10.1109/LA-CCI48322.2021.9769853","DOIUrl":null,"url":null,"abstract":"Although the diversity of results has been studied since the early information retrieval systems, few studies explore diversity and its representation in an educational context. Inherently, approaches that seek to address difficulties in web search are focused on maximizing the relevance of results over the original query. This work presents a method that integrates semantic relationships using Word Embedding for expansion with blind feedback to improve diversity. Using a corpus based on the user’s query logs from a realistic setting, three Word2vec models are trained to obtain semantically relevant terms for each naturally elaborated query by students. The proposed architecture is studied in a specific search task, limiting the number of candidate terms in each model according to the allowed frequency of words. Finally, the diversity in two groups of queries is compared, measuring the lexical similarity of the snippets of the results pre-expansion and post-expansion. Results indicate the potential for improving diversity, also showing that lower semantic similarity can lead to better diversity. Therefore, we provide a method to improve learning through web searches.","PeriodicalId":431041,"journal":{"name":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic expansion to improve diversity in query formulation\",\"authors\":\"Elliot Ide, C. Olivares-Rodríguez\",\"doi\":\"10.1109/LA-CCI48322.2021.9769853\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although the diversity of results has been studied since the early information retrieval systems, few studies explore diversity and its representation in an educational context. Inherently, approaches that seek to address difficulties in web search are focused on maximizing the relevance of results over the original query. This work presents a method that integrates semantic relationships using Word Embedding for expansion with blind feedback to improve diversity. Using a corpus based on the user’s query logs from a realistic setting, three Word2vec models are trained to obtain semantically relevant terms for each naturally elaborated query by students. The proposed architecture is studied in a specific search task, limiting the number of candidate terms in each model according to the allowed frequency of words. Finally, the diversity in two groups of queries is compared, measuring the lexical similarity of the snippets of the results pre-expansion and post-expansion. Results indicate the potential for improving diversity, also showing that lower semantic similarity can lead to better diversity. Therefore, we provide a method to improve learning through web searches.\",\"PeriodicalId\":431041,\"journal\":{\"name\":\"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/LA-CCI48322.2021.9769853\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LA-CCI48322.2021.9769853","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

虽然从早期的信息检索系统开始就对结果的多样性进行了研究,但很少有研究探讨多样性及其在教育背景下的表现。从本质上讲,解决网络搜索困难的方法关注的是将结果与原始查询的相关性最大化。本文提出了一种利用词嵌入扩展和盲反馈集成语义关系以提高多样性的方法。使用基于用户实际设置的查询日志的语料库,训练三个Word2vec模型,以便为学生的每个自然详细的查询获得语义相关的术语。提出的体系结构在特定的搜索任务中进行研究,根据允许的词频率限制每个模型中的候选词的数量。最后,比较两组查询的多样性,测量展开前和展开后结果片段的词汇相似性。结果表明,语义相似度越低,多样性越好。因此,我们提供了一种通过网络搜索来提高学习的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Semantic expansion to improve diversity in query formulation
Although the diversity of results has been studied since the early information retrieval systems, few studies explore diversity and its representation in an educational context. Inherently, approaches that seek to address difficulties in web search are focused on maximizing the relevance of results over the original query. This work presents a method that integrates semantic relationships using Word Embedding for expansion with blind feedback to improve diversity. Using a corpus based on the user’s query logs from a realistic setting, three Word2vec models are trained to obtain semantically relevant terms for each naturally elaborated query by students. The proposed architecture is studied in a specific search task, limiting the number of candidate terms in each model according to the allowed frequency of words. Finally, the diversity in two groups of queries is compared, measuring the lexical similarity of the snippets of the results pre-expansion and post-expansion. Results indicate the potential for improving diversity, also showing that lower semantic similarity can lead to better diversity. Therefore, we provide a method to improve learning through web searches.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信