{"title":"语义扩展,提高查询公式的多样性","authors":"Elliot Ide, C. Olivares-Rodríguez","doi":"10.1109/LA-CCI48322.2021.9769853","DOIUrl":null,"url":null,"abstract":"Although the diversity of results has been studied since the early information retrieval systems, few studies explore diversity and its representation in an educational context. Inherently, approaches that seek to address difficulties in web search are focused on maximizing the relevance of results over the original query. This work presents a method that integrates semantic relationships using Word Embedding for expansion with blind feedback to improve diversity. Using a corpus based on the user’s query logs from a realistic setting, three Word2vec models are trained to obtain semantically relevant terms for each naturally elaborated query by students. The proposed architecture is studied in a specific search task, limiting the number of candidate terms in each model according to the allowed frequency of words. Finally, the diversity in two groups of queries is compared, measuring the lexical similarity of the snippets of the results pre-expansion and post-expansion. Results indicate the potential for improving diversity, also showing that lower semantic similarity can lead to better diversity. Therefore, we provide a method to improve learning through web searches.","PeriodicalId":431041,"journal":{"name":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic expansion to improve diversity in query formulation\",\"authors\":\"Elliot Ide, C. Olivares-Rodríguez\",\"doi\":\"10.1109/LA-CCI48322.2021.9769853\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although the diversity of results has been studied since the early information retrieval systems, few studies explore diversity and its representation in an educational context. Inherently, approaches that seek to address difficulties in web search are focused on maximizing the relevance of results over the original query. This work presents a method that integrates semantic relationships using Word Embedding for expansion with blind feedback to improve diversity. Using a corpus based on the user’s query logs from a realistic setting, three Word2vec models are trained to obtain semantically relevant terms for each naturally elaborated query by students. The proposed architecture is studied in a specific search task, limiting the number of candidate terms in each model according to the allowed frequency of words. Finally, the diversity in two groups of queries is compared, measuring the lexical similarity of the snippets of the results pre-expansion and post-expansion. Results indicate the potential for improving diversity, also showing that lower semantic similarity can lead to better diversity. Therefore, we provide a method to improve learning through web searches.\",\"PeriodicalId\":431041,\"journal\":{\"name\":\"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/LA-CCI48322.2021.9769853\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LA-CCI48322.2021.9769853","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Semantic expansion to improve diversity in query formulation
Although the diversity of results has been studied since the early information retrieval systems, few studies explore diversity and its representation in an educational context. Inherently, approaches that seek to address difficulties in web search are focused on maximizing the relevance of results over the original query. This work presents a method that integrates semantic relationships using Word Embedding for expansion with blind feedback to improve diversity. Using a corpus based on the user’s query logs from a realistic setting, three Word2vec models are trained to obtain semantically relevant terms for each naturally elaborated query by students. The proposed architecture is studied in a specific search task, limiting the number of candidate terms in each model according to the allowed frequency of words. Finally, the diversity in two groups of queries is compared, measuring the lexical similarity of the snippets of the results pre-expansion and post-expansion. Results indicate the potential for improving diversity, also showing that lower semantic similarity can lead to better diversity. Therefore, we provide a method to improve learning through web searches.