面向个性化结果的信息检索阅读理解

Q3 Social Sciences

Proceedings of the Association for Information Science and Technology Pub Date : 2023-10-01 DOI:10.1002/pra2.929

Yumi Kim, Heesop Kim

{"title":"面向个性化结果的信息检索阅读理解","authors":"Yumi Kim, Heesop Kim","doi":"10.1002/pra2.929","DOIUrl":null,"url":null,"abstract":"ABSTRACT Recent research on personalized retrieval technology has been actively conducted to meet the needs of users for seeking adequate information. To refine the retrieval, researchers are considering user behavior patterns in a variety of ways. In this study, we use eye‐tracking metadata to predict users' levels of comprehension as textual evidence for IR processes. Furthermore, we incorporated eye‐tracking metadata on the Automated Readability Index (ARI), a readability assessment tool of an English text. Our research is largely divided into two tasks: i) comprehension evaluation task (CET) and ii) comprehension‐based retrieval task (CRT). In the CET task, for predicting the comprehension level, we applied various regression models. Among them, the Voting regressor demonstrated the highest performance with a Spearman's 𝜌 of 0.68. In the CRT task, we incorporated the level of comprehension predicted in the CET task and ARI score into the ranking results. We derived a sentenceBERT to find the relevant text for a query and the Normalized Discounted Cumulative Gain (nDCG) for evaluating the CRT task. The nDCG score for Comprehension Level only and that with ARI together were 0.65 and 0.78, respectively. Thus, applying ARI resulted in a higher nDCG score compared to comprehension level only.","PeriodicalId":37833,"journal":{"name":"Proceedings of the Association for Information Science and Technology","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reading Comprehension in Information Retrieval (<scp>RCIR</scp>) for Personalized Results\",\"authors\":\"Yumi Kim, Heesop Kim\",\"doi\":\"10.1002/pra2.929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Recent research on personalized retrieval technology has been actively conducted to meet the needs of users for seeking adequate information. To refine the retrieval, researchers are considering user behavior patterns in a variety of ways. In this study, we use eye‐tracking metadata to predict users' levels of comprehension as textual evidence for IR processes. Furthermore, we incorporated eye‐tracking metadata on the Automated Readability Index (ARI), a readability assessment tool of an English text. Our research is largely divided into two tasks: i) comprehension evaluation task (CET) and ii) comprehension‐based retrieval task (CRT). In the CET task, for predicting the comprehension level, we applied various regression models. Among them, the Voting regressor demonstrated the highest performance with a Spearman's 𝜌 of 0.68. In the CRT task, we incorporated the level of comprehension predicted in the CET task and ARI score into the ranking results. We derived a sentenceBERT to find the relevant text for a query and the Normalized Discounted Cumulative Gain (nDCG) for evaluating the CRT task. The nDCG score for Comprehension Level only and that with ARI together were 0.65 and 0.78, respectively. Thus, applying ARI resulted in a higher nDCG score compared to comprehension level only.\",\"PeriodicalId\":37833,\"journal\":{\"name\":\"Proceedings of the Association for Information Science and Technology\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Association for Information Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/pra2.929\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Association for Information Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/pra2.929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 0

摘要

近年来，个性化检索技术的研究一直在积极进行，以满足用户对充分信息的需求。为了改进检索，研究人员正在以各种方式考虑用户行为模式。在本研究中，我们使用眼动追踪元数据来预测用户的理解水平，作为IR过程的文本证据。此外，我们将眼动追踪元数据纳入了自动可读性指数(ARI)，这是一种英语文本的可读性评估工具。我们的研究主要分为两个任务:理解评价任务(CET)和基于理解的检索任务(CRT)。在CET任务中，为了预测理解水平，我们应用了各种回归模型。其中，Voting回归量表现最好，Spearman’s𝜌为0.68。在CRT任务中，我们将CET任务和ARI分数预测的理解水平纳入排名结果。我们派生了一个sentenceBERT来查找查询的相关文本，并派生了一个归一化贴现累积增益(nDCG)来评估CRT任务。单纯理解水平组和合并ARI组的nDCG得分分别为0.65和0.78。因此，与仅应用理解水平相比，应用ARI可导致更高的nDCG评分。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reading Comprehension in Information Retrieval (RCIR) for Personalized Results

ABSTRACT Recent research on personalized retrieval technology has been actively conducted to meet the needs of users for seeking adequate information. To refine the retrieval, researchers are considering user behavior patterns in a variety of ways. In this study, we use eye‐tracking metadata to predict users' levels of comprehension as textual evidence for IR processes. Furthermore, we incorporated eye‐tracking metadata on the Automated Readability Index (ARI), a readability assessment tool of an English text. Our research is largely divided into two tasks: i) comprehension evaluation task (CET) and ii) comprehension‐based retrieval task (CRT). In the CET task, for predicting the comprehension level, we applied various regression models. Among them, the Voting regressor demonstrated the highest performance with a Spearman's 𝜌 of 0.68. In the CRT task, we incorporated the level of comprehension predicted in the CET task and ARI score into the ranking results. We derived a sentenceBERT to find the relevant text for a query and the Normalized Discounted Cumulative Gain (nDCG) for evaluating the CRT task. The nDCG score for Comprehension Level only and that with ARI together were 0.65 and 0.78, respectively. Thus, applying ARI resulted in a higher nDCG score compared to comprehension level only.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the Association for Information Science and Technology Social Sciences-Library and Information Sciences

CiteScore

1.30

自引率

0.00%

发文量

164

期刊介绍： Information not localized