面向个性化结果的信息检索阅读理解

Q3 Social Sciences
Yumi Kim, Heesop Kim
{"title":"面向个性化结果的信息检索阅读理解","authors":"Yumi Kim, Heesop Kim","doi":"10.1002/pra2.929","DOIUrl":null,"url":null,"abstract":"ABSTRACT Recent research on personalized retrieval technology has been actively conducted to meet the needs of users for seeking adequate information. To refine the retrieval, researchers are considering user behavior patterns in a variety of ways. In this study, we use eye‐tracking metadata to predict users' levels of comprehension as textual evidence for IR processes. Furthermore, we incorporated eye‐tracking metadata on the Automated Readability Index (ARI), a readability assessment tool of an English text. Our research is largely divided into two tasks: i) comprehension evaluation task (CET) and ii) comprehension‐based retrieval task (CRT). In the CET task, for predicting the comprehension level, we applied various regression models. Among them, the Voting regressor demonstrated the highest performance with a Spearman's 𝜌 of 0.68. In the CRT task, we incorporated the level of comprehension predicted in the CET task and ARI score into the ranking results. We derived a sentenceBERT to find the relevant text for a query and the Normalized Discounted Cumulative Gain (nDCG) for evaluating the CRT task. The nDCG score for Comprehension Level only and that with ARI together were 0.65 and 0.78, respectively. Thus, applying ARI resulted in a higher nDCG score compared to comprehension level only.","PeriodicalId":37833,"journal":{"name":"Proceedings of the Association for Information Science and Technology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reading Comprehension in Information Retrieval (<scp>RCIR</scp>) for Personalized Results\",\"authors\":\"Yumi Kim, Heesop Kim\",\"doi\":\"10.1002/pra2.929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Recent research on personalized retrieval technology has been actively conducted to meet the needs of users for seeking adequate information. To refine the retrieval, researchers are considering user behavior patterns in a variety of ways. In this study, we use eye‐tracking metadata to predict users' levels of comprehension as textual evidence for IR processes. Furthermore, we incorporated eye‐tracking metadata on the Automated Readability Index (ARI), a readability assessment tool of an English text. Our research is largely divided into two tasks: i) comprehension evaluation task (CET) and ii) comprehension‐based retrieval task (CRT). In the CET task, for predicting the comprehension level, we applied various regression models. Among them, the Voting regressor demonstrated the highest performance with a Spearman's 𝜌 of 0.68. In the CRT task, we incorporated the level of comprehension predicted in the CET task and ARI score into the ranking results. We derived a sentenceBERT to find the relevant text for a query and the Normalized Discounted Cumulative Gain (nDCG) for evaluating the CRT task. The nDCG score for Comprehension Level only and that with ARI together were 0.65 and 0.78, respectively. Thus, applying ARI resulted in a higher nDCG score compared to comprehension level only.\",\"PeriodicalId\":37833,\"journal\":{\"name\":\"Proceedings of the Association for Information Science and Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Association for Information Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/pra2.929\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Association for Information Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/pra2.929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 0

摘要

近年来,个性化检索技术的研究一直在积极进行,以满足用户对充分信息的需求。为了改进检索,研究人员正在以各种方式考虑用户行为模式。在本研究中,我们使用眼动追踪元数据来预测用户的理解水平,作为IR过程的文本证据。此外,我们将眼动追踪元数据纳入了自动可读性指数(ARI),这是一种英语文本的可读性评估工具。我们的研究主要分为两个任务:理解评价任务(CET)和基于理解的检索任务(CRT)。在CET任务中,为了预测理解水平,我们应用了各种回归模型。其中,Voting回归量表现最好,Spearman’s𝜌为0.68。在CRT任务中,我们将CET任务和ARI分数预测的理解水平纳入排名结果。我们派生了一个sentenceBERT来查找查询的相关文本,并派生了一个归一化贴现累积增益(nDCG)来评估CRT任务。单纯理解水平组和合并ARI组的nDCG得分分别为0.65和0.78。因此,与仅应用理解水平相比,应用ARI可导致更高的nDCG评分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Reading Comprehension in Information Retrieval (RCIR) for Personalized Results
ABSTRACT Recent research on personalized retrieval technology has been actively conducted to meet the needs of users for seeking adequate information. To refine the retrieval, researchers are considering user behavior patterns in a variety of ways. In this study, we use eye‐tracking metadata to predict users' levels of comprehension as textual evidence for IR processes. Furthermore, we incorporated eye‐tracking metadata on the Automated Readability Index (ARI), a readability assessment tool of an English text. Our research is largely divided into two tasks: i) comprehension evaluation task (CET) and ii) comprehension‐based retrieval task (CRT). In the CET task, for predicting the comprehension level, we applied various regression models. Among them, the Voting regressor demonstrated the highest performance with a Spearman's 𝜌 of 0.68. In the CRT task, we incorporated the level of comprehension predicted in the CET task and ARI score into the ranking results. We derived a sentenceBERT to find the relevant text for a query and the Normalized Discounted Cumulative Gain (nDCG) for evaluating the CRT task. The nDCG score for Comprehension Level only and that with ARI together were 0.65 and 0.78, respectively. Thus, applying ARI resulted in a higher nDCG score compared to comprehension level only.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Proceedings of the Association for Information Science and Technology
Proceedings of the Association for Information Science and Technology Social Sciences-Library and Information Sciences
CiteScore
1.30
自引率
0.00%
发文量
164
期刊介绍: Information not localized
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信