Evaluation of Natural Language Processing Techniques for Information Retrieval

Hope Nabankema
{"title":"Evaluation of Natural Language Processing Techniques for Information Retrieval","authors":"Hope Nabankema","doi":"10.47941/ejikm.1752","DOIUrl":null,"url":null,"abstract":"Purpose: The main objective of the study was to investigate the evaluation of Natural Language Processing techniques for information retrieval. \nMethodology: The study adopted a desktop research methodology. Desk research refers to secondary data or that which can be collected without fieldwork. Desk research is basically involved in collecting data from existing resources hence it is often considered a low cost technique as compared to field research, as the main cost is involved in executive’s time, telephone charges and directories. Thus, the study relied on already published studies, reports and statistics. This secondary data was easily accessed through the online journals and library. \nFindings: The findings reveal that there exists a contextual and methodological gap relating to Natural Language Processing techniques for information retrieval. Preliminary empirical review revealed that NLP methods significantly improved the accuracy and efficiency of information retrieval systems. Through systematic evaluation, various NLP techniques, including tokenization, named entity recognition, semantic parsing, and word embeddings, were found to enhance retrieval performance across diverse datasets and domains. By considering context and user intent, researchers aimed to develop more contextually aware and personalized information retrieval systems. The study emphasized the need for further research to explore hybrid approaches and domain-specific adaptations, ultimately highlighting the transformative potential of NLP in revolutionizing information access and utilization. \nUnique Contribution to Theory, Practice and Policy: Information Foraging theory, Relevance theory and Cognitive Load theory may be used to anchor future studies on Natural Language Processing techniques. The study provided recommendations to enhance information retrieval systems. It suggested integrating advanced NLP techniques such as named entity recognition and sentiment analysis to improve query understanding and document relevance. Additionally, the study recommended adopting word embeddings and semantic parsing techniques for better semantic understanding of user queries and documents. It emphasized the importance of domain-specific adaptations and continuous evaluation of NLP techniques to tailor them to specific application domains and keep pace with evolving user needs and technological advancements.","PeriodicalId":154013,"journal":{"name":"European Journal of Information and Knowledge Management","volume":"47 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Information and Knowledge Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47941/ejikm.1752","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: The main objective of the study was to investigate the evaluation of Natural Language Processing techniques for information retrieval. Methodology: The study adopted a desktop research methodology. Desk research refers to secondary data or that which can be collected without fieldwork. Desk research is basically involved in collecting data from existing resources hence it is often considered a low cost technique as compared to field research, as the main cost is involved in executive’s time, telephone charges and directories. Thus, the study relied on already published studies, reports and statistics. This secondary data was easily accessed through the online journals and library. Findings: The findings reveal that there exists a contextual and methodological gap relating to Natural Language Processing techniques for information retrieval. Preliminary empirical review revealed that NLP methods significantly improved the accuracy and efficiency of information retrieval systems. Through systematic evaluation, various NLP techniques, including tokenization, named entity recognition, semantic parsing, and word embeddings, were found to enhance retrieval performance across diverse datasets and domains. By considering context and user intent, researchers aimed to develop more contextually aware and personalized information retrieval systems. The study emphasized the need for further research to explore hybrid approaches and domain-specific adaptations, ultimately highlighting the transformative potential of NLP in revolutionizing information access and utilization. Unique Contribution to Theory, Practice and Policy: Information Foraging theory, Relevance theory and Cognitive Load theory may be used to anchor future studies on Natural Language Processing techniques. The study provided recommendations to enhance information retrieval systems. It suggested integrating advanced NLP techniques such as named entity recognition and sentiment analysis to improve query understanding and document relevance. Additionally, the study recommended adopting word embeddings and semantic parsing techniques for better semantic understanding of user queries and documents. It emphasized the importance of domain-specific adaptations and continuous evaluation of NLP techniques to tailor them to specific application domains and keep pace with evolving user needs and technological advancements.
评估用于信息检索的自然语言处理技术
目的:本研究的主要目的是调查自然语言处理技术对信息检索的评估。研究方法:研究采用了桌面研究方法。案头研究指的是二手数据或无需实地考察即可收集到的数据。案头研究基本上是从现有资源中收集数据,因此,与实地研究相比,案头研究通常被认为是一种低成本技术,因为主要成本涉及执行人员的时间、电话费和目录。因此,本研究依赖于已出版的研究、报告和统计数据。这些二手数据可通过在线期刊和图书馆轻松获取。研究结果:研究结果表明,自然语言处理技术在信息检索方面存在背景和方法上的差距。初步的经验审查显示,NLP 方法大大提高了信息检索系统的准确性和效率。通过系统评估,发现各种 NLP 技术(包括标记化、命名实体识别、语义解析和词嵌入)都能提高不同数据集和领域的检索性能。通过考虑上下文和用户意图,研究人员旨在开发出更具上下文意识和个性化的信息检索系统。研究强调了进一步研究探索混合方法和特定领域适应性的必要性,最终凸显了 NLP 在彻底改变信息获取和利用方面的变革潜力。对理论、实践和政策的独特贡献:信息觅食理论、相关性理论和认知负荷理论可用于今后的自然语言处理技术研究。该研究为增强信息检索系统提供了建议。它建议整合先进的自然语言处理技术,如命名实体识别和情感分析,以提高查询理解能力和文档相关性。此外,研究还建议采用词嵌入和语义解析技术,以便更好地理解用户查询和文档的语义。研究强调,必须针对特定领域对 NLP 技术进行调整和持续评估,使其适应特定应用领域,并跟上不断变化的用户需求和技术进步的步伐。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信