Comment-enriched index terms improve the relevance and novelty of the ranking of the commented medical articles retrieved by an NLP system

IF 3.1 3区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Kianoosh Rashidi, H. Sotudeh, A. Nikseresht
{"title":"Comment-enriched index terms improve the relevance and novelty of the ranking of the commented medical articles retrieved by an NLP system","authors":"Kianoosh Rashidi, H. Sotudeh, A. Nikseresht","doi":"10.1108/oir-05-2022-0283","DOIUrl":null,"url":null,"abstract":"PurposeThis study aimed to investigate how the enrichment of medical documents' index terms by their comments improves the relevance and novelty of the top-ranked results retrieved by an NLP system.Design/methodology/approachA semi-experimental pre-test and post-test research was designed to compare NLP-based indexes before and after being expanded by the comment terms. The experiments were conducted on a test collection of 13,957 documents commented by F1000-Prime reviewers. They were indexed at title, abstract, body and full-text levels. In total, 100 seed documents were randomly selected and served as queries. The textual similarity of the documents and queries was calculated using Lucene-more-like-this function and evaluated by the semantic similarity of their MeSH. The results novelty was measured using maximal marginal relevance and evaluated by their MeSH novelties. Normalized discounted cumulative gain was used to compare the basic and expanded indexes' precisions at 10, 20 and 50 top ranks.FindingsThe relevance and novelty of the results ranked at the top precision points was improved after expanding the indexes by the comment terms. The finding implies that meta-texts are effective in representing their mother documents, by adding dynamic elements to their rather static contents. It also provides further evidence about the merits of the application of social intelligence and collective wisdom reflected in the actions and reactions of users in tackling the challenges faced by NLP-based systems.Originality/valueThis is the first study to confirm that social comments on scientific papers improve the performance of information systems in terms of relevance and novelty.Peer reviewThe peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-05-2022-0283.","PeriodicalId":54683,"journal":{"name":"Online Information Review","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2022-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Information Review","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/oir-05-2022-0283","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

PurposeThis study aimed to investigate how the enrichment of medical documents' index terms by their comments improves the relevance and novelty of the top-ranked results retrieved by an NLP system.Design/methodology/approachA semi-experimental pre-test and post-test research was designed to compare NLP-based indexes before and after being expanded by the comment terms. The experiments were conducted on a test collection of 13,957 documents commented by F1000-Prime reviewers. They were indexed at title, abstract, body and full-text levels. In total, 100 seed documents were randomly selected and served as queries. The textual similarity of the documents and queries was calculated using Lucene-more-like-this function and evaluated by the semantic similarity of their MeSH. The results novelty was measured using maximal marginal relevance and evaluated by their MeSH novelties. Normalized discounted cumulative gain was used to compare the basic and expanded indexes' precisions at 10, 20 and 50 top ranks.FindingsThe relevance and novelty of the results ranked at the top precision points was improved after expanding the indexes by the comment terms. The finding implies that meta-texts are effective in representing their mother documents, by adding dynamic elements to their rather static contents. It also provides further evidence about the merits of the application of social intelligence and collective wisdom reflected in the actions and reactions of users in tackling the challenges faced by NLP-based systems.Originality/valueThis is the first study to confirm that social comments on scientific papers improve the performance of information systems in terms of relevance and novelty.Peer reviewThe peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-05-2022-0283.
评论丰富的索引项提高了NLP系统检索到的评论医学文章排名的相关性和新颖性
目的本研究旨在探讨通过医学文献的评论来丰富医学文献的索引词,如何提高NLP系统检索到的排名靠前的结果的相关性和新颖性。设计/方法/方法设计半实验前测和后测研究,比较基于nlp的索引被评论词扩展前后的差异。实验是在F1000-Prime审稿人评论的13957篇论文的测试集上进行的。它们按标题、摘要、正文和全文进行索引。总共随机选择100个种子文档作为查询。使用Lucene-more-like-this函数计算文档和查询的文本相似度,并通过其MeSH的语义相似度进行评估。结果的新颖性用最大边际相关性来衡量,并通过其MeSH新颖性来评估。使用归一化贴现累积增益来比较基本指数和扩展指数在10、20和50位的精度。结果:在通过评论项扩展索引后,排名在精度点前的结果的相关性和新颖性得到了提高。这一发现表明,通过向其静态内容添加动态元素,元文本可以有效地表示其母文档。它还提供了进一步的证据,证明在解决基于nlp的系统所面临的挑战时,反映在用户的行动和反应中的社会智能和集体智慧的应用的优点。原创性/价值这是第一个证实对科学论文的社会评论在相关性和新颖性方面提高信息系统性能的研究。同行评议本文的同行评议历史可在:https://publons.com/publon/10.1108/OIR-05-2022-0283。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Online Information Review
Online Information Review 工程技术-计算机:信息系统
CiteScore
6.90
自引率
16.10%
发文量
67
审稿时长
6 months
期刊介绍: The journal provides a multi-disciplinary forum for scholars from a range of fields, including information studies/iSchools, data studies, internet studies, media and communication studies and information systems. Publishes research on the social, political and ethical aspects of emergent digital information practices and platforms, and welcomes submissions that draw upon critical and socio-technical perspectives in order to address these developments. Welcomes empirical, conceptual and methodological contributions on any topics relevant to the broad field of digital information and communication, however we are particularly interested in receiving submissions that address emerging issues around the below topics. Coverage includes (but is not limited to): •Online communities, social networking and social media, including online political communication; crowdsourcing; positive computing and wellbeing. •The social drivers and implications of emerging data practices, including open data; big data; data journeys and flows; and research data management. •Digital transformations including organisations’ use of information technologies (e.g. Internet of Things and digitisation of user experience) to improve economic and social welfare, health and wellbeing, and protect the environment. •Developments in digital scholarship and the production and use of scholarly content. •Online and digital research methods, including their ethical aspects.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信