Search for Authors of Publications among Users of the Large Scientometric Data System Using the WAND Method

V. Vasenin, D. D. Zaslavskiy
{"title":"Search for Authors of Publications among Users of the Large Scientometric Data System Using the WAND Method","authors":"V. Vasenin, D. D. Zaslavskiy","doi":"10.17587/prin.15.26-34","DOIUrl":null,"url":null,"abstract":"Currently, the processing of search queries in big data systems is an important area of research. Its results find applications in various fields, including research, development and technological work (R&D). One of the main tasks in this area is accounting, analysis and promoting its participants through competitive means. To achieve this, information and analytical scientometric systems are developed to aggregate published R&D results. The article discusses a specific task arising in such systems, namely, the task of determining the involvement of authors in writing a scientific publication. Information and analytical systems store records of publications and their authors, but often there are no mechanisms that allow determining the relationship between the publication and the authors with high accuracy. The goal of the task, which is presented in the article, is to restore missing relationships. The algorithm presented in the article is based on the assumption that R&D work is carried out by teams of authors, and to determine the authors of the publication, it is enough to identify these teams. The materials of this article will be valuable to researchers and practitioners involved in automating processes within large information-analytical systems in the field of scientometrics and bibliometrics. Implementing the heuristic of authorship teams can significantly enhance the accuracy and performance of several similar-purpose systems, particularly those requiring real-time query processing.","PeriodicalId":513113,"journal":{"name":"Programmnaya Ingeneria","volume":"46 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Programmnaya Ingeneria","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17587/prin.15.26-34","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Currently, the processing of search queries in big data systems is an important area of research. Its results find applications in various fields, including research, development and technological work (R&D). One of the main tasks in this area is accounting, analysis and promoting its participants through competitive means. To achieve this, information and analytical scientometric systems are developed to aggregate published R&D results. The article discusses a specific task arising in such systems, namely, the task of determining the involvement of authors in writing a scientific publication. Information and analytical systems store records of publications and their authors, but often there are no mechanisms that allow determining the relationship between the publication and the authors with high accuracy. The goal of the task, which is presented in the article, is to restore missing relationships. The algorithm presented in the article is based on the assumption that R&D work is carried out by teams of authors, and to determine the authors of the publication, it is enough to identify these teams. The materials of this article will be valuable to researchers and practitioners involved in automating processes within large information-analytical systems in the field of scientometrics and bibliometrics. Implementing the heuristic of authorship teams can significantly enhance the accuracy and performance of several similar-purpose systems, particularly those requiring real-time query processing.
使用 WAND 方法在大型科学计量数据系统用户中搜索论文作者
目前,在大数据系统中处理搜索查询是一个重要的研究领域。其成果可应用于各个领域,包括研究、开发和技术工作(研发)。该领域的主要任务之一是核算、分析并通过竞争手段促进其参与者。为此,开发了信息和分析科学计量系统,以汇总已公布的研发成果。本文讨论了此类系统中出现的一项具体任务,即确定作者参与撰写科学出版物的任务。信息和分析系统存储了出版物及其作者的记录,但往往没有能够高精度确定出版物与作者之间关系的机制。文章中介绍的这项任务的目标就是恢复缺失的关系。文章中介绍的算法基于这样一个假设,即研发工作是由作者团队完成的,而要确定出版物的作者,只需识别这些团队即可。本文的资料对于科学计量学和文献计量学领域中参与大型信息分析系统流程自动化的研究人员和从业人员很有价值。采用作者团队启发式方法可以大大提高一些类似用途系统的准确性和性能,尤其是那些需要实时查询处理的系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信