An author credit allocation method with improved distinguishability and robustness

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-06-01 DOI:10.2478/jdis-2023-0016

Yang Li, Tao Jia

{"title":"An author credit allocation method with improved distinguishability and robustness","authors":"Yang Li, Tao Jia","doi":"10.2478/jdis-2023-0016","DOIUrl":null,"url":null,"abstract":"Abstract Purpose The purpose of this study is to propose an improved credit allocation method that makes the leading author of the paper more distinguishable and makes the deification more robust under malicious manipulations. Design/methodology/approach We utilize a modified Sigmoid function to handle the fat-tail distributed citation counts. We also remove the target paper in calculating the contribution of co-citations. Following previous studies, we use 30 Nobel Prize-winning papers and their citation networks based on the American Physical Society (APS) and the Microsoft Academic Graph (MAG) dataset to test the accuracy of our proposed method (NCCAS). In addition, we use 654,148 articles published in the field of computer science from 2000 to 2009 in the MAG dataset to validate the distinguishability and robustness of NCCAS. Finding Compared with the state-of-the-art methods, NCCAS gives the most accurate prediction of Nobel laureates. Furthermore, the leading author of the paper identified by NCCAS is more distinguishable compared with other co-authors. The results by NCCAS are also more robust to malicious manipulation. Finally, we perform ablation studies to show the contribution of different components in our methods. Research limitations Due to limited ground truth on the true leading author of a work, the accuracy of NCCAS and other related methods can only be tested in Nobel Physics Prize-winning papers. Practical implications NCCAS is successfully applied to a large number of publications, demonstrating its potential in analyzing the relationship between the contribution and the recognition of authors with different by-line orders. Originality/value Compared with existing methods, NCCAS not only identifies the leading author of a paper more accurately, but also makes the deification more distinguishable and more robust, providing a new tool for related studies.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"15 - 46"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of data and information science (Warsaw, Poland)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/jdis-2023-0016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Purpose The purpose of this study is to propose an improved credit allocation method that makes the leading author of the paper more distinguishable and makes the deification more robust under malicious manipulations. Design/methodology/approach We utilize a modified Sigmoid function to handle the fat-tail distributed citation counts. We also remove the target paper in calculating the contribution of co-citations. Following previous studies, we use 30 Nobel Prize-winning papers and their citation networks based on the American Physical Society (APS) and the Microsoft Academic Graph (MAG) dataset to test the accuracy of our proposed method (NCCAS). In addition, we use 654,148 articles published in the field of computer science from 2000 to 2009 in the MAG dataset to validate the distinguishability and robustness of NCCAS. Finding Compared with the state-of-the-art methods, NCCAS gives the most accurate prediction of Nobel laureates. Furthermore, the leading author of the paper identified by NCCAS is more distinguishable compared with other co-authors. The results by NCCAS are also more robust to malicious manipulation. Finally, we perform ablation studies to show the contribution of different components in our methods. Research limitations Due to limited ground truth on the true leading author of a work, the accuracy of NCCAS and other related methods can only be tested in Nobel Physics Prize-winning papers. Practical implications NCCAS is successfully applied to a large number of publications, demonstrating its potential in analyzing the relationship between the contribution and the recognition of authors with different by-line orders. Originality/value Compared with existing methods, NCCAS not only identifies the leading author of a paper more accurately, but also makes the deification more distinguishable and more robust, providing a new tool for related studies.

查看原文本刊更多论文

一种提高可分辨性和鲁棒性的作者信用分配方法

摘要目的本研究的目的是提出一种改进的信用分配方法，使论文的主要作者更容易区分，并使神化在恶意操作下更稳健。设计/方法论/方法我们利用改进的Sigmoid函数来处理胖尾分布的引文计数。我们还删除了计算共同引用贡献的目标论文。根据之前的研究，我们使用了30篇诺贝尔奖获奖论文及其基于美国物理学会（APS）和微软学术图谱（MAG）数据集的引文网络来测试我们提出的方法（NCCAS）的准确性。此外，我们在MAG数据集中使用了2000年至2009年在计算机科学领域发表的654148篇文章来验证NCCAS的可区分性和稳健性。发现与最先进的方法相比，NCCAS给出了诺贝尔奖获得者最准确的预测。此外，与其他合著者相比，NCCAS确定的论文的主要作者更容易区分。NCCAS的结果对恶意操纵也更具鲁棒性。最后，我们进行消融研究，以显示不同成分在我们的方法中的贡献。研究局限性由于作品真正的主要作者的基本事实有限，NCCAS和其他相关方法的准确性只能在诺贝尔物理学奖获奖论文中进行测试。实际意义NCCAS已成功应用于大量出版物，证明了其在分析贡献与对具有不同行序的作者的认可之间的关系方面的潜力。独创性/价值与现有方法相比，NCCAS不仅更准确地确定了论文的主要作者，而且使神化更加可区分和稳健，为相关研究提供了新的工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of data and information science (Warsaw, Poland)

自引率

0.00%

发文量