Anna Severin, Michaela Strinzel, Matthias Egger, Tiago Barros, Alexander Sokolov, Julia Vilstrup Mouatt, Stefan Müller
{"title":"Relationship between journal impact factor and the thoroughness and helpfulness of peer reviews.","authors":"Anna Severin, Michaela Strinzel, Matthias Egger, Tiago Barros, Alexander Sokolov, Julia Vilstrup Mouatt, Stefan Müller","doi":"10.1371/journal.pbio.3002238","DOIUrl":null,"url":null,"abstract":"<p><p>The Journal Impact Factor is often used as a proxy measure for journal quality, but the empirical evidence is scarce. In particular, it is unclear how peer review characteristics for a journal relate to its impact factor. We analysed 10,000 peer review reports submitted to 1,644 biomedical journals with impact factors ranging from 0.21 to 74.7. Two researchers hand-coded sentences using categories of content related to the thoroughness of the review (Materials and Methods, Presentation and Reporting, Results and Discussion, Importance and Relevance) and helpfulness (Suggestion and Solution, Examples, Praise, Criticism). We fine-tuned and validated transformer machine learning language models to classify sentences. We then examined the association between the number and percentage of sentences addressing different content categories and 10 groups defined by the Journal Impact Factor. The median length of reviews increased with higher impact factor, from 185 words (group 1) to 387 words (group 10). The percentage of sentences addressing Materials and Methods was greater in the highest Journal Impact Factor journals than in the lowest Journal Impact Factor group. The results for Presentation and Reporting went in the opposite direction, with the highest Journal Impact Factor journals giving less emphasis to such content. For helpfulness, reviews for higher impact factor journals devoted relatively less attention to Suggestion and Solution than lower impact factor journals. In conclusion, peer review in journals with higher impact factors tends to be more thorough, particularly in addressing study methods while giving relatively less emphasis to presentation or suggesting solutions. Differences were modest and variability high, indicating that the Journal Impact Factor is a bad predictor of the quality of peer review of an individual manuscript.</p>","PeriodicalId":20240,"journal":{"name":"PLoS Biology","volume":"21 8","pages":"e3002238"},"PeriodicalIF":7.8000,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464996/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1371/journal.pbio.3002238","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/8/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The Journal Impact Factor is often used as a proxy measure for journal quality, but the empirical evidence is scarce. In particular, it is unclear how peer review characteristics for a journal relate to its impact factor. We analysed 10,000 peer review reports submitted to 1,644 biomedical journals with impact factors ranging from 0.21 to 74.7. Two researchers hand-coded sentences using categories of content related to the thoroughness of the review (Materials and Methods, Presentation and Reporting, Results and Discussion, Importance and Relevance) and helpfulness (Suggestion and Solution, Examples, Praise, Criticism). We fine-tuned and validated transformer machine learning language models to classify sentences. We then examined the association between the number and percentage of sentences addressing different content categories and 10 groups defined by the Journal Impact Factor. The median length of reviews increased with higher impact factor, from 185 words (group 1) to 387 words (group 10). The percentage of sentences addressing Materials and Methods was greater in the highest Journal Impact Factor journals than in the lowest Journal Impact Factor group. The results for Presentation and Reporting went in the opposite direction, with the highest Journal Impact Factor journals giving less emphasis to such content. For helpfulness, reviews for higher impact factor journals devoted relatively less attention to Suggestion and Solution than lower impact factor journals. In conclusion, peer review in journals with higher impact factors tends to be more thorough, particularly in addressing study methods while giving relatively less emphasis to presentation or suggesting solutions. Differences were modest and variability high, indicating that the Journal Impact Factor is a bad predictor of the quality of peer review of an individual manuscript.
期刊影响因子通常被用作衡量期刊质量的指标,但经验证据很少。特别是,目前尚不清楚期刊的同行评审特征与其影响因素之间的关系。我们分析了提交给1644份生物医学期刊的10000份同行评审报告,影响因素从0.21到74.7不等。两名研究人员使用与审查的彻底性相关的内容类别(材料和方法、陈述和报告、结果和讨论、重要性和相关性)和有用性(建议和解决方案、示例、赞扬、批评)手工编码句子。我们对transformer机器学习语言模型进行了微调和验证,以对句子进行分类。然后,我们研究了针对不同内容类别和期刊影响因子定义的10组的句子数量和百分比之间的关联。评论的中位长度随着影响因素的增加而增加,从185个单词(第一组)增加到387个单词(第十组)。在期刊影响因子最高的期刊中,涉及材料和方法的句子百分比高于期刊影响因子最低的期刊组。Presentation and Reporting的结果正好相反,期刊影响因子最高的期刊对此类内容的重视程度较低。为了提供帮助,与影响因子较低的期刊相比,影响因子较高的期刊的评论对建议和解决方案的关注相对较少。总之,影响因素较高的期刊的同行评审往往更彻底,尤其是在研究方法方面,而相对较少强调陈述或建议解决方案。差异不大,变异性很高,这表明期刊影响因素对单个稿件的同行评审质量是一个糟糕的预测因素。
期刊介绍:
PLOS Biology is an open-access, peer-reviewed general biology journal published by PLOS, a nonprofit organization of scientists and physicians dedicated to making the world's scientific and medical literature freely accessible. The journal publishes new articles online weekly, with issues compiled and published monthly.
ISSN Numbers:
eISSN: 1545-7885
ISSN: 1544-9173