{"title":"Comparative judgement as a research tool: A meta-analysis of application and reliability.","authors":"George Kinnear, Ian Jones, Ben Davies","doi":"10.3758/s13428-025-02744-w","DOIUrl":null,"url":null,"abstract":"<p><p>Comparative judgement (CJ) provides methods for constructing measurement scales, by asking assessors to make a series of pairwise comparisons of the artefacts or representations to be scored. Researchers using CJ need to decide how many assessors to recruit and how many comparisons to collect. They also need to gauge the reliability of the resulting measurement scale, with two different estimates in widespread use: scale separation reliability (SSR) and split-halves reliability (SHR). Previous research has offered guidance on these issues, but with either limited empirical support or focused only on education research. In this paper, we offer guidance based on our analysis of 101 CJ datasets that we collated from previous research across a range of disciplines. We present two novel findings, with substantive implications for future CJ research. First, we find that collecting ten comparisons for every representation is generally sufficient; a more lenient guideline than previously published. Second, we conclude that SSR can serve as a reliable proxy for inter-rater reliability, but recommend that researchers use a higher threshold of .8, rather than the current standard of .7.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 8","pages":"222"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12246014/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavior Research Methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13428-025-02744-w","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
Citations: 0
Abstract
Comparative judgement (CJ) provides methods for constructing measurement scales by asking assessors to make a series of pairwise comparisons of the artefacts or representations to be scored. Researchers using CJ need to decide how many assessors to recruit and how many comparisons to collect. They also need to gauge the reliability of the resulting measurement scale, with two different estimates in widespread use: scale separation reliability (SSR) and split-halves reliability (SHR). Previous research has offered guidance on these issues, but with either limited empirical support or a focus restricted to education research. In this paper, we offer guidance based on our analysis of 101 CJ datasets collated from previous research across a range of disciplines. We present two novel findings with substantive implications for future CJ research. First, we find that collecting ten comparisons for every representation is generally sufficient, a more lenient guideline than previously published. Second, we conclude that SSR can serve as a reliable proxy for inter-rater reliability, but recommend that researchers use a higher threshold of .8, rather than the current standard of .7.
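To make the methods described in the abstract concrete, below is a minimal sketch in Python of one common CJ pipeline: fitting a Bradley-Terry model to pairwise judgements via Hunter's MM algorithm, then computing a Rasch-style scale separation reliability and a split-halves reliability. The function names, the pseudo-count regularisation, and the synthetic data are illustrative choices, and the SSR/SHR formulas shown are one common formulation, not necessarily the exact estimators analysed in the paper.

```python
# Illustrative CJ pipeline: Bradley-Terry fit, SSR, and SHR (a sketch,
# not the paper's implementation).
import numpy as np


def fit_bradley_terry(pairs, n_items, iters=500, eps=0.1):
    """Fit Bradley-Terry strengths from an array of (winner, loser) index pairs."""
    wins = np.zeros(n_items)
    n = np.zeros((n_items, n_items))  # n[i, j]: comparisons between i and j
    for w, l in pairs:
        wins[w] += 1
        n[w, l] += 1
        n[l, w] += 1
    # Small pseudo-counts keep the comparison graph connected and all
    # estimates finite (a pragmatic regularisation, not from the paper).
    wins += eps * (n_items - 1)
    n += 2 * eps * (1.0 - np.eye(n_items))
    p = np.ones(n_items)
    for _ in range(iters):  # Hunter's (2004) MM updates
        denom = (n / (p[:, None] + p[None, :])).sum(axis=1)
        p = wins / denom
        p /= np.exp(np.log(p).mean())  # fix the geometric mean at 1
    theta = np.log(p)  # scale scores on the log (logit) scale
    return theta, p, n


def ssr(theta, p, n):
    """Scale separation reliability: (obs. variance - mean SE^2) / obs. variance."""
    pi = p[:, None] / (p[:, None] + p[None, :])  # win probabilities
    info = (n * pi * (1.0 - pi)).sum(axis=1)     # Fisher information per item
    se2 = 1.0 / info                             # squared standard errors
    var_obs = theta.var(ddof=1)
    return (var_obs - se2.mean()) / var_obs


def shr(pairs, n_items, n_splits=20, seed=0):
    """Split-halves reliability: median correlation of scores from random halves."""
    rng = np.random.default_rng(seed)
    cors = []
    for _ in range(n_splits):
        mask = rng.random(len(pairs)) < 0.5
        t1, _, _ = fit_bradley_terry(pairs[mask], n_items)
        t2, _, _ = fit_bradley_terry(pairs[~mask], n_items)
        cors.append(np.corrcoef(t1, t2)[0, 1])
    return float(np.median(cors))


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    n_items = 30
    true_quality = rng.normal(size=n_items)
    # Roughly ten comparisons per representation, per the paper's guideline
    # (each pairwise judgement involves two representations).
    judgements = []
    for _ in range(10 * n_items // 2):
        i, j = rng.choice(n_items, size=2, replace=False)
        p_i_wins = 1.0 / (1.0 + np.exp(-(true_quality[i] - true_quality[j])))
        judgements.append((i, j) if rng.random() < p_i_wins else (j, i))
    judgements = np.asarray(judgements)

    theta, p, n = fit_bradley_terry(judgements, n_items)
    print(f"SSR = {ssr(theta, p, n):.3f}")
    print(f"SHR = {shr(judgements, n_items):.3f}")
```

Under the paper's recommendation, the resulting SSR would be judged against a threshold of .8 rather than the traditional .7 before the scale is treated as a trustworthy proxy for inter-rater reliability.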
About the journal:
Behavior Research Methods publishes articles concerned with the methods, techniques, and instrumentation of research in experimental psychology. The journal focuses particularly on the use of computer technology in psychological research. An annual special issue is devoted to this field.