{"title":"Reliability for multilevel data: A correlation approach.","authors":"Tzu-Yao Lin, Francis Tuerlinckx, Sophie Vanbelle","doi":"10.1037/met0000738","DOIUrl":null,"url":null,"abstract":"<p><p>Studying the reliability of a measurement instrument is essential. Despite the recognition of the importance of reliability in psychology and medicine and the various reliability coefficients that have been proposed, research on reliability for nested or multilevel data, ubiquitously in observational studies, remains limited. Two recent articles (Schönbrodt et al., 2022; ten Hove et al., 2022) address how to quantify reliability in multilevel settings based on generalizability theory. Specifically, ten Hove et al. (2022) defined between-cluster and within-cluster interrater intraclass correlation coefficients for multilevel designs where persons or raters are nested within clusters. Schönbrodt et al. (2022) also defined reliability coefficients at between-cluster and within-cluster (i.e., between-person) levels for designs where persons nested in couples are assessed numerous times daily over a number of days. Nevertheless, when applied to a common design, both approaches give inconsistent results regarding their definition of cluster-level reliability. In this article, we propose an alternative approach to defining reliability coefficients for multilevel data that are based on calculating the expected correlation between repeated measurements. We will compare our approach with that of Schönbrodt et al. (2022) and ten Hove et al. (2022) and explain the differences between the three approaches in a number of common nested data structures: (a) raters crossed with both persons and clusters, but persons are nested within clusters, (b) raters nested within both persons and clusters, and (c) persons nested in clusters and crossed with raters and days. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.6000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychological methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/met0000738","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Studying the reliability of a measurement instrument is essential. Despite the recognition of the importance of reliability in psychology and medicine and the various reliability coefficients that have been proposed, research on reliability for nested or multilevel data, ubiquitously in observational studies, remains limited. Two recent articles (Schönbrodt et al., 2022; ten Hove et al., 2022) address how to quantify reliability in multilevel settings based on generalizability theory. Specifically, ten Hove et al. (2022) defined between-cluster and within-cluster interrater intraclass correlation coefficients for multilevel designs where persons or raters are nested within clusters. Schönbrodt et al. (2022) also defined reliability coefficients at between-cluster and within-cluster (i.e., between-person) levels for designs where persons nested in couples are assessed numerous times daily over a number of days. Nevertheless, when applied to a common design, both approaches give inconsistent results regarding their definition of cluster-level reliability. In this article, we propose an alternative approach to defining reliability coefficients for multilevel data that are based on calculating the expected correlation between repeated measurements. We will compare our approach with that of Schönbrodt et al. (2022) and ten Hove et al. (2022) and explain the differences between the three approaches in a number of common nested data structures: (a) raters crossed with both persons and clusters, but persons are nested within clusters, (b) raters nested within both persons and clusters, and (c) persons nested in clusters and crossed with raters and days. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
期刊介绍:
Psychological Methods is devoted to the development and dissemination of methods for collecting, analyzing, understanding, and interpreting psychological data. Its purpose is the dissemination of innovations in research design, measurement, methodology, and quantitative and qualitative analysis to the psychological community; its further purpose is to promote effective communication about related substantive and methodological issues. The audience is expected to be diverse and to include those who develop new procedures, those who are responsible for undergraduate and graduate training in design, measurement, and statistics, as well as those who employ those procedures in research.