Hollie Thomson, Stephanie Crawford, Jonathan J Evans
An investigation into the intra and inter rater scoring reliability of the Addenbrooke's Cognitive Examination-III
Applied Neuropsychology-Adult, pages 1-8. Journal article, published 2025-04-19. DOI: 10.1080/23279095.2025.2489632 (https://doi.org/10.1080/23279095.2025.2489632)
Abstract
Background: Cognitive screening tests are essential to the process of early detection and diagnosis of dementia. The Addenbrooke's Cognitive Examination-III (ACE-III) is one such tool. Rater reliability in scoring is an important psychometric property of all tests.
Aims: To investigate rater accuracy in scoring the ACE-III across different raters and by the same raters at two different time points. A secondary exploratory analysis examined whether scoring accuracy is associated with participants' training and experience with the ACE-III.
Methods: A filmed vignette of the ACE-III being administered to an older adult actor (mock patient) was used to assess scoring accuracy across different raters. The vignette had pre-determined "true" scores. Participants were UK National Health Service staff who routinely administer and score the ACE-III as part of their clinical practice. They were asked to view the filmed vignette and complete an ACE-III scoring sheet. After two months, participants scored the same vignette again.
Results and conclusions: At Time 1, 20% of participants' scores matched the true score, with 32% deviating by 3-5 points, and an overall range of 10 points. At Time 2, 24% of scores matched the true score, with 11% deviating by three points, and an overall range of six points. Errors were mainly accounted for by the domains requiring subjective judgements, namely the visuospatial and language subtests. Intra-rater consistency was low to moderate. Neither previous experience of using the ACE-III nor previous ACE-III training led to statistically significant differences in scoring performance. Health professionals should consider these findings when scoring the ACE-III and utilize the ACE-III administration and scoring guide to improve accuracy.
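The summary figures reported above (share of raters matching the true score, share deviating by 3-5 points, and overall range) can be illustrated with a short Python sketch. This is not the authors' analysis code. The function name `summarize_scoring`, the rater totals, and the reference total are all hypothetical, and "range" is assumed here to mean the highest rater total minus the lowest.

```python
def summarize_scoring(rater_totals, true_total):
    """Compare a list of rater total scores with a pre-determined reference total.

    Hypothetical helper for illustration; assumes 'range' means max minus min
    of the rater totals.
    """
    if not rater_totals:
        raise ValueError("rater_totals must not be empty")

    # Absolute deviation of each rater's total from the reference total.
    deviations = [abs(total - true_total) for total in rater_totals]
    n = len(rater_totals)

    return {
        # Percentage of raters whose total equals the reference total.
        "exact_match_pct": 100 * sum(d == 0 for d in deviations) / n,
        # Percentage of raters whose total is 3 to 5 points off.
        "off_by_3_to_5_pct": 100 * sum(3 <= d <= 5 for d in deviations) / n,
        # Spread between the highest and lowest rater totals.
        "range_points": max(rater_totals) - min(rater_totals),
    }


# Made-up totals for eight raters scoring one vignette (not study data).
hypothetical_totals = [88, 88, 86, 90, 85, 88, 91, 84]
hypothetical_true_total = 88

summary = summarize_scoring(hypothetical_totals, hypothetical_true_total)
print(summary)
```

Running the same function on a second set of totals from the same raters would give the corresponding Time 2 summary. A measure of intra-rater consistency would need a separate statistic, which the abstract does not specify.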
About the journal:
Applied Neuropsychology-Adult publishes clinical neuropsychological articles concerning assessment, brain functioning and neuroimaging, neuropsychological treatment, and rehabilitation in adults. Full-length articles and brief communications are included. Case studies of adult patients that carefully assess the nature, course, or treatment of clinical neuropsychological dysfunctions in the context of the scientific literature are suitable. Review manuscripts addressing critical issues are encouraged. Preference is given to papers of clinical relevance to others in the field. All submitted manuscripts are subject to initial appraisal by the Editor-in-Chief and, if found suitable for further consideration, are peer reviewed by independent, anonymous expert referees. All peer review is single-blind and submission is online via ScholarOne Manuscripts.