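Investigating the Skills Involved in Reading Test Tasks through Expert Judgement and Verbal Protocol Analysis: Convergence and Divergence between the Two Methods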

IF 2.8 Zone 2 (Literature) 0 LANGUAGE & LINGUISTICS

Language Assessment Quarterly, Pub Date: 2021-03-23, DOI: 10.1080/15434303.2021.1881964

Xiaohua Liu, J. Read

Volume 18, pages 357-381. Publication type: Journal Article.

Citations: 2


Investigating the Skills Involved in Reading Test Tasks through Expert Judgement and Verbal Protocol Analysis: Convergence and Divergence between the Two Methods

ABSTRACT Expert judgement has been frequently employed with reading assessments to gauge the skills potentially measured by test tasks, for purposes such as construct validation or producing diagnostic information. Despite the critical role it plays in such endeavours, few studies have triangulated its results with other types of data such as reported test-taking processes. A lack of such triangulation may bring the validity of experts’ judgements into question and undermine the credibility of subsequent procedures that build on them. In light of this, this study compared two groups of language experts’ judgements on the content of two sets of reading test tasks with ten university students’ verbal reports on solving those tasks. It was found that convergence was achieved between the two information sources for about 53% of the test tasks on what they were mainly assessing. However, there was a bigger gap between them regarding the specific skills involved in each task. A careful examination of the discrepancies between the two sources revealed that they are attributable to a number of factors. This study highlights the need to cross-check the results of expert judgement with other data sources. Implications for future test development and research are also discussed.


Source journal

Language Assessment Quarterly Multiple-

CiteScore

6.40

Self-citation rate

3.40%

Articles published