对外汉语课堂评价工具的测量、质量和评定量表功能

对外汉语研究 Pub Date : 2019-04-24 DOI:10.4324/9780203709740-10

Shuai Li, Yali Feng, Ting-Sheng Wen

{"title":"对外汉语课堂评价工具的测量、质量和评定量表功能","authors":"Shuai Li, Yali Feng, Ting-Sheng Wen","doi":"10.4324/9780203709740-10","DOIUrl":null,"url":null,"abstract":"It is a wide practice that Chinese language instructors develop their own instruments for classroom assessment and make important pedagogical decisions (e.g., assigning grades) accordingly. However, the quality of such instruments has rarely been discussed in the literature. This chapter focuses on the measurement quality of an instructor-developed test used as a final written exam in an undergraduate Chinese language course in the U.S. The test was designed to assess the linguistic knowledge taught in the course and contained 37 binary-scored (0/1) items and 17 constructed-response items. Two four-category rating scales were developed to evaluate the constructed responses. Examinees were 88 students enrolled in the Chinese course. Results showed acceptable overall measurement quality of the test as indicated by measures of difficulty, discrimination, reliability, and Rasch model fit. The two rating scales, however, were found to include excessive score categories, suggesting measurement redundancy. The findings of this study are intended to raise the awareness among CSL instructors of the potential limitations of their self-developed assessment instruments.","PeriodicalId":62305,"journal":{"name":"对外汉语研究","volume":"53 1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Measurement Quality and Rating Scale Functioning of a CSL Classroom Assessment Instrument\",\"authors\":\"Shuai Li, Yali Feng, Ting-Sheng Wen\",\"doi\":\"10.4324/9780203709740-10\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is a wide practice that Chinese language instructors develop their own instruments for classroom assessment and make important pedagogical decisions (e.g., assigning grades) accordingly. However, the quality of such instruments has rarely been discussed in the literature. This chapter focuses on the measurement quality of an instructor-developed test used as a final written exam in an undergraduate Chinese language course in the U.S. The test was designed to assess the linguistic knowledge taught in the course and contained 37 binary-scored (0/1) items and 17 constructed-response items. Two four-category rating scales were developed to evaluate the constructed responses. Examinees were 88 students enrolled in the Chinese course. Results showed acceptable overall measurement quality of the test as indicated by measures of difficulty, discrimination, reliability, and Rasch model fit. The two rating scales, however, were found to include excessive score categories, suggesting measurement redundancy. The findings of this study are intended to raise the awareness among CSL instructors of the potential limitations of their self-developed assessment instruments.\",\"PeriodicalId\":62305,\"journal\":{\"name\":\"对外汉语研究\",\"volume\":\"53 1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"对外汉语研究\",\"FirstCategoryId\":\"1092\",\"ListUrlMain\":\"https://doi.org/10.4324/9780203709740-10\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"对外汉语研究","FirstCategoryId":"1092","ListUrlMain":"https://doi.org/10.4324/9780203709740-10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

汉语教师开发自己的课堂评估工具，并据此做出重要的教学决策(如评分)，这是一种广泛的做法。然而，这些仪器的质量在文献中很少被讨论。本章的重点是在美国的本科汉语课程的期末笔试中使用教师开发的测试的测量质量。该测试旨在评估课程中教授的语言知识，包含37个二元得分(0/1)项目和17个构造反应项目。开发了两个四类评定量表来评估构建的反应。参加考试的88名学生选修了中文课程。结果显示测试的总体测量质量是可接受的，如难度、辨别、信度和Rasch模型拟合的测量。然而，这两个评定量表被发现包括过多的得分类别，表明测量冗余。本研究的结果旨在提高对外汉语教学教师对其自行开发的评估工具的潜在局限性的认识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Measurement Quality and Rating Scale Functioning of a CSL Classroom Assessment Instrument

It is a wide practice that Chinese language instructors develop their own instruments for classroom assessment and make important pedagogical decisions (e.g., assigning grades) accordingly. However, the quality of such instruments has rarely been discussed in the literature. This chapter focuses on the measurement quality of an instructor-developed test used as a final written exam in an undergraduate Chinese language course in the U.S. The test was designed to assess the linguistic knowledge taught in the course and contained 37 binary-scored (0/1) items and 17 constructed-response items. Two four-category rating scales were developed to evaluate the constructed responses. Examinees were 88 students enrolled in the Chinese course. Results showed acceptable overall measurement quality of the test as indicated by measures of difficulty, discrimination, reliability, and Rasch model fit. The two rating scales, however, were found to include excessive score categories, suggesting measurement redundancy. The findings of this study are intended to raise the awareness among CSL instructors of the potential limitations of their self-developed assessment instruments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

对外汉语研究

自引率

0.00%

发文量

450