用多面Rasch评定量表测量模型考察音乐表演评定中评价者的判断。

Journal of applied measurement Pub Date : 2019-01-01

Pey Shin Ooi, George Engelhard

{"title":"用多面Rasch评定量表测量模型考察音乐表演评定中评价者的判断。","authors":"Pey Shin Ooi, George Engelhard","doi":"","DOIUrl":null,"url":null,"abstract":"The fairness of raters in music performance assessment has become an important concern in the field of music. The assessment of students' music performance depends in a fundamental way on rater judgements. The quality of rater judgements is crucial to provide fair, meaningful and informative assessments of music performance. There are many external factors that can influence the quality of rater judgements. Previous research has used different measurement models to examine the quality of rater judgements (e.g., generalizability theory). There are limitations with the previous analysis methods that are based on classical test theory and its extensions. In this study, we use modern measurement theory (Rasch measurement theory) to examine the quality of rater judgements. The many-facets Rasch rating scale model is employed to investigate the extent of rater-invariant measurement in the context of music performance assessments related to university degrees in Malaysia (159 students rated by 24 raters). We examine the rating scale structure, the severity levels of the raters, and the judged difficulty of the items. We also examine the interaction effects across musical instrument subgroups (keyboard, strings, woodwinds, brass, percussions and vocal). The results suggest that there were differences in severity levels among the raters. The results of this study also suggest that raters had different severity levels when rating different musical instrument subgroups. The implications for research, theory and practice in the assessment of music performance are included in this paper.","PeriodicalId":73608,"journal":{"name":"Journal of applied measurement","volume":"20 1","pages":"79-99"},"PeriodicalIF":0.0000,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Examining Rater Judgements in Music Performance Assessment using Many-Facets Rasch Rating Scale Measurement Model.\",\"authors\":\"Pey Shin Ooi, George Engelhard\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The fairness of raters in music performance assessment has become an important concern in the field of music. The assessment of students' music performance depends in a fundamental way on rater judgements. The quality of rater judgements is crucial to provide fair, meaningful and informative assessments of music performance. There are many external factors that can influence the quality of rater judgements. Previous research has used different measurement models to examine the quality of rater judgements (e.g., generalizability theory). There are limitations with the previous analysis methods that are based on classical test theory and its extensions. In this study, we use modern measurement theory (Rasch measurement theory) to examine the quality of rater judgements. The many-facets Rasch rating scale model is employed to investigate the extent of rater-invariant measurement in the context of music performance assessments related to university degrees in Malaysia (159 students rated by 24 raters). We examine the rating scale structure, the severity levels of the raters, and the judged difficulty of the items. We also examine the interaction effects across musical instrument subgroups (keyboard, strings, woodwinds, brass, percussions and vocal). The results suggest that there were differences in severity levels among the raters. The results of this study also suggest that raters had different severity levels when rating different musical instrument subgroups. The implications for research, theory and practice in the assessment of music performance are included in this paper.\",\"PeriodicalId\":73608,\"journal\":{\"name\":\"Journal of applied measurement\",\"volume\":\"20 1\",\"pages\":\"79-99\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of applied measurement\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of applied measurement","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

评价人员在音乐演奏评价中的公平性问题已成为音乐学界关注的一个重要问题。对学生音乐表现的评价，从根本上讲取决于对学生的判断。评估师的判断质量对于提供公平、有意义和信息丰富的音乐表演评估至关重要。有许多外部因素可以影响评级判断的质量。以前的研究使用了不同的测量模型来检验评级判断的质量(例如，概率论)。以往基于经典测试理论及其扩展的分析方法存在局限性。在本研究中，我们使用现代测量理论(Rasch测量理论)来检验评估师判断的质量。采用多面Rasch评分量表模型来调查与马来西亚大学学位相关的音乐表演评估背景下的评分不变测量程度(由24名评分者评分的159名学生)。我们考察了评定量表的结构、评定者的严重程度和评定项目的难易程度。我们还研究了乐器子组(键盘，弦乐，木管乐器，铜管乐器，打击乐器和声乐)的相互作用效果。结果表明，评分者的严重程度存在差异。本研究的结果也表明，评分者对不同乐器亚组的评分有不同的严重程度。本文对音乐表演评估的研究、理论和实践意义进行了探讨。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

本刊更多论文

Examining Rater Judgements in Music Performance Assessment using Many-Facets Rasch Rating Scale Measurement Model.

The fairness of raters in music performance assessment has become an important concern in the field of music. The assessment of students' music performance depends in a fundamental way on rater judgements. The quality of rater judgements is crucial to provide fair, meaningful and informative assessments of music performance. There are many external factors that can influence the quality of rater judgements. Previous research has used different measurement models to examine the quality of rater judgements (e.g., generalizability theory). There are limitations with the previous analysis methods that are based on classical test theory and its extensions. In this study, we use modern measurement theory (Rasch measurement theory) to examine the quality of rater judgements. The many-facets Rasch rating scale model is employed to investigate the extent of rater-invariant measurement in the context of music performance assessments related to university degrees in Malaysia (159 students rated by 24 raters). We examine the rating scale structure, the severity levels of the raters, and the judged difficulty of the items. We also examine the interaction effects across musical instrument subgroups (keyboard, strings, woodwinds, brass, percussions and vocal). The results suggest that there were differences in severity levels among the raters. The results of this study also suggest that raters had different severity levels when rating different musical instrument subgroups. The implications for research, theory and practice in the assessment of music performance are included in this paper.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of applied measurement

自引率

0.00%

发文量