Evaluating German PISA stratification designs: a simulation study

IF 2.6 Q1 EDUCATION & EDUCATIONAL RESEARCH
Julia Mang, Helmut Küchenhoff, Sabine Meinck
{"title":"Evaluating German PISA stratification designs: a simulation study","authors":"Julia Mang, Helmut Küchenhoff, Sabine Meinck","doi":"10.1186/s40536-024-00203-0","DOIUrl":null,"url":null,"abstract":"<p>Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the <i>Programme for International Student Assessment</i> (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance can improve the efficiency of a survey design. Second, stratification allows one to, explicitly or implicitly, control for sample sizes across subpopulations. It ensures that some parts of a population are in the sample in predetermined proportions. In this study, we determine through simulation which stratification scheme is best for PISA in Germany. For this, we consider the constraints imposed by the international sampling design, the available information about schools, and specific national characteristics of the German educational system. We examine seven different stratification designs selected based on scenarios used in past LSAs in Germany and theoretical considerations for future implementations. The chosen scenarios were compared with two reference scenarios: (1) an unstratified design and (2) a synthetic optimal stratification design. The simulation study reveals that the stratification design currently applied in PISA produces satisfactory results regarding sampling precision. The present stratification design is based on Germany's federal states and school types. However, this approach leads to small strata, which has been problematic for estimating sampling variance in previous cycles. Therefore, alternative stratification scenarios were considered and, in addition to overcoming the small-strata problem, also led to smaller standard errors for estimates of student mean performance in mathematics, science, and reading. As a result of this study, we recommend considering three different stratification designs for Germany in future cycles of PISA. These recommendations aim to: (1) improve the sampling efficiency while keeping the sample size constant, (2) follow a sound methodological approach, and (3) make conservative and cautious changes while maintaining a reflection of the structure of the German federal school system with different school types. These suggestions include a reinvented stratification of grouped German federal states and designs with school types as explicit stratifiers and federal states as implicit stratifiers.</p>","PeriodicalId":37009,"journal":{"name":"Large-Scale Assessments in Education","volume":"94 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Large-Scale Assessments in Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40536-024-00203-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0

Abstract

Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the Programme for International Student Assessment (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance can improve the efficiency of a survey design. Second, stratification allows one to, explicitly or implicitly, control for sample sizes across subpopulations. It ensures that some parts of a population are in the sample in predetermined proportions. In this study, we determine through simulation which stratification scheme is best for PISA in Germany. For this, we consider the constraints imposed by the international sampling design, the available information about schools, and specific national characteristics of the German educational system. We examine seven different stratification designs selected based on scenarios used in past LSAs in Germany and theoretical considerations for future implementations. The chosen scenarios were compared with two reference scenarios: (1) an unstratified design and (2) a synthetic optimal stratification design. The simulation study reveals that the stratification design currently applied in PISA produces satisfactory results regarding sampling precision. The present stratification design is based on Germany's federal states and school types. However, this approach leads to small strata, which has been problematic for estimating sampling variance in previous cycles. Therefore, alternative stratification scenarios were considered and, in addition to overcoming the small-strata problem, also led to smaller standard errors for estimates of student mean performance in mathematics, science, and reading. As a result of this study, we recommend considering three different stratification designs for Germany in future cycles of PISA. These recommendations aim to: (1) improve the sampling efficiency while keeping the sample size constant, (2) follow a sound methodological approach, and (3) make conservative and cautious changes while maintaining a reflection of the structure of the German federal school system with different school types. These suggestions include a reinvented stratification of grouped German federal states and designs with school types as explicit stratifiers and federal states as implicit stratifiers.

Abstract Image

评估德国国际学生评估项目分层设计:模拟研究
分层是许多采用复杂抽样设计的研究的一个重要设计特征,它经常用于大规模评估 (LSA)研究,如国际学生评估项目(PISA),主要有两个原因。首先,分层变量的层间方差大、层内方差小,可以提高调查设计的效率。其次,分层可以或明或暗地控制各子人群的样本量。它能确保人口中的某些部分按预定比例进入样本。在本研究中,我们通过模拟确定哪种分层方案最适合德国国际学生评估项目。为此,我们考虑了国际抽样设计所带来的限制、现有的学校信息以及德国教育系统的具体国家特征。我们研究了七种不同的分层设计,这些设计是根据德国过去的 LSA 所使用的方案以及对未来实施的理论考虑而选定的。我们将所选方案与两种参考方案进行了比较:(1) 无分层设计和 (2) 合成最优分层设计。模拟研究表明,PISA 目前采用的分层设计在抽样精度方面取得了令人满意的结果。目前的分层设计基于德国的联邦州和学校类型。然而,这种方法导致分层较小,这在以前的周期中对抽样方差的估计造成了问题。因此,我们考虑了其他分层方案,除了克服小分层问题外,还缩小了数学、科学和阅读方面学生平均成绩估计值的标准误差。通过这项研究,我们建议德国在未来的国际学生评估项目中考虑三种不同的分层设计。这些建议旨在(1) 在保持样本量不变的情况下,提高抽样效率;(2) 遵循合理的方法论;(3) 在保持反映德国联邦学校系统不同学校类型结构的情况下,做出保守而谨慎的改变。这些建议包括对德国联邦州进行重新分层,以及将学校类型作为显性分层和联邦州作为隐性分层的设计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Large-Scale Assessments in Education
Large-Scale Assessments in Education Social Sciences-Education
CiteScore
4.30
自引率
6.50%
发文量
16
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信