统计建模以更好地理解计算机科学学生

M. Sahami
{"title":"统计建模以更好地理解计算机科学学生","authors":"M. Sahami","doi":"10.1145/2899415.2925470","DOIUrl":null,"url":null,"abstract":"While educational data mining has often focused on modeling behavior at the level of individual students, we consider developing statistical models to give us insight into the dynamics of student populations. In this talk, we consider two case studies in this vein. The first involves analyzing the evolution of gender balance in a college computer science program, showing that focusing on percentages of underrepresented groups in the overall population may not always provide an accurate portrayal of the impact of various program changes. We propose a new statistical model based on Fisher's Noncentral Hypergeometric Distribution that better captures how program changes are impacting the dynamics of gender balance in a population, especially in the case where the overall population is rapidly increasing (as has been the case in CS in recent years). Our second study looks at the performance of student populations in an introductory college programming course during the past eight years to better understand the evolving mix of students' abilities given the rapid growth in the number of students taking CS courses. Often accompanying such growth is a concern from faculty that the additional students choosing to pursue computing may not have the same aptitude for the subject as was seen in prior student populations. To directly address this question, we present a statistical analysis of students' performance using mixture modeling. Importantly, in this setting many variables that would normally confound such a study are directly controlled for. We find that the distribution of student performance during this period, as reflected in their programming assignment scores, remains remarkably stable despite the large growth in course enrollments. The results of this analysis also show how conflicting perceptions of students' abilities among faculty can be consistently explained. The presentation includes work done jointly with Sarah Evans, Chris Piech, and Katie Redmond.","PeriodicalId":313971,"journal":{"name":"Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Statistical Modeling to Better Understand CS Students\",\"authors\":\"M. Sahami\",\"doi\":\"10.1145/2899415.2925470\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"While educational data mining has often focused on modeling behavior at the level of individual students, we consider developing statistical models to give us insight into the dynamics of student populations. In this talk, we consider two case studies in this vein. The first involves analyzing the evolution of gender balance in a college computer science program, showing that focusing on percentages of underrepresented groups in the overall population may not always provide an accurate portrayal of the impact of various program changes. We propose a new statistical model based on Fisher's Noncentral Hypergeometric Distribution that better captures how program changes are impacting the dynamics of gender balance in a population, especially in the case where the overall population is rapidly increasing (as has been the case in CS in recent years). Our second study looks at the performance of student populations in an introductory college programming course during the past eight years to better understand the evolving mix of students' abilities given the rapid growth in the number of students taking CS courses. Often accompanying such growth is a concern from faculty that the additional students choosing to pursue computing may not have the same aptitude for the subject as was seen in prior student populations. To directly address this question, we present a statistical analysis of students' performance using mixture modeling. Importantly, in this setting many variables that would normally confound such a study are directly controlled for. We find that the distribution of student performance during this period, as reflected in their programming assignment scores, remains remarkably stable despite the large growth in course enrollments. The results of this analysis also show how conflicting perceptions of students' abilities among faculty can be consistently explained. The presentation includes work done jointly with Sarah Evans, Chris Piech, and Katie Redmond.\",\"PeriodicalId\":313971,\"journal\":{\"name\":\"Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2899415.2925470\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2899415.2925470","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

虽然教育数据挖掘通常侧重于对单个学生的行为进行建模,但我们考虑开发统计模型,以深入了解学生群体的动态。在这次演讲中,我们将考虑这方面的两个案例研究。第一项研究分析了一所大学计算机科学课程中性别平衡的演变,结果表明,关注未被充分代表的群体在总人口中的比例,可能并不总是能准确地反映各种课程变化的影响。我们提出了一个基于Fisher非中心超几何分布的新统计模型,该模型更好地捕捉了项目变化如何影响人口中性别平衡的动态,特别是在总体人口快速增长的情况下(如近年来CS的情况)。我们的第二项研究着眼于过去八年中大学编程入门课程的学生群体的表现,以便更好地理解考虑到学习计算机科学课程的学生数量的快速增长,学生能力的演变组合。伴随这种增长而来的往往是教师们的担忧,即选择攻读计算机专业的额外学生可能不像以前的学生群体那样具有同样的能力。为了直接解决这个问题,我们使用混合模型对学生的表现进行了统计分析。重要的是,在这种情况下,许多通常会混淆这种研究的变量被直接控制。我们发现,在此期间,学生的表现分布,反映在他们的编程作业分数上,尽管课程入学人数大幅增长,但仍然非常稳定。这一分析的结果还表明,教师之间对学生能力的相互矛盾的看法是如何得到一致解释的。演示包括与Sarah Evans, Chris Piech和Katie Redmond共同完成的工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Statistical Modeling to Better Understand CS Students
While educational data mining has often focused on modeling behavior at the level of individual students, we consider developing statistical models to give us insight into the dynamics of student populations. In this talk, we consider two case studies in this vein. The first involves analyzing the evolution of gender balance in a college computer science program, showing that focusing on percentages of underrepresented groups in the overall population may not always provide an accurate portrayal of the impact of various program changes. We propose a new statistical model based on Fisher's Noncentral Hypergeometric Distribution that better captures how program changes are impacting the dynamics of gender balance in a population, especially in the case where the overall population is rapidly increasing (as has been the case in CS in recent years). Our second study looks at the performance of student populations in an introductory college programming course during the past eight years to better understand the evolving mix of students' abilities given the rapid growth in the number of students taking CS courses. Often accompanying such growth is a concern from faculty that the additional students choosing to pursue computing may not have the same aptitude for the subject as was seen in prior student populations. To directly address this question, we present a statistical analysis of students' performance using mixture modeling. Importantly, in this setting many variables that would normally confound such a study are directly controlled for. We find that the distribution of student performance during this period, as reflected in their programming assignment scores, remains remarkably stable despite the large growth in course enrollments. The results of this analysis also show how conflicting perceptions of students' abilities among faculty can be consistently explained. The presentation includes work done jointly with Sarah Evans, Chris Piech, and Katie Redmond.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信