Combining empirical and machine learning techniques to predict math expertise using pen signal features

Proceedings of the 2014 ACM workshop on Multimodal Learning Analytics Workshop and Grand Challenge Pub Date : 2014-11-12 DOI:10.1145/2666633.2666638

Jianlong Zhou, Kevin Hang, S. Oviatt, Kun Yu, Fang Chen

{"title":"Combining empirical and machine learning techniques to predict math expertise using pen signal features","authors":"Jianlong Zhou, Kevin Hang, S. Oviatt, Kun Yu, Fang Chen","doi":"10.1145/2666633.2666638","DOIUrl":null,"url":null,"abstract":"Multimodal learning analytics aims to automatically analyze students' natural communication patterns based on speech, writing, and other modalities during learning activities. This research used the Math Data Corpus, which contains time-synchronized multimodal data from collaborating students as they jointly solved problems varying in difficulty. The aim was to investigate how reliably pen signal features, which were extracted as students wrote with digital pens and paper, could identify which student in a group was the dominant domain expert. An additional aim was to improve prediction of expertise based on joint bootstrapping of empirical science and machine learning techniques. To accomplish this, empirical analyses first identified which data partitioning and pen signal features were most reliably associated with expertise. Then alternative machine learning techniques compared classification accuracies based on all pen features, versus empirically selected ones. The best unguided classification accuracy was 70.8%, which improved to 83.3% with empirical guidance. These results demonstrate that handwriting signal features can predict domain expertise in math with high reliability. Hybrid methods also can outperform black-box machine learning in both accuracy and transparency.","PeriodicalId":123577,"journal":{"name":"Proceedings of the 2014 ACM workshop on Multimodal Learning Analytics Workshop and Grand Challenge","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2014 ACM workshop on Multimodal Learning Analytics Workshop and Grand Challenge","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2666633.2666638","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

Multimodal learning analytics aims to automatically analyze students' natural communication patterns based on speech, writing, and other modalities during learning activities. This research used the Math Data Corpus, which contains time-synchronized multimodal data from collaborating students as they jointly solved problems varying in difficulty. The aim was to investigate how reliably pen signal features, which were extracted as students wrote with digital pens and paper, could identify which student in a group was the dominant domain expert. An additional aim was to improve prediction of expertise based on joint bootstrapping of empirical science and machine learning techniques. To accomplish this, empirical analyses first identified which data partitioning and pen signal features were most reliably associated with expertise. Then alternative machine learning techniques compared classification accuracies based on all pen features, versus empirically selected ones. The best unguided classification accuracy was 70.8%, which improved to 83.3% with empirical guidance. These results demonstrate that handwriting signal features can predict domain expertise in math with high reliability. Hybrid methods also can outperform black-box machine learning in both accuracy and transparency.

查看原文本刊更多论文

结合经验和机器学习技术，使用笔信号特征预测数学专业知识

多模态学习分析旨在自动分析学生在学习活动中基于语音、写作和其他模态的自然交流模式。本研究使用数学数据语料库，该语料库包含合作学生在共同解决不同难度问题时的时间同步多模态数据。这项研究的目的是研究笔信号特征的可靠性，这些特征是在学生用数字笔和纸写字时提取出来的，它能识别出一组学生中哪个学生是占主导地位的领域专家。另一个目标是改进基于经验科学和机器学习技术联合引导的专业知识预测。为了实现这一点，实证分析首先确定了哪些数据分区和笔信号特征与专业知识最可靠地相关。然后，替代的机器学习技术将基于所有笔特征的分类准确性与经验选择的分类准确性进行比较。无引导的最佳分类准确率为70.8%，有经验指导的分类准确率为83.3%。这些结果表明，笔迹信号特征可以高可靠性地预测数学领域专业知识。混合方法在准确性和透明度方面也优于黑箱机器学习。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2014 ACM workshop on Multimodal Learning Analytics Workshop and Grand Challenge

自引率

0.00%

发文量