Evaluating Sentence-BERT-powered learning analytics for automated assessment of students' causal diagrams

IF 5.1 2区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH
Héctor J. Pijeira-Díaz, Shashank Subramanya, Janneke van de Pol, Anique de Bruin
{"title":"Evaluating Sentence-BERT-powered learning analytics for automated assessment of students' causal diagrams","authors":"Héctor J. Pijeira-Díaz,&nbsp;Shashank Subramanya,&nbsp;Janneke van de Pol,&nbsp;Anique de Bruin","doi":"10.1111/jcal.12992","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>When learning causal relations, completing causal diagrams enhances students' comprehension judgements to some extent. To potentially boost this effect, advances in natural language processing (NLP) enable real-time formative feedback based on the automated assessment of students' diagrams, which can involve the correctness of both the responses and their position in the causal chain. However, the responsible adoption and effectiveness of automated diagram assessment depend on its reliability.</p>\n </section>\n \n <section>\n \n <h3> Objectives</h3>\n \n <p>In this study, we compare two Dutch pre-trained models (i.e., based on RobBERT and BERTje) in combination with two machine-learning classifiers—Support Vector Machine (SVM) and Neural Networks (NN), in terms of different indicators of automated diagram assessment reliability. We also contrast two techniques (i.e., semantic similarity and machine learning) for estimating the correct position of a student diagram response in the causal chain.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>For training and evaluation of the models, we capitalize on a human-labelled dataset containing 2900+ causal diagrams completed by 700+ secondary school students, accumulated from previous diagramming experiments.</p>\n </section>\n \n <section>\n \n <h3> Results and Conclusions</h3>\n \n <p>In predicting correct responses, 86% accuracy and Cohen's <i>κ</i> of 0.69 were reached, with combinations using SVM being roughly three-times faster (important for real-time applications) than their NN counterparts. In terms of predicting the response position in the causal diagrams, 92% accuracy and 0.89 Cohen's <i>κ</i> were reached.</p>\n </section>\n \n <section>\n \n <h3> Implications</h3>\n \n <p>Taken together, these evaluation figures equip educational designers for decision-making on when these NLP-powered learning analytics are warranted for automated formative feedback in causal relation learning; thereby potentially enabling real-time feedback for learners and reducing teachers' workload.</p>\n </section>\n </div>","PeriodicalId":48071,"journal":{"name":"Journal of Computer Assisted Learning","volume":"40 6","pages":"2667-2680"},"PeriodicalIF":5.1000,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jcal.12992","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer Assisted Learning","FirstCategoryId":"95","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jcal.12992","RegionNum":2,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0

Abstract

Background

When learning causal relations, completing causal diagrams enhances students' comprehension judgements to some extent. To potentially boost this effect, advances in natural language processing (NLP) enable real-time formative feedback based on the automated assessment of students' diagrams, which can involve the correctness of both the responses and their position in the causal chain. However, the responsible adoption and effectiveness of automated diagram assessment depend on its reliability.

Objectives

In this study, we compare two Dutch pre-trained models (i.e., based on RobBERT and BERTje) in combination with two machine-learning classifiers—Support Vector Machine (SVM) and Neural Networks (NN), in terms of different indicators of automated diagram assessment reliability. We also contrast two techniques (i.e., semantic similarity and machine learning) for estimating the correct position of a student diagram response in the causal chain.

Methods

For training and evaluation of the models, we capitalize on a human-labelled dataset containing 2900+ causal diagrams completed by 700+ secondary school students, accumulated from previous diagramming experiments.

Results and Conclusions

In predicting correct responses, 86% accuracy and Cohen's κ of 0.69 were reached, with combinations using SVM being roughly three-times faster (important for real-time applications) than their NN counterparts. In terms of predicting the response position in the causal diagrams, 92% accuracy and 0.89 Cohen's κ were reached.

Implications

Taken together, these evaluation figures equip educational designers for decision-making on when these NLP-powered learning analytics are warranted for automated formative feedback in causal relation learning; thereby potentially enabling real-time feedback for learners and reducing teachers' workload.

Abstract Image

评估由句子-BERT 驱动的学习分析技术,以自动评估学生的因果关系图
在学习因果关系时,完成因果关系图可以在一定程度上提高学生的理解判断能力。为了提高这种效果,自然语言处理(NLP)技术的进步使得基于学生图表自动评估的实时形成性反馈成为可能,这可能涉及回答的正确性及其在因果链中的位置。在本研究中,我们比较了两种荷兰预训练模型(即基于 RobBERT 和 BERTje 的模型)与两种机器学习分类器(支持向量机 (SVM) 和神经网络 (NN))在自动图表评估可靠性的不同指标方面的组合。为了对模型进行训练和评估,我们利用了一个人工标注的数据集,该数据集包含由 700 多名中学生完成的 2900 多张因果图,这些数据都是在以前的图表制作实验中积累的。在预测正确答案方面,准确率达到了 86%,科恩κ值为 0.69,使用 SVM 的组合比使用 NN 的组合快大约三倍(对于实时应用非常重要)。总之,这些评估数据为教育设计者提供了决策依据,以确定在因果关系学习中何时需要使用这些由 NLP 驱动的学习分析技术来实现自动形成性反馈,从而为学习者提供实时反馈并减轻教师的工作量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Computer Assisted Learning
Journal of Computer Assisted Learning EDUCATION & EDUCATIONAL RESEARCH-
CiteScore
9.70
自引率
6.00%
发文量
116
期刊介绍: The Journal of Computer Assisted Learning is an international peer-reviewed journal which covers the whole range of uses of information and communication technology to support learning and knowledge exchange. It aims to provide a medium for communication among researchers as well as a channel linking researchers, practitioners, and policy makers. JCAL is also a rich source of material for master and PhD students in areas such as educational psychology, the learning sciences, instructional technology, instructional design, collaborative learning, intelligent learning systems, learning analytics, open, distance and networked learning, and educational evaluation and assessment. This is the case for formal (e.g., schools), non-formal (e.g., workplace learning) and informal learning (e.g., museums and libraries) situations and environments. Volumes often include one Special Issue which these provides readers with a broad and in-depth perspective on a specific topic. First published in 1985, JCAL continues to have the aim of making the outcomes of contemporary research and experience accessible. During this period there have been major technological advances offering new opportunities and approaches in the use of a wide range of technologies to support learning and knowledge transfer more generally. There is currently much emphasis on the use of network functionality and the challenges its appropriate uses pose to teachers/tutors working with students locally and at a distance. JCAL welcomes: -Empirical reports, single studies or programmatic series of studies on the use of computers and information technologies in learning and assessment -Critical and original meta-reviews of literature on the use of computers for learning -Empirical studies on the design and development of innovative technology-based systems for learning -Conceptual articles on issues relating to the Aims and Scope
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信