Automatic essay scoring for discussion forum in online learning based on semantic and keyword similarities

Q1 Social Sciences

AAOU Journal Pub Date : 2023-10-11 DOI:10.1108/aaouj-02-2023-0027

Bachriah Fatwa Dhini, Abba Suganda Girsang, Unggul Utan Sufandi, Heny Kurniawati

{"title":"Automatic essay scoring for discussion forum in online learning based on semantic and keyword similarities","authors":"Bachriah Fatwa Dhini, Abba Suganda Girsang, Unggul Utan Sufandi, Heny Kurniawati","doi":"10.1108/aaouj-02-2023-0027","DOIUrl":null,"url":null,"abstract":"Purpose The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes essay scoring, which is conducted through two parameters, semantic and keyword similarities, using a SentenceTransformers pre-trained model that can construct the highest vector embedding. Combining these models is used to optimize the model with increasing accuracy. Design/methodology/approach The development of the model in the study is divided into seven stages: (1) data collection, (2) pre-processing data, (3) selected pre-trained SentenceTransformers model, (4) semantic similarity (sentence pair), (5) keyword similarity, (6) calculate final score and (7) evaluating model. Findings The multilingual paraphrase-multilingual-MiniLM-L12-v2 and distilbert-base-multilingual-cased-v1 models got the highest scores from comparisons of 11 pre-trained multilingual models of SentenceTransformers with Indonesian data (Dhini and Girsang, 2023). Both multilingual models were adopted in this study. A combination of two parameters is obtained by comparing the response of the keyword extraction responses with the rubric keywords. Based on the experimental results, proposing a combination can increase the evaluation results by 0.2. Originality/value This study uses discussion forum data from the general biology course in online learning at the open university for the 2020.2 and 2021.2 semesters. Forum discussion ratings are still manual. In this survey, the authors created a model that automatically calculates the value of discussion forums, which are essays based on the lecturer's answers moreover rubrics.","PeriodicalId":32802,"journal":{"name":"AAOU Journal","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AAOU Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/aaouj-02-2023-0027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes essay scoring, which is conducted through two parameters, semantic and keyword similarities, using a SentenceTransformers pre-trained model that can construct the highest vector embedding. Combining these models is used to optimize the model with increasing accuracy. Design/methodology/approach The development of the model in the study is divided into seven stages: (1) data collection, (2) pre-processing data, (3) selected pre-trained SentenceTransformers model, (4) semantic similarity (sentence pair), (5) keyword similarity, (6) calculate final score and (7) evaluating model. Findings The multilingual paraphrase-multilingual-MiniLM-L12-v2 and distilbert-base-multilingual-cased-v1 models got the highest scores from comparisons of 11 pre-trained multilingual models of SentenceTransformers with Indonesian data (Dhini and Girsang, 2023). Both multilingual models were adopted in this study. A combination of two parameters is obtained by comparing the response of the keyword extraction responses with the rubric keywords. Based on the experimental results, proposing a combination can increase the evaluation results by 0.2. Originality/value This study uses discussion forum data from the general biology course in online learning at the open university for the 2020.2 and 2021.2 semesters. Forum discussion ratings are still manual. In this survey, the authors created a model that automatically calculates the value of discussion forums, which are essays based on the lecturer's answers moreover rubrics.

查看原文本刊更多论文

基于语义和关键词相似度的在线学习论坛作文自动评分

作者在论坛中构建了一个自动论文评分(AES)模型，其结果与人类评估者给出的分数进行比较。本研究提出论文评分，通过语义和关键词相似度两个参数进行评分，使用senencetransformers预训练模型，该模型可以构建最高的向量嵌入。结合这些模型对模型进行优化，提高了模型的精度。本研究中模型的开发分为七个阶段:(1)数据收集，(2)数据预处理，(3)选择预训练的SentenceTransformers模型，(4)语义相似度(句子对)，(5)关键词相似度，(6)计算最终分数，(7)评估模型。通过对11个预先训练的sensentetransformers多语言模型与印度尼西亚数据(Dhini和Girsang, 2023)进行比较，多语言phasase - multilinguase - minilm - l12 -v2和distilbert-base- multilingucasedv1模型得分最高。本研究采用了两种多语言模型。通过比较关键字提取响应与标题关键字的响应，得到两个参数的组合。根据实验结果，提出一种组合可以使评价结果提高0.2。本研究使用了开放大学2020.2和2021.2学期在线学习普通生物学课程的讨论论坛数据。论坛讨论评分仍然是手动的。在这项调查中，作者创建了一个模型，自动计算讨论论坛的价值，这是基于讲师的答案和标题的文章。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊