{"title":"从 MOOC 讨论中提取紧急问题:基于 BERT 的多输出分类方法","authors":"Mujtaba Sultani, Negin Daneshpour","doi":"10.1007/s13369-024-09090-7","DOIUrl":null,"url":null,"abstract":"<div><p>Online discussion forums are widely used by students to ask and answer questions related to their learning topics. However, not all questions posted by students receive timely and appropriate feedback from instructors, which can affect the quality and effectiveness of the online learning experience. Therefore, it is important to automatically identify and prioritize student questions from online discussion forums, so that instructors can provide better support and guidance to the students. In this paper, we propose a novel hybrid convolutional neural network (CNN) + bidirectional gated recurrent unit (Bi-GRU) multi-output classification model, which can perform this task with high accuracy and efficiency. Our model consists of two outputs: the first one classifies whether the post is a question or not, and the second one classifies whether the classified question is urgent or not urgent. Our model leverages the advantages of both CNN and Bi-GRU layers to capture both local and global features of the input data, as well as the Bidirectional Encoder Representations from Transformers (BERT) model to provide rich and contextualized word embeddings. The model achieves an <i>F</i>1-weighted score of 94.8% when classifying whether the posts are questions or not, and obtains an 88.5% <i>F</i>1-weighted score while classifying the question into urgent and non-urgent. Distinguishing and classifying urgent student questions with high accuracy and coverage can help instructors provide timely and appropriate feedback, a key factor in reducing dropout rates and improving completion rates.</p></div>","PeriodicalId":54354,"journal":{"name":"Arabian Journal for Science and Engineering","volume":"50 2","pages":"1169 - 1190"},"PeriodicalIF":2.6000,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Extracting Urgent Questions from MOOC Discussions: A BERT-Based Multi-output Classification Approach\",\"authors\":\"Mujtaba Sultani, Negin Daneshpour\",\"doi\":\"10.1007/s13369-024-09090-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Online discussion forums are widely used by students to ask and answer questions related to their learning topics. However, not all questions posted by students receive timely and appropriate feedback from instructors, which can affect the quality and effectiveness of the online learning experience. Therefore, it is important to automatically identify and prioritize student questions from online discussion forums, so that instructors can provide better support and guidance to the students. In this paper, we propose a novel hybrid convolutional neural network (CNN) + bidirectional gated recurrent unit (Bi-GRU) multi-output classification model, which can perform this task with high accuracy and efficiency. Our model consists of two outputs: the first one classifies whether the post is a question or not, and the second one classifies whether the classified question is urgent or not urgent. Our model leverages the advantages of both CNN and Bi-GRU layers to capture both local and global features of the input data, as well as the Bidirectional Encoder Representations from Transformers (BERT) model to provide rich and contextualized word embeddings. The model achieves an <i>F</i>1-weighted score of 94.8% when classifying whether the posts are questions or not, and obtains an 88.5% <i>F</i>1-weighted score while classifying the question into urgent and non-urgent. Distinguishing and classifying urgent student questions with high accuracy and coverage can help instructors provide timely and appropriate feedback, a key factor in reducing dropout rates and improving completion rates.</p></div>\",\"PeriodicalId\":54354,\"journal\":{\"name\":\"Arabian Journal for Science and Engineering\",\"volume\":\"50 2\",\"pages\":\"1169 - 1190\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Arabian Journal for Science and Engineering\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s13369-024-09090-7\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arabian Journal for Science and Engineering","FirstCategoryId":"103","ListUrlMain":"https://link.springer.com/article/10.1007/s13369-024-09090-7","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
在线论坛被学生广泛用于提出和回答与学习主题相关的问题。然而,并非所有学生发布的问题都能得到教师及时、适当的反馈,这会影响在线学习体验的质量和效果。因此,从在线论坛中自动识别学生的问题并对其进行优先排序非常重要,这样教师就能为学生提供更好的支持和指导。在本文中,我们提出了一种新型混合卷积神经网络(CNN)+ 双向门控递归单元(Bi-GRU)多输出分类模型,它可以高精度、高效率地完成这项任务。我们的模型由两个输出组成:第一个输出对帖子是否为问题进行分类,第二个输出对分类后的问题是否紧急进行分类。我们的模型充分利用了 CNN 层和 Bi-GRU 层的优势来捕捉输入数据的局部和全局特征,并利用来自变换器的双向编码器表示(BERT)模型来提供丰富的上下文化单词嵌入。在对帖子是否为问题进行分类时,该模型的 F1 加权得分达到 94.8%,在对问题进行紧急和非紧急分类时,该模型的 F1 加权得分达到 88.5%。以较高的准确率和覆盖率对学生的紧急问题进行区分和分类,可以帮助教师及时提供适当的反馈,这是降低辍学率和提高完成率的关键因素。
Extracting Urgent Questions from MOOC Discussions: A BERT-Based Multi-output Classification Approach
Online discussion forums are widely used by students to ask and answer questions related to their learning topics. However, not all questions posted by students receive timely and appropriate feedback from instructors, which can affect the quality and effectiveness of the online learning experience. Therefore, it is important to automatically identify and prioritize student questions from online discussion forums, so that instructors can provide better support and guidance to the students. In this paper, we propose a novel hybrid convolutional neural network (CNN) + bidirectional gated recurrent unit (Bi-GRU) multi-output classification model, which can perform this task with high accuracy and efficiency. Our model consists of two outputs: the first one classifies whether the post is a question or not, and the second one classifies whether the classified question is urgent or not urgent. Our model leverages the advantages of both CNN and Bi-GRU layers to capture both local and global features of the input data, as well as the Bidirectional Encoder Representations from Transformers (BERT) model to provide rich and contextualized word embeddings. The model achieves an F1-weighted score of 94.8% when classifying whether the posts are questions or not, and obtains an 88.5% F1-weighted score while classifying the question into urgent and non-urgent. Distinguishing and classifying urgent student questions with high accuracy and coverage can help instructors provide timely and appropriate feedback, a key factor in reducing dropout rates and improving completion rates.
期刊介绍:
King Fahd University of Petroleum & Minerals (KFUPM) partnered with Springer to publish the Arabian Journal for Science and Engineering (AJSE).
AJSE, which has been published by KFUPM since 1975, is a recognized national, regional and international journal that provides a great opportunity for the dissemination of research advances from the Kingdom of Saudi Arabia, MENA and the world.