Language independent analysis and classification of discussion threads in Coursera MOOC forums

Lorenzo A. Rossi, O. Gnawali
{"title":"Language independent analysis and classification of discussion threads in Coursera MOOC forums","authors":"Lorenzo A. Rossi, O. Gnawali","doi":"10.1109/IRI.2014.7051952","DOIUrl":null,"url":null,"abstract":"In this work, we analyze the discussion threads from the forums of 60 Massive Open Online Courses (MOOCs) offered by Coursera and taught in 4 different languages. The types of interactions in such threads vary: there are discussions on close ended problems (e.g. solutions to assignments), open ended topics, course logistics, or just small talk among fellow students. We first study the evolution of the forum activities with respect to the normalized course duration. Then we investigate several language independent features to classify the discussion threads based on the types of the interactions among the users. We use default Coursera subforum categories (Study Groups, Assignments, Lectures, ...) to define the classes of interest and so the labels. We extract features related to structure, popularity, temporal dynamics of threads and diversity of the ids of the users. Text related features, word count aside, are avoided to apply the methods across discussion threads written in different languages and with various technical terminologies. Experiments show a classification performance with ROCAUC between 0.58 and 0.89, depending on the subforum class considered and with possibly noisy labels.","PeriodicalId":360013,"journal":{"name":"Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"70","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2014.7051952","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 70

Abstract

In this work, we analyze the discussion threads from the forums of 60 Massive Open Online Courses (MOOCs) offered by Coursera and taught in 4 different languages. The types of interactions in such threads vary: there are discussions on close ended problems (e.g. solutions to assignments), open ended topics, course logistics, or just small talk among fellow students. We first study the evolution of the forum activities with respect to the normalized course duration. Then we investigate several language independent features to classify the discussion threads based on the types of the interactions among the users. We use default Coursera subforum categories (Study Groups, Assignments, Lectures, ...) to define the classes of interest and so the labels. We extract features related to structure, popularity, temporal dynamics of threads and diversity of the ids of the users. Text related features, word count aside, are avoided to apply the methods across discussion threads written in different languages and with various technical terminologies. Experiments show a classification performance with ROCAUC between 0.58 and 0.89, depending on the subforum class considered and with possibly noisy labels.
Coursera MOOC论坛讨论主题的语言独立分析与分类
在这项工作中,我们分析了来自Coursera以4种不同语言提供的60门大规模开放在线课程(MOOCs)论坛的讨论主题。在这些线程中,交互的类型各不相同:有关于封闭式问题(例如作业的解决方案)的讨论,开放式主题,课程后勤,或者只是同学之间的闲聊。我们首先研究了论坛活动相对于标准化课程持续时间的演变。然后,我们研究了几个与语言无关的特征,根据用户之间的交互类型对讨论线程进行分类。我们使用默认的Coursera子论坛类别(学习小组、作业、讲座等)来定义感兴趣的类别和标签。我们提取与结构、流行度、线程时间动态和用户id多样性相关的特征。除了单词计数之外,还避免了与文本相关的特性,以便在使用不同语言和各种技术术语编写的讨论线程中应用这些方法。实验表明,ROCAUC的分类性能在0.58和0.89之间,这取决于所考虑的子论坛类别和可能带有噪声的标签。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信