{"title":"Xiao-Shih: The Educational Intelligent Question Answering Bot on Chinese-Based MOOCs","authors":"Hao-Hsuan Hsu, N. Huang","doi":"10.1109/ICMLA.2018.00213","DOIUrl":null,"url":null,"abstract":"In this study, the educational intelligent question answering bot named Xiao-Shih has been developed for solving learners' questions as instructors and teaching assistants on MOOCs. Experiments were conducted with Xiao-Shih in a paid course titled \"Python for Data Science\" on \"ShareCourse\" which is one of the largest Chinese-based MOOC platform in Taiwan. Over one thousand discussion threads posted in both English and Chinese languages were retrieved to train the bot by natural language processing (NLP) and Random Forest (RF) in machine learning algorithms. This paper presents the implementation details on developing Xiao-Shih. First, we developed an initial version of Xiao-Shih with simply NLP techniques and text similarity approaches. However, Xiao-Shih only obtains 0.413 precision at best with different thresholds of the question similarity. Therefore, features and labels of answering correctness have been collected for the next version of Xiao-Shih. Trained by Random Forest with 70% of the entire dataset, Xiao-Shih obtains 0.833 precision with test dataset. With this educational intelligent question answering bot, learners can solve their problems immediately in seconds rather than wait for humans' response in hours even days. Moreover, Xiao-Shih can also ease instructors' and teaching assistants' burden on answering questions.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"119 1","pages":"1316-1321"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2018.00213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
In this study, the educational intelligent question answering bot named Xiao-Shih has been developed for solving learners' questions as instructors and teaching assistants on MOOCs. Experiments were conducted with Xiao-Shih in a paid course titled "Python for Data Science" on "ShareCourse" which is one of the largest Chinese-based MOOC platform in Taiwan. Over one thousand discussion threads posted in both English and Chinese languages were retrieved to train the bot by natural language processing (NLP) and Random Forest (RF) in machine learning algorithms. This paper presents the implementation details on developing Xiao-Shih. First, we developed an initial version of Xiao-Shih with simply NLP techniques and text similarity approaches. However, Xiao-Shih only obtains 0.413 precision at best with different thresholds of the question similarity. Therefore, features and labels of answering correctness have been collected for the next version of Xiao-Shih. Trained by Random Forest with 70% of the entire dataset, Xiao-Shih obtains 0.833 precision with test dataset. With this educational intelligent question answering bot, learners can solve their problems immediately in seconds rather than wait for humans' response in hours even days. Moreover, Xiao-Shih can also ease instructors' and teaching assistants' burden on answering questions.
本研究开发了教育智能问答机器人“小诗”,作为mooc的讲师和助教来解决学习者的问题。实验是在台湾最大的中文MOOC平台“sharesource”上的一门名为“Python for Data Science”的付费课程中进行的。在机器学习算法中,通过自然语言处理(NLP)和随机森林(RF)来训练机器人,检索了1000多个中英文讨论话题。本文介绍了开发小石的实现细节。首先,我们用简单的自然语言处理技术和文本相似度方法开发了一个初始版本的“小诗”。而对于不同的问题相似度阈值,xiaoshih最多只能得到0.413的精度。因此,为下一个版本的“小诗”收集了回答正确性的特征和标签。用整个数据集的70%进行随机森林训练,用测试数据集得到了0.833的精度。有了这个教育智能问答机器人,学习者可以在几秒钟内立即解决他们的问题,而不是等待人类几个小时甚至几天的回应。此外,小诗还可以减轻教师和助教回答问题的负担。