Each Encounter Counts: Modeling Language Learning and Forgetting

LAK23: 13th International Learning Analytics and Knowledge Conference Pub Date : 2023-03-13 DOI:10.1145/3576050.3576062

B. Ma, G. Hettiarachchi, Sora Fukui, Yuji Ando

{"title":"Each Encounter Counts: Modeling Language Learning and Forgetting","authors":"B. Ma, G. Hettiarachchi, Sora Fukui, Yuji Ando","doi":"10.1145/3576050.3576062","DOIUrl":null,"url":null,"abstract":"Language learning applications usually estimate the learner’s language knowledge over time to provide personalized practice content for each learner at the optimal timing. However, accurately predicting language knowledge or linguistic skills is much more challenging than math or science knowledge, as many language tasks involve memorization and retrieval. Learners must memorize a large number of words and meanings, which are prone to be forgotten without practice. Although a few studies consider forgetting when modeling learners’ language knowledge, they tend to apply traditional models, consider only partial information about forgetting, and ignore linguistic features that may significantly influence learning and forgetting. This paper focuses on modeling and predicting learners’ knowledge by considering their forgetting behavior and linguistic features in language learning. Specifically, we first explore the existence of forgetting behavior and cross-effects in real-world language learning datasets through empirical studies. Based on these, we propose a model for predicting the probability of recalling a word given a learner’s practice history. The model incorporates key information related to forgetting, question formats, and semantic similarities between words using the attention mechanism. Experiments on two real-world datasets show that the proposed model improves performance compared to baselines. Moreover, the results indicate that combining multiple types of forgetting information and item format improves performance. In addition, we find that incorporating semantic features, such as word embeddings, to model similarities between words in a learner’s practice history and their effects on memory also improves the model.","PeriodicalId":394433,"journal":{"name":"LAK23: 13th International Learning Analytics and Knowledge Conference","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"LAK23: 13th International Learning Analytics and Knowledge Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3576050.3576062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Language learning applications usually estimate the learner’s language knowledge over time to provide personalized practice content for each learner at the optimal timing. However, accurately predicting language knowledge or linguistic skills is much more challenging than math or science knowledge, as many language tasks involve memorization and retrieval. Learners must memorize a large number of words and meanings, which are prone to be forgotten without practice. Although a few studies consider forgetting when modeling learners’ language knowledge, they tend to apply traditional models, consider only partial information about forgetting, and ignore linguistic features that may significantly influence learning and forgetting. This paper focuses on modeling and predicting learners’ knowledge by considering their forgetting behavior and linguistic features in language learning. Specifically, we first explore the existence of forgetting behavior and cross-effects in real-world language learning datasets through empirical studies. Based on these, we propose a model for predicting the probability of recalling a word given a learner’s practice history. The model incorporates key information related to forgetting, question formats, and semantic similarities between words using the attention mechanism. Experiments on two real-world datasets show that the proposed model improves performance compared to baselines. Moreover, the results indicate that combining multiple types of forgetting information and item format improves performance. In addition, we find that incorporating semantic features, such as word embeddings, to model similarities between words in a learner’s practice history and their effects on memory also improves the model.

查看原文本刊更多论文

每一次相遇都很重要:模拟语言学习和遗忘

语言学习应用通常会估算学习者一段时间内的语言知识，以便在最佳时机为每位学习者提供个性化的练习内容。然而，准确预测语言知识或语言技能比数学或科学知识更具挑战性，因为许多语言任务涉及记忆和检索。学习者必须记住大量的单词和含义，这些单词和含义很容易在不练习的情况下被遗忘。虽然一些研究在对学习者的语言知识建模时考虑了遗忘，但它们往往采用传统的模型，只考虑了关于遗忘的部分信息，而忽略了可能显著影响学习和遗忘的语言特征。本文主要从学习者的遗忘行为和语言学习特点出发，对学习者的知识进行建模和预测。具体而言，我们首先通过实证研究探索了遗忘行为和交叉效应在现实世界语言学习数据集中的存在。在此基础上，我们提出了一个模型来预测给定学习者的练习历史记忆单词的概率。该模型结合了与遗忘、问题格式和使用注意机制的词之间的语义相似性相关的关键信息。在两个真实数据集上的实验表明，与基线相比，该模型的性能有所提高。此外，研究结果还表明，将多种遗忘信息类型与项目格式相结合可以提高学习成绩。此外，我们发现结合语义特征(如词嵌入)来模拟学习者练习历史中单词之间的相似性及其对记忆的影响也可以改进模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

LAK23: 13th International Learning Analytics and Knowledge Conference

自引率

0.00%

发文量