Reinforcement Learning for Online Learning Recommendation System

Wacharawan Intayoad, Chayapol Kamyod, P. Temdee
{"title":"Reinforcement Learning for Online Learning Recommendation System","authors":"Wacharawan Intayoad, Chayapol Kamyod, P. Temdee","doi":"10.1109/GWS.2018.8686513","DOIUrl":null,"url":null,"abstract":"In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.","PeriodicalId":256742,"journal":{"name":"2018 Global Wireless Summit (GWS)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Global Wireless Summit (GWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GWS.2018.8686513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

In the learning environment, individual learner requires flexible and suitable learning processes. Online learning should be able to recommend appropriate learning objects (LOs) to an individual in real-time. Most of the existing approaches of online learning recommendation systems are based on collaborative filtering methods. Such methods have a limitation on realtime adaption and require the prior knowledge of students and LOs. Therefore, this study proposes a real-time recommendation method which is suitable for flexible and complex environments. The proposed method is based on Reinforcement Learning problem. The method is able to explore the environment to get information and exploit the information to make a decision. We evaluate the proposed method with the real world data. We vary e-greedy, the learning rate, and the discount rate for a tradeoff between the exploration and exploitation.
在线学习推荐系统的强化学习
在学习环境中,个体学习者需要灵活和合适的学习过程。在线学习应该能够实时向个人推荐合适的学习对象(LOs)。现有的在线学习推荐系统大多是基于协同过滤的方法。这种方法在实时适应方面存在局限性,并且需要学生和LOs的先验知识。因此,本研究提出了一种适合于灵活复杂环境的实时推荐方法。该方法基于强化学习问题。该方法能够探索环境以获取信息,并利用信息做出决策。我们用真实世界的数据来评估所提出的方法。我们改变了e-greedy,学习率,以及在探索和开发之间权衡的折现率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信