European Workshop on Reinforcement Learning最新文献

筛选
英文 中文
Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits 多手土匪游戏排名公式的自动发现
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_5
Francis Maes, L. Wehenkel, D. Ernst
{"title":"Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits","authors":"Francis Maes, L. Wehenkel, D. Ernst","doi":"10.1007/978-3-642-29946-9_5","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_5","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115146208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Gradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control 基于损失函数和核的梯度改进策略控制算法
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_7
Matthew W. Robards, P. Sunehag
{"title":"Gradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control","authors":"Matthew W. Robards, P. Sunehag","doi":"10.1007/978-3-642-29946-9_7","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_7","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128130351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Invited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning 特邀演讲:在强化学习中增加表征能力和尺度推理
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_2
K. Kersting
{"title":"Invited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning","authors":"K. Kersting","doi":"10.1007/978-3-642-29946-9_2","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_2","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121535827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active Learning of MDP Models MDP模型的主动学习
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_8
Mauricio Araya-López, O. Buffet, Vincent Thomas, F. Charpillet
{"title":"Active Learning of MDP Models","authors":"Mauricio Araya-López, O. Buffet, Vincent Thomas, F. Charpillet","doi":"10.1007/978-3-642-29946-9_8","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_8","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131919378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
MapReduce for Parallel Reinforcement Learning MapReduce用于并行强化学习
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_30
Yuxi Li, Dale Schuurmans
{"title":"MapReduce for Parallel Reinforcement Learning","authors":"Yuxi Li, Dale Schuurmans","doi":"10.1007/978-3-642-29946-9_30","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_30","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"14 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114043634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Transfer Learning in Multi-Agent Reinforcement Learning Domains 多智能体强化学习领域的迁移学习
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_25
G. Boutsioukis, Ioannis Partalas, I. Vlahavas
{"title":"Transfer Learning in Multi-Agent Reinforcement Learning Domains","authors":"G. Boutsioukis, Ioannis Partalas, I. Vlahavas","doi":"10.1007/978-3-642-29946-9_25","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_25","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"524 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123415986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 59
A Framework for Computing Bounds for the Return of a Policy 计算策略返回边界的框架
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_21
Cosmin Paduraru, Doina Precup, Joelle Pineau
{"title":"A Framework for Computing Bounds for the Return of a Policy","authors":"Cosmin Paduraru, Doina Precup, Joelle Pineau","doi":"10.1007/978-3-642-29946-9_21","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_21","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129749637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Value Function Approximation through Sparse Bayesian Modeling 稀疏贝叶斯建模的值函数逼近
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_15
Nikolaos Tziortziotis, K. Blekas
{"title":"Value Function Approximation through Sparse Bayesian Modeling","authors":"Nikolaos Tziortziotis, K. Blekas","doi":"10.1007/978-3-642-29946-9_15","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_15","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"225 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115492868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning 面向开发学习的主动课程分类支持系统的提出与评价
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_32
K. Miyazaki, M. Ida
{"title":"Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning","authors":"K. Miyazaki, M. Ida","doi":"10.1007/978-3-642-29946-9_32","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_32","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122760344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Handling Ambiguous Effects in Action Learning 处理行动学习中的模糊效果
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_9
Boris Lesner, B. Zanuttini
{"title":"Handling Ambiguous Effects in Action Learning","authors":"Boris Lesner, B. Zanuttini","doi":"10.1007/978-3-642-29946-9_9","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_9","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127947258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信