European Workshop on Reinforcement Learning最新文献

筛选
英文 中文
Options with Exceptions 带有例外的选项
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_18
Munu Sairamesh, Balaraman Ravindran
{"title":"Options with Exceptions","authors":"Munu Sairamesh, Balaraman Ravindran","doi":"10.1007/978-3-642-29946-9_18","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_18","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"114 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129417646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Reinforcement Learning with a Bilinear Q Function 基于双线性Q函数的强化学习
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_11
C. Elkan
{"title":"Reinforcement Learning with a Bilinear Q Function","authors":"C. Elkan","doi":"10.1007/978-3-642-29946-9_11","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_11","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130093941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization 正则化最小二乘时间差分学习与嵌套l_1和l_2惩罚
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_13
Matthew W. Hoffman, A. Lazaric, M. Ghavamzadeh, R. Munos
{"title":"Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization","authors":"Matthew W. Hoffman, A. Lazaric, M. Ghavamzadeh, R. Munos","doi":"10.1007/978-3-642-29946-9_13","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_13","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128565955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Invited Talk: UCRL and Autonomous Exploration 特邀演讲:UCRL与自主探索
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_1
P. Auer
{"title":"Invited Talk: UCRL and Autonomous Exploration","authors":"P. Auer","doi":"10.1007/978-3-642-29946-9_1","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_1","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"223 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134312857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Goal-Directed Online Learning of Predictive Models 预测模型的目标导向在线学习
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_6
S. W. Ong, Y. Grinberg, Joelle Pineau
{"title":"Goal-Directed Online Learning of Predictive Models","authors":"S. W. Ong, Y. Grinberg, Joelle Pineau","doi":"10.1007/978-3-642-29946-9_6","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_6","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133944276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Compound Reinforcement Learning: Theory and an Application to Finance 复合强化学习:理论及其在金融中的应用
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_31
Tohgoroh Matsui, Takashi Goto, K. Izumi, Yu Chen
{"title":"Compound Reinforcement Learning: Theory and an Application to Finance","authors":"Tohgoroh Matsui, Takashi Goto, K. Izumi, Yu Chen","doi":"10.1007/978-3-642-29946-9_31","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_31","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114628027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings 多智能体设置的层次强化学习算法的扩展
European Workshop on Reinforcement Learning Pub Date : 2011-09-09 DOI: 10.1007/978-3-642-29946-9_26
I. Lambrou, Vassilis Vassiliades, C. Christodoulou
{"title":"An Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings","authors":"I. Lambrou, Vassilis Vassiliades, C. Christodoulou","doi":"10.1007/978-3-642-29946-9_26","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_26","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114903024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bayesian Multitask Inverse Reinforcement Learning 贝叶斯多任务逆强化学习
European Workshop on Reinforcement Learning Pub Date : 2011-06-18 DOI: 10.1007/978-3-642-29946-9_27
Christos Dimitrakakis, C. Rothkopf
{"title":"Bayesian Multitask Inverse Reinforcement Learning","authors":"Christos Dimitrakakis, C. Rothkopf","doi":"10.1007/978-3-642-29946-9_27","DOIUrl":"https://doi.org/10.1007/978-3-642-29946-9_27","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124335795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 104
Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees 不完全干扰树集合上决策优化的不确定懒规划
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_1
Boris Defourny, D. Ernst, L. Wehenkel
{"title":"Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees","authors":"Boris Defourny, D. Ernst, L. Wehenkel","doi":"10.1007/978-3-540-89722-4_1","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_1","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125727655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Basis Expansion in Natural Actor Critic Methods 自然演员评价方法的基础拓展
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_9
Sertan Girgin, P. Preux
{"title":"Basis Expansion in Natural Actor Critic Methods","authors":"Sertan Girgin, P. Preux","doi":"10.1007/978-3-540-89722-4_9","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_9","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131499240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信