European Workshop on Reinforcement Learning最新文献

筛选
英文 中文
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning 利用因子mdp中的加性结构进行强化学习
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_2
T. Degris, Olivier Sigaud, Pierre-Henri Wuillemin
{"title":"Exploiting Additive Structure in Factored MDPs for Reinforcement Learning","authors":"T. Degris, Olivier Sigaud, Pierre-Henri Wuillemin","doi":"10.1007/978-3-540-89722-4_2","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_2","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127611078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Probabilistic Inference for Fast Learning in Control 控制中快速学习的概率推理
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_18
C. Rasmussen, M. Deisenroth
{"title":"Probabilistic Inference for Fast Learning in Control","authors":"C. Rasmussen, M. Deisenroth","doi":"10.1007/978-3-540-89722-4_18","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_18","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131397390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Policy Learning - A Unified Perspective with Applications in Robotics 政策学习-机器人应用的统一视角
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_17
Jan Peters, J. Kober, D. Nguyen-Tuong
{"title":"Policy Learning - A Unified Perspective with Applications in Robotics","authors":"Jan Peters, J. Kober, D. Nguyen-Tuong","doi":"10.1007/978-3-540-89722-4_17","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_17","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116739157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets 具有变化动作集的dec - mdp的批处理模式强化学习方法的评价
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_7
T. Gabel, Martin A. Riedmiller
{"title":"Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets","authors":"T. Gabel, Martin A. Riedmiller","doi":"10.1007/978-3-540-89722-4_7","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_7","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116393005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Use of Reinforcement Learning in Two Real Applications 在两个实际应用中使用强化学习
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_15
J. Martín-Guerrero, E. Soria-Olivas, M. Martínez-Sober, Antonio J. Serrano, J. R. M. Benedito, J. Gómez-Sanchís
{"title":"Use of Reinforcement Learning in Two Real Applications","authors":"J. Martín-Guerrero, E. Soria-Olivas, M. Martínez-Sober, Antonio J. Serrano, J. R. M. Benedito, J. Gómez-Sanchís","doi":"10.1007/978-3-540-89722-4_15","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_15","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"216 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132050934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
United We Stand: Population Based Methods for Solving Unknown POMDPs 团结一致:解决未知pomdp的基于人口的方法
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_19
Noel Welsh, J. Wyatt
{"title":"United We Stand: Population Based Methods for Solving Unknown POMDPs","authors":"Noel Welsh, J. Wyatt","doi":"10.1007/978-3-540-89722-4_19","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_19","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131123417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tile Coding Based on Hyperplane Tiles 基于超平面贴图的贴图编码
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_14
D. Loiacono, P. Lanzi
{"title":"Tile Coding Based on Hyperplane Tiles","authors":"D. Loiacono, P. Lanzi","doi":"10.1007/978-3-540-89722-4_14","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_14","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115342941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bayesian Reward Filtering 贝叶斯奖励过滤
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_8
M. Geist, O. Pietquin, G. Fricout
{"title":"Bayesian Reward Filtering","authors":"M. Geist, O. Pietquin, G. Fricout","doi":"10.1007/978-3-540-89722-4_8","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_8","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124718583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Optimistic Planning of Deterministic Systems 确定性系统的乐观规划
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_12
Jean-François Hren, R. Munos
{"title":"Optimistic Planning of Deterministic Systems","authors":"Jean-François Hren, R. Munos","doi":"10.1007/978-3-540-89722-4_12","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_12","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129862051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 90
Regularized Fitted Q-iteration: Application to Planning 正则拟合q迭代:在规划中的应用
European Workshop on Reinforcement Learning Pub Date : 2008-11-27 DOI: 10.1007/978-3-540-89722-4_5
A. Farahmand, M. Ghavamzadeh, Csaba Szepesvari, Shie Mannor
{"title":"Regularized Fitted Q-iteration: Application to Planning","authors":"A. Farahmand, M. Ghavamzadeh, Csaba Szepesvari, Shie Mannor","doi":"10.1007/978-3-540-89722-4_5","DOIUrl":"https://doi.org/10.1007/978-3-540-89722-4_5","url":null,"abstract":"","PeriodicalId":432284,"journal":{"name":"European Workshop on Reinforcement Learning","volume":"105 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134475396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信