交易效用与不确定性:应用信息价值解决强化学习中的探索-利用困境

Handbook of Reinforcement Learning and Control Pub Date : 1900-01-01 DOI:10.1007/978-3-030-60990-0_19

I. Sledge, J. Príncipe

引用次数: 0

摘要

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Trading Utility and Uncertainty: Applying the Value of Information to Resolve the Exploration–Exploitation Dilemma in Reinforcement Learning

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

自引率

0.00%

发文量