Reinforcement Learning for Continuous Control: A Quantum Normalized Advantage Function Approach

Yaofu Liu, Chang Xu, Siyuan Jin

2023 IEEE International Conference on Quantum Software (QSW), July 2023
DOI: 10.1109/QSW59989.2023.00020
Citations: 0
Abstract
In this study, we present a new approach to quantum reinforcement learning that handles tasks with continuous action spaces. Our method is a quantum version of the classical normalized advantage function (QNAF): it requires only a Q-value network, realized by a quantum neural network, and avoids any separate policy network. We implemented the method in the TensorFlow framework. Tested on standard Gym benchmarks, QNAF outperforms classical NAF and prior quantum methods while using fewer adjustable parameters. Furthermore, it shows improved stability, converging reliably regardless of the initial random parameters.
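To illustrate why a normalized advantage function needs no separate policy network, the sketch below shows the classical NAF decomposition Q(s, a) = V(s) + A(s, a), with a quadratic advantage A(s, a) = -1/2 (a - mu)^T P (a - mu) and P = L L^T positive semi-definite. This is a minimal NumPy sketch of the classical formulation only; the paper's quantum Q-value network is not reproduced here, and the function and variable names are illustrative, not from the paper.

```python
import numpy as np

def naf_q_value(action, mu, L, V):
    """Classical NAF decomposition: Q(s, a) = V(s) + A(s, a).

    A(s, a) = -0.5 * (a - mu)^T P (a - mu), with P = L @ L.T,
    so P is positive semi-definite and A(s, a) <= 0 everywhere.
    In a full agent, mu, L, and V are outputs of the Q-network
    (quantum or classical); here they are plain arrays.
    """
    P = L @ L.T                       # positive semi-definite precision matrix
    diff = action - mu
    advantage = -0.5 * diff @ P @ diff
    return V + advantage

# Because A is maximized (and equals 0) at action == mu, the greedy
# action is simply mu(s) -- no separate policy network is required.
```

The quadratic form guarantees that the argmax over actions is available in closed form, which is what lets NAF-style methods do continuous control with a single value network.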