Iterative architecture for value iteration using memristors

14th IEEE International Conference on Nanotechnology Pub Date : 2014-12-01 DOI:10.1109/NANO.2014.6967997

I. Ebong, P. Mazumder

引用次数: 0

Abstract

Memristors promise higher device density and design flexibility. Besides utilizing memristors for digital memory, another promising avenue for adoption is the advancement of neural network circuits capable of learning. Neural network implementations with memristors have been proposed, including memristor synaptic training methodologies. This work highlights applications of a neural learning methodology inspired by Q-learning. Memristors are used as analog storage elements to store a large Q-table. The method is qualified with a maze problem in order to show that the proposed network can be used to learn to approximate an optimal path to solving the maze problem. Brief results highlighting the methodology on a maze problem and discussion on generating random keys are provided. This work combines model-free reinforcement learning with neural networks.

查看原文本刊更多论文

使用忆阻器进行值迭代的迭代架构

忆阻器承诺更高的器件密度和设计灵活性。除了将忆阻器用于数字存储器之外，另一个有希望采用的途径是能够学习的神经网络电路的进步。神经网络实现与忆阻器已经提出，包括忆阻器突触训练方法。这项工作强调了受q学习启发的神经学习方法的应用。忆阻器被用作模拟存储元件来存储一个大的q表。该方法通过一个迷宫问题进行了限定，以表明所提出的网络可以用来学习逼近解决迷宫问题的最优路径。简要介绍了迷宫问题的研究方法，并讨论了生成随机密钥的方法。这项工作将无模型强化学习与神经网络相结合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

14th IEEE International Conference on Nanotechnology

自引率

0.00%

发文量