Dynamics of Supervised and Reinforcement Learning in the Non-Linear Perceptron.

ArXiv Pub Date : 2025-02-24

Christian Schmid, James M Murray

{"title":"Dynamics of Supervised and Reinforcement Learning in the Non-Linear Perceptron.","authors":"Christian Schmid, James M Murray","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>The ability of a brain or a neural network to efficiently learn depends crucially on both the task structure and the learning rule. Previous works have analyzed the dynamical equations describing learning in the relatively simplified context of the perceptron under assumptions of a student-teacher framework or a linearized output. While these assumptions have facilitated theoretical understanding, they have precluded a detailed understanding of the roles of the nonlinearity and input-data distribution in determining the learning dynamics, limiting the applicability of the theories to real biological or artificial neural networks. Here, we use a stochastic-process approach to derive flow equations describing learning, applying this framework to the case of a nonlinear perceptron performing binary classification. We characterize the effects of the learning rule (supervised or reinforcement learning, SL/RL) and input-data distribution on the perceptron's learning curve and the forgetting curve as subsequent tasks are learned. In particular, we find that the input-data noise differently affects the learning speed under SL vs. RL, as well as determines how quickly learning of a task is overwritten by subsequent learning. Additionally, we verify our approach with real data using the MNIST dataset. This approach points a way toward analyzing learning dynamics for more-complex circuit architectures.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11398553/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ArXiv","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The ability of a brain or a neural network to efficiently learn depends crucially on both the task structure and the learning rule. Previous works have analyzed the dynamical equations describing learning in the relatively simplified context of the perceptron under assumptions of a student-teacher framework or a linearized output. While these assumptions have facilitated theoretical understanding, they have precluded a detailed understanding of the roles of the nonlinearity and input-data distribution in determining the learning dynamics, limiting the applicability of the theories to real biological or artificial neural networks. Here, we use a stochastic-process approach to derive flow equations describing learning, applying this framework to the case of a nonlinear perceptron performing binary classification. We characterize the effects of the learning rule (supervised or reinforcement learning, SL/RL) and input-data distribution on the perceptron's learning curve and the forgetting curve as subsequent tasks are learned. In particular, we find that the input-data noise differently affects the learning speed under SL vs. RL, as well as determines how quickly learning of a task is overwritten by subsequent learning. Additionally, we verify our approach with real data using the MNIST dataset. This approach points a way toward analyzing learning dynamics for more-complex circuit architectures.

本刊更多论文

非线性感知器中监督学习和强化学习的动态变化

大脑或神经网络能否高效学习，关键取决于任务结构和学习规则。以往的研究在学生-教师框架或线性化输出的假设下，分析了相对简化的感知器背景下描述学习的动态方程。虽然这些假设有助于理论理解，但却无法详细了解非线性和输入数据分布在决定学习动态中的作用，从而限制了这些理论在实际生物或人工神经网络中的适用性。在此，我们使用随机过程方法推导出描述学习的流动方程，并将此框架应用于执行二元分类的非线性感知器。我们描述了学习规则（监督学习或强化学习，SL/RL）和输入数据分布对感知器学习曲线和遗忘曲线的影响。特别是，我们发现输入数据噪声对 SL 与 RL 学习速度的影响不同，同时也决定了任务学习被后续学习覆盖的速度。此外，我们还利用 MNIST 数据集的真实数据验证了我们的方法。这种方法为分析更复杂电路架构的学习动态指明了方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ArXiv

自引率

0.00%

发文量