A Demonstration of Stability-Plasticity Imbalance in Multi-agent, Decomposition-Based Learning

2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA) Pub Date : 2015-12-01 DOI:10.1109/ICMLA.2015.106

Sean C. Mondesire, R. P. Wiegand

{"title":"A Demonstration of Stability-Plasticity Imbalance in Multi-agent, Decomposition-Based Learning","authors":"Sean C. Mondesire, R. P. Wiegand","doi":"10.1109/ICMLA.2015.106","DOIUrl":null,"url":null,"abstract":"Layered learning is a machine learning paradigm used in conjunction with direct-policy search reinforcement learning methods to find high performance agent behaviors for complex tasks. At its core, layered learning is a decomposition-based paradigm that shares many characteristics with robot shaping, transfer learning, hierarchical decomposition, and incremental learning. Previous studies have provided evidence that layered learning has the ability to outperform standard monolithic methods of learning in many cases. The dilemma of balancing stability and plasticity is a common problem in machine learning that causes learning agents to compromise between retaining learned information to perform a task with new incoming information. Although existing work implies that there is a stability-plasticity imbalance that greatly limits layered learning agents' ability to learn optimally, no work explicitly verifies the existence of the imbalance or its causes. This work investigates the stability-plasticity imbalance and demonstrates that indeed, layered learning heavily favors plasticity, which can cause learned subtask proficiency to be lost when new tasks are learned. We conclude by identifying potential causes of the imbalance in layered learning and provide high level advice about how to mitigate the imbalance's negative effects.","PeriodicalId":288427,"journal":{"name":"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2015.106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Layered learning is a machine learning paradigm used in conjunction with direct-policy search reinforcement learning methods to find high performance agent behaviors for complex tasks. At its core, layered learning is a decomposition-based paradigm that shares many characteristics with robot shaping, transfer learning, hierarchical decomposition, and incremental learning. Previous studies have provided evidence that layered learning has the ability to outperform standard monolithic methods of learning in many cases. The dilemma of balancing stability and plasticity is a common problem in machine learning that causes learning agents to compromise between retaining learned information to perform a task with new incoming information. Although existing work implies that there is a stability-plasticity imbalance that greatly limits layered learning agents' ability to learn optimally, no work explicitly verifies the existence of the imbalance or its causes. This work investigates the stability-plasticity imbalance and demonstrates that indeed, layered learning heavily favors plasticity, which can cause learned subtask proficiency to be lost when new tasks are learned. We conclude by identifying potential causes of the imbalance in layered learning and provide high level advice about how to mitigate the imbalance's negative effects.

查看原文本刊更多论文

基于分解的多智能体学习中稳定性-可塑性不平衡的论证

分层学习是一种机器学习范式，与直接策略搜索强化学习方法结合使用，用于为复杂任务寻找高性能代理行为。分层学习的核心是一种基于分解的范式，它与机器人塑造、迁移学习、分层分解和增量学习有许多共同的特点。先前的研究已经提供了证据，证明分层学习在许多情况下有能力胜过标准的单一学习方法。平衡稳定性和可塑性的困境是机器学习中的一个常见问题，它导致学习代理在保留所学信息和使用新传入信息执行任务之间做出妥协。虽然现有的研究表明，存在一种稳定性-可塑性失衡，极大地限制了分层学习智能体的最佳学习能力，但没有研究明确证实这种失衡的存在及其原因。本研究研究了稳定性和可塑性的不平衡，并证明了分层学习确实非常有利于可塑性，这可能导致学习新任务时习得的子任务熟练程度丧失。最后，我们确定了分层学习中不平衡的潜在原因，并就如何减轻不平衡的负面影响提供了高层次的建议。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)

自引率

0.00%

发文量