{"title":"Bayesian curriculum generation in sparse reward reinforcement learning environments","authors":"Onur Akgün , N. Kemal Üre","doi":"10.1016/j.jestch.2025.102048","DOIUrl":null,"url":null,"abstract":"<div><div>This paper introduces the Bayesian Curriculum Generation Algorithm, a sophisticated approach for curriculum learning in sparse reward reinforcement learning contexts. Diverging from traditional methodologies, this algorithm utilizes Bayesian networks to dynamically create tasks by altering problem parameters, thereby impacting task difficulty. It operates independently from the core reinforcement learning algorithm, enabling compatibility with a variety of RL techniques. A notable feature of our algorithm is its capability for unsupervised task classification, utilizing a clustering process applicable to both image outputs and scalar values. This method efficiently categorizes tasks based on difficulty, circumventing the need for exhaustive training for each task. However, the effectiveness of this approach is contingent upon the presence of definable parameters within the environment and necessitates domain expertise to determine the appropriate tool, be it image output or scalar parameter analysis. The algorithm selects tasks from a curated pool corresponding to specific difficulty levels and adapts according to the agent’s performance. Successful task completion triggers the generation of more complex tasks, whereas encountering challenges results in the maintenance or minor adjustment of task complexity. This adaptive feature significantly enhances the efficiency of the learning process. Empirical evaluations conducted in various environments, characterized by maze-like structures, discrete or continuous settings, and the presence of adversarial entities hindering the agent’s mission, demonstrate the algorithm’s efficacy and its superiority over conventional methods. The Bayesian Curriculum Generation Algorithm represents a significant advancement in reinforcement learning, providing a dynamic and adaptable solution for complex learning challenges.</div></div>","PeriodicalId":48609,"journal":{"name":"Engineering Science and Technology-An International Journal-Jestech","volume":"66 ","pages":"Article 102048"},"PeriodicalIF":5.1000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Science and Technology-An International Journal-Jestech","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S221509862500103X","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces the Bayesian Curriculum Generation Algorithm, a sophisticated approach for curriculum learning in sparse reward reinforcement learning contexts. Diverging from traditional methodologies, this algorithm utilizes Bayesian networks to dynamically create tasks by altering problem parameters, thereby impacting task difficulty. It operates independently from the core reinforcement learning algorithm, enabling compatibility with a variety of RL techniques. A notable feature of our algorithm is its capability for unsupervised task classification, utilizing a clustering process applicable to both image outputs and scalar values. This method efficiently categorizes tasks based on difficulty, circumventing the need for exhaustive training for each task. However, the effectiveness of this approach is contingent upon the presence of definable parameters within the environment and necessitates domain expertise to determine the appropriate tool, be it image output or scalar parameter analysis. The algorithm selects tasks from a curated pool corresponding to specific difficulty levels and adapts according to the agent’s performance. Successful task completion triggers the generation of more complex tasks, whereas encountering challenges results in the maintenance or minor adjustment of task complexity. This adaptive feature significantly enhances the efficiency of the learning process. Empirical evaluations conducted in various environments, characterized by maze-like structures, discrete or continuous settings, and the presence of adversarial entities hindering the agent’s mission, demonstrate the algorithm’s efficacy and its superiority over conventional methods. The Bayesian Curriculum Generation Algorithm represents a significant advancement in reinforcement learning, providing a dynamic and adaptable solution for complex learning challenges.
期刊介绍:
Engineering Science and Technology, an International Journal (JESTECH) (formerly Technology), a peer-reviewed quarterly engineering journal, publishes both theoretical and experimental high quality papers of permanent interest, not previously published in journals, in the field of engineering and applied science which aims to promote the theory and practice of technology and engineering. In addition to peer-reviewed original research papers, the Editorial Board welcomes original research reports, state-of-the-art reviews and communications in the broadly defined field of engineering science and technology.
The scope of JESTECH includes a wide spectrum of subjects including:
-Electrical/Electronics and Computer Engineering (Biomedical Engineering and Instrumentation; Coding, Cryptography, and Information Protection; Communications, Networks, Mobile Computing and Distributed Systems; Compilers and Operating Systems; Computer Architecture, Parallel Processing, and Dependability; Computer Vision and Robotics; Control Theory; Electromagnetic Waves, Microwave Techniques and Antennas; Embedded Systems; Integrated Circuits, VLSI Design, Testing, and CAD; Microelectromechanical Systems; Microelectronics, and Electronic Devices and Circuits; Power, Energy and Energy Conversion Systems; Signal, Image, and Speech Processing)
-Mechanical and Civil Engineering (Automotive Technologies; Biomechanics; Construction Materials; Design and Manufacturing; Dynamics and Control; Energy Generation, Utilization, Conversion, and Storage; Fluid Mechanics and Hydraulics; Heat and Mass Transfer; Micro-Nano Sciences; Renewable and Sustainable Energy Technologies; Robotics and Mechatronics; Solid Mechanics and Structure; Thermal Sciences)
-Metallurgical and Materials Engineering (Advanced Materials Science; Biomaterials; Ceramic and Inorgnanic Materials; Electronic-Magnetic Materials; Energy and Environment; Materials Characterizastion; Metallurgy; Polymers and Nanocomposites)