{"title":"Instructions with Complex Control-Flow Entailing Machine Learning","authors":"S. Shinde, Harneet Singh Bali","doi":"10.1109/SNPD54884.2022.10051797","DOIUrl":null,"url":null,"abstract":"Reinforcement learning is when the system is allowed to make its own decisions based on what it learns. There are 2 types of observations, formative and summative. These observations have been identified as crucially important for neural network training of complicated tasks with conditional control flow. The central theme of this paper is applying reinforcement learning to follow instructions with complex control-flow. The authors study a special but important subset of multi- task reinforcement learning problems, namely instructions with complex control-flow in this work. They develop an encoding and attention architecture to achieve the research objective.","PeriodicalId":425462,"journal":{"name":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNPD54884.2022.10051797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Reinforcement learning is when the system is allowed to make its own decisions based on what it learns. There are 2 types of observations, formative and summative. These observations have been identified as crucially important for neural network training of complicated tasks with conditional control flow. The central theme of this paper is applying reinforcement learning to follow instructions with complex control-flow. The authors study a special but important subset of multi- task reinforcement learning problems, namely instructions with complex control-flow in this work. They develop an encoding and attention architecture to achieve the research objective.