{"title":"具有复杂控制流的机器学习指令","authors":"S. Shinde, Harneet Singh Bali","doi":"10.1109/SNPD54884.2022.10051797","DOIUrl":null,"url":null,"abstract":"Reinforcement learning is when the system is allowed to make its own decisions based on what it learns. There are 2 types of observations, formative and summative. These observations have been identified as crucially important for neural network training of complicated tasks with conditional control flow. The central theme of this paper is applying reinforcement learning to follow instructions with complex control-flow. The authors study a special but important subset of multi- task reinforcement learning problems, namely instructions with complex control-flow in this work. They develop an encoding and attention architecture to achieve the research objective.","PeriodicalId":425462,"journal":{"name":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Instructions with Complex Control-Flow Entailing Machine Learning\",\"authors\":\"S. Shinde, Harneet Singh Bali\",\"doi\":\"10.1109/SNPD54884.2022.10051797\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning is when the system is allowed to make its own decisions based on what it learns. There are 2 types of observations, formative and summative. These observations have been identified as crucially important for neural network training of complicated tasks with conditional control flow. The central theme of this paper is applying reinforcement learning to follow instructions with complex control-flow. The authors study a special but important subset of multi- task reinforcement learning problems, namely instructions with complex control-flow in this work. They develop an encoding and attention architecture to achieve the research objective.\",\"PeriodicalId\":425462,\"journal\":{\"name\":\"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SNPD54884.2022.10051797\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNPD54884.2022.10051797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Instructions with Complex Control-Flow Entailing Machine Learning
Reinforcement learning is when the system is allowed to make its own decisions based on what it learns. There are 2 types of observations, formative and summative. These observations have been identified as crucially important for neural network training of complicated tasks with conditional control flow. The central theme of this paper is applying reinforcement learning to follow instructions with complex control-flow. The authors study a special but important subset of multi- task reinforcement learning problems, namely instructions with complex control-flow in this work. They develop an encoding and attention architecture to achieve the research objective.