{"title":"Simulation of Homogenous Fish Schools in the Presence of Food and Predators using Reinforcement Learning","authors":"Ravipas Wangananont, Norapat Buppodom, Sanpat Chanthanuraks, Vishnu Kotrajaras","doi":"10.1109/iSAI-NLP56921.2022.9960278","DOIUrl":null,"url":null,"abstract":"We utilized Deep Reinforcement Learning to incor-porate schooling, foraging, and predator avoidance behaviors into a single fish behavior model. We used Proximal Policy Optimization (PPO) with Intrinsic Curiosity Reward (ICR) to make fish agents learn in our Unity Environment. We created an interactive control system on Unity that allows users to visualize and manipulate the simulation using only a mouse and keyboard. We compared our model with three variations: one without schooling reward, one without foraging reward, and one without predator avoidance reward. Our original model (schooling, foraging, and predator avoidance) clearly illustrated the unification of all three behaviors.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960278","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
We utilized Deep Reinforcement Learning to incor-porate schooling, foraging, and predator avoidance behaviors into a single fish behavior model. We used Proximal Policy Optimization (PPO) with Intrinsic Curiosity Reward (ICR) to make fish agents learn in our Unity Environment. We created an interactive control system on Unity that allows users to visualize and manipulate the simulation using only a mouse and keyboard. We compared our model with three variations: one without schooling reward, one without foraging reward, and one without predator avoidance reward. Our original model (schooling, foraging, and predator avoidance) clearly illustrated the unification of all three behaviors.