{"title":"Deep Reinforcement Learning based Dynamic Object Detection and Tracking from a Moving Platform","authors":"Chinmay Shinde, Rolif Lima, K. Das","doi":"10.1109/ICC47138.2019.9123158","DOIUrl":null,"url":null,"abstract":"This article proposes Deep Reinforcement learning inspired image based visual servoing (DRL-IBVS) controller for aerial robots. The proposed DRL-IBVS controller uses monocular images to map target’s bounding box based errors to linear-velocity command to the robot for following the target. Deep learning based object detection algorithm YOLOv2 architecture is used to identify the target in the image. The noise in detection is filtered using multi-object Bayes filter, where the prediction model utilizes a correlation tracker for target prior estimation. Filtered bounding box output is fed to the Deep Deterministic Policy Gradient (DDPG) for performing the visual servoing. DDPG agent is trained separately using the pinhole camera model and the kinematic relation between two drones (follower and target). Gazebo-based simulation environment is used to successfully validate the controller.","PeriodicalId":231050,"journal":{"name":"2019 Sixth Indian Control Conference (ICC)","volume":"377 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Sixth Indian Control Conference (ICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICC47138.2019.9123158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This article proposes Deep Reinforcement learning inspired image based visual servoing (DRL-IBVS) controller for aerial robots. The proposed DRL-IBVS controller uses monocular images to map target’s bounding box based errors to linear-velocity command to the robot for following the target. Deep learning based object detection algorithm YOLOv2 architecture is used to identify the target in the image. The noise in detection is filtered using multi-object Bayes filter, where the prediction model utilizes a correlation tracker for target prior estimation. Filtered bounding box output is fed to the Deep Deterministic Policy Gradient (DDPG) for performing the visual servoing. DDPG agent is trained separately using the pinhole camera model and the kinematic relation between two drones (follower and target). Gazebo-based simulation environment is used to successfully validate the controller.