{"title":"基于强化学习的种群编码脉冲神经网络无地图导航","authors":"Rui Xu, Yifei Wu, Xiaoling Qin, Peng Zhao","doi":"10.1109/ICCSI55536.2022.9970598","DOIUrl":null,"url":null,"abstract":"Most of the navigation methods currently applied to mobile robots cost too much in building and maintaining maps. Therefore, it is crucial to implement mapless navigation for mobile robots. Although the recent deep reinforcement learning (DRL) methods have been able to make full use of the on-board resources to explore unknown space, their high energy cost limits their application. The low energy consumption of the spiking neural network (SNN) can help the DRL to overcome this difficulty. In this paper, we combine the SNN with the deep deterministic policy gradient (DDPG) method. To address the problem of long intervals between two adjacent pulses in the integrate-and-fire (LIF) neuron model, we change the way the membrane voltage resets and the dynamics of the neuron during the refractory period. On this basis, the environmental information was encoded using neuron population coding method and the two networks were trained jointly using an extended spatial-temporal backpropagation (STBP) method. The simulation results show that the proposed method achieves a higher success rate in navigation compared to traditional deep learning algorithms.","PeriodicalId":421514,"journal":{"name":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Population-coded Spiking Neural Network with Reinforcement Learning for Mapless Navigation\",\"authors\":\"Rui Xu, Yifei Wu, Xiaoling Qin, Peng Zhao\",\"doi\":\"10.1109/ICCSI55536.2022.9970598\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most of the navigation methods currently applied to mobile robots cost too much in building and maintaining maps. Therefore, it is crucial to implement mapless navigation for mobile robots. Although the recent deep reinforcement learning (DRL) methods have been able to make full use of the on-board resources to explore unknown space, their high energy cost limits their application. The low energy consumption of the spiking neural network (SNN) can help the DRL to overcome this difficulty. In this paper, we combine the SNN with the deep deterministic policy gradient (DDPG) method. To address the problem of long intervals between two adjacent pulses in the integrate-and-fire (LIF) neuron model, we change the way the membrane voltage resets and the dynamics of the neuron during the refractory period. On this basis, the environmental information was encoded using neuron population coding method and the two networks were trained jointly using an extended spatial-temporal backpropagation (STBP) method. The simulation results show that the proposed method achieves a higher success rate in navigation compared to traditional deep learning algorithms.\",\"PeriodicalId\":421514,\"journal\":{\"name\":\"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSI55536.2022.9970598\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSI55536.2022.9970598","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Population-coded Spiking Neural Network with Reinforcement Learning for Mapless Navigation
Most of the navigation methods currently applied to mobile robots cost too much in building and maintaining maps. Therefore, it is crucial to implement mapless navigation for mobile robots. Although the recent deep reinforcement learning (DRL) methods have been able to make full use of the on-board resources to explore unknown space, their high energy cost limits their application. The low energy consumption of the spiking neural network (SNN) can help the DRL to overcome this difficulty. In this paper, we combine the SNN with the deep deterministic policy gradient (DDPG) method. To address the problem of long intervals between two adjacent pulses in the integrate-and-fire (LIF) neuron model, we change the way the membrane voltage resets and the dynamics of the neuron during the refractory period. On this basis, the environmental information was encoded using neuron population coding method and the two networks were trained jointly using an extended spatial-temporal backpropagation (STBP) method. The simulation results show that the proposed method achieves a higher success rate in navigation compared to traditional deep learning algorithms.