{"title":"脉冲神经网络智能体的初步实验研究","authors":"Adam Barton","doi":"10.1109/NEUREL.2018.8587034","DOIUrl":null,"url":null,"abstract":"The aim of this paper is a proposal of a spiking neural network circuit design continuously modifying synaptic strengths between neurons through the spike-timing-dependent plasticity to fulfil the agent objective. The network consists of Izhikevich neurons controlling an agent which uses three sensors to associate obstacles to the defined classes in its environment. The environment is formed as a torus of revolution. The reinforcement of synapses of the control network is based on the spike-timing-dependent plasticity (STDP), which is modulated by extracellular dopamine. The controlled agent may perform four different output actions: move forward, turn left, turn right, and release repulse beacon to avoid all obstacles with a negative impact on the agent. The testing environment has been populated with obstacles randomly placed in the torus. Experiments confirmed that the agent is able to associate negative and positive object classes in its environment. The continuation of this work was outlined in the conclusion.","PeriodicalId":371831,"journal":{"name":"2018 14th Symposium on Neural Networks and Applications (NEUREL)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"The Spiking Neural Network Agent – The Initial Experimental Study\",\"authors\":\"Adam Barton\",\"doi\":\"10.1109/NEUREL.2018.8587034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of this paper is a proposal of a spiking neural network circuit design continuously modifying synaptic strengths between neurons through the spike-timing-dependent plasticity to fulfil the agent objective. The network consists of Izhikevich neurons controlling an agent which uses three sensors to associate obstacles to the defined classes in its environment. The environment is formed as a torus of revolution. The reinforcement of synapses of the control network is based on the spike-timing-dependent plasticity (STDP), which is modulated by extracellular dopamine. The controlled agent may perform four different output actions: move forward, turn left, turn right, and release repulse beacon to avoid all obstacles with a negative impact on the agent. The testing environment has been populated with obstacles randomly placed in the torus. Experiments confirmed that the agent is able to associate negative and positive object classes in its environment. The continuation of this work was outlined in the conclusion.\",\"PeriodicalId\":371831,\"journal\":{\"name\":\"2018 14th Symposium on Neural Networks and Applications (NEUREL)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 14th Symposium on Neural Networks and Applications (NEUREL)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NEUREL.2018.8587034\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 14th Symposium on Neural Networks and Applications (NEUREL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NEUREL.2018.8587034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Spiking Neural Network Agent – The Initial Experimental Study
The aim of this paper is a proposal of a spiking neural network circuit design continuously modifying synaptic strengths between neurons through the spike-timing-dependent plasticity to fulfil the agent objective. The network consists of Izhikevich neurons controlling an agent which uses three sensors to associate obstacles to the defined classes in its environment. The environment is formed as a torus of revolution. The reinforcement of synapses of the control network is based on the spike-timing-dependent plasticity (STDP), which is modulated by extracellular dopamine. The controlled agent may perform four different output actions: move forward, turn left, turn right, and release repulse beacon to avoid all obstacles with a negative impact on the agent. The testing environment has been populated with obstacles randomly placed in the torus. Experiments confirmed that the agent is able to associate negative and positive object classes in its environment. The continuation of this work was outlined in the conclusion.