An Adaptive Sampling and Edge Detection Approach for Encoding Static Images for Spiking Neural Networks

2021 12th International Green and Sustainable Computing Conference (IGSC) Pub Date : 2021-10-18 DOI:10.1109/IGSC54211.2021.9651610

Peyton S. Chandarana, Jun Ou, Ramtin Zand

{"title":"An Adaptive Sampling and Edge Detection Approach for Encoding Static Images for Spiking Neural Networks","authors":"Peyton S. Chandarana, Jun Ou, Ramtin Zand","doi":"10.1109/IGSC54211.2021.9651610","DOIUrl":null,"url":null,"abstract":"Current state-of-the-art methods of image classification using convolutional neural networks are often constrained by both latency and power consumption. This places a limit on the devices, particularly low-power edge devices, that can employ these methods. Spiking neural networks (SNNs) are considered to be the third generation of artificial neural networks which aim to address these latency and power constraints by taking inspiration from biological neuronal communication processes. Before data such as images can be input into an SNN, however, they must be first encoded into spike trains. Herein, we propose a method for encoding static images into temporal spike trains using edge detection and an adaptive signal sampling method for use in SNNs. The edge detection process consists of first performing Canny edge detection on the 2D static images and then converting the edge detected images into two X and Y signals using an image-to-signal conversion method. The adaptive signaling approach consists of sampling the signals such that the signals maintain enough detail and are sensitive to abrupt changes in the signal. Temporal encoding mechanisms such as threshold-based representation (TBR) and step-forward (SF) are then able to be used to convert the sampled signals into spike trains. We use various error and indicator metrics to optimize and evaluate the efficiency and precision of the proposed image encoding approach. Comparison results between the original and reconstructed signals from spike trains generated using edge-detection and adaptive temporal encoding mechanism exhibit $18\\times$ and $7\\times$ reduction in average root mean square error (RMSE) compared to the conventional SF and TBR encoding, respectively, while used for encoding MNIST dataset.","PeriodicalId":334989,"journal":{"name":"2021 12th International Green and Sustainable Computing Conference (IGSC)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Green and Sustainable Computing Conference (IGSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IGSC54211.2021.9651610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Current state-of-the-art methods of image classification using convolutional neural networks are often constrained by both latency and power consumption. This places a limit on the devices, particularly low-power edge devices, that can employ these methods. Spiking neural networks (SNNs) are considered to be the third generation of artificial neural networks which aim to address these latency and power constraints by taking inspiration from biological neuronal communication processes. Before data such as images can be input into an SNN, however, they must be first encoded into spike trains. Herein, we propose a method for encoding static images into temporal spike trains using edge detection and an adaptive signal sampling method for use in SNNs. The edge detection process consists of first performing Canny edge detection on the 2D static images and then converting the edge detected images into two X and Y signals using an image-to-signal conversion method. The adaptive signaling approach consists of sampling the signals such that the signals maintain enough detail and are sensitive to abrupt changes in the signal. Temporal encoding mechanisms such as threshold-based representation (TBR) and step-forward (SF) are then able to be used to convert the sampled signals into spike trains. We use various error and indicator metrics to optimize and evaluate the efficiency and precision of the proposed image encoding approach. Comparison results between the original and reconstructed signals from spike trains generated using edge-detection and adaptive temporal encoding mechanism exhibit $18\times$ and $7\times$ reduction in average root mean square error (RMSE) compared to the conventional SF and TBR encoding, respectively, while used for encoding MNIST dataset.

查看原文本刊更多论文

一种用于脉冲神经网络静态图像编码的自适应采样和边缘检测方法

目前使用卷积神经网络的图像分类方法通常受到延迟和功耗的限制。这就限制了可以采用这些方法的设备，特别是低功耗边缘设备。脉冲神经网络(snn)被认为是第三代人工神经网络，旨在通过从生物神经元通信过程中获得灵感来解决这些延迟和功率限制问题。然而，在像图像这样的数据可以输入SNN之前，它们必须首先被编码到尖峰序列中。在此，我们提出了一种使用边缘检测和自适应信号采样方法将静态图像编码为时间尖峰序列的方法，用于snn。边缘检测过程包括首先对二维静态图像进行Canny边缘检测，然后使用图像到信号的转换方法将检测到的边缘图像转换为两个X和Y信号。自适应信令方法包括对信号进行采样，使信号保持足够的细节，并对信号的突变敏感。时间编码机制，如基于阈值的表示(TBR)和步进(SF)，然后能够用于将采样信号转换成尖峰序列。我们使用各种误差和指标指标来优化和评估所提出的图像编码方法的效率和精度。使用边缘检测和自适应时间编码机制生成的尖峰序列的原始信号和重建信号的比较结果显示，与传统的SF和TBR编码相比，在编码MNIST数据集时，平均均方根误差(RMSE)分别降低了18倍和7倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 12th International Green and Sustainable Computing Conference (IGSC)

自引率

0.00%

发文量