Nikolaos Toulgaridis, E. Bougioukou, T. Antonakopoulos
{"title":"Architecture and implementation of a Restricted Boltzmann Machine for handwritten digits recognition","authors":"Nikolaos Toulgaridis, E. Bougioukou, T. Antonakopoulos","doi":"10.1109/MOCAST.2017.7937689","DOIUrl":null,"url":null,"abstract":"Restricted Boltzmann Machines are artificial neural networks used in many types of statistical classification. In this work we present the architecture and implementation of such a neural network for fast recognition of hand-written digits. We use fixed and floating point arithmetic for minimizing the required hardware resources, and the use of pipeline results to a processing rate of more than 1 Mimages/sec per RBM. Four neural networks have been used on a PCIe-based hardware accelerator that uses a Virtex-7 FPGA, and that results to a total processing rate of more than 4 Mimages/sec.","PeriodicalId":202381,"journal":{"name":"2017 6th International Conference on Modern Circuits and Systems Technologies (MOCAST)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 6th International Conference on Modern Circuits and Systems Technologies (MOCAST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MOCAST.2017.7937689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Restricted Boltzmann Machines are artificial neural networks used in many types of statistical classification. In this work we present the architecture and implementation of such a neural network for fast recognition of hand-written digits. We use fixed and floating point arithmetic for minimizing the required hardware resources, and the use of pipeline results to a processing rate of more than 1 Mimages/sec per RBM. Four neural networks have been used on a PCIe-based hardware accelerator that uses a Virtex-7 FPGA, and that results to a total processing rate of more than 4 Mimages/sec.
受限玻尔兹曼机是用于多种统计分类的人工神经网络。在这项工作中,我们提出了这种快速识别手写数字的神经网络的架构和实现。我们使用固定和浮点算法来最小化所需的硬件资源,并且使用流水线导致每个RBM的处理速率超过1 m /秒。在使用Virtex-7 FPGA的基于pcie的硬件加速器上使用了四个神经网络,其结果是总处理速率超过4 m /秒。