TOT-Net: An Endeavor Toward Optimizing Ternary Neural Networks

2019 22nd Euromicro Conference on Digital System Design (DSD) Pub Date : 2019-08-01 DOI:10.1109/DSD.2019.00052

Najmeh Nazari, Mohammad Loni, M. Salehi, M. Daneshtalab, Mikael Sjödin

{"title":"TOT-Net: An Endeavor Toward Optimizing Ternary Neural Networks","authors":"Najmeh Nazari, Mohammad Loni, M. Salehi, M. Daneshtalab, Mikael Sjödin","doi":"10.1109/DSD.2019.00052","DOIUrl":null,"url":null,"abstract":"High computation demands and big memory resources are the major implementation challenges of Convolutional Neural Networks (CNNs) especially for low-power and resource-limited embedded devices. Many binarized neural networks are recently proposed to address these issues. Although they have significantly decreased computation and memory footprint, they have suffered from accuracy loss especially for large datasets. In this paper, we propose TOT-Net, a ternarized neural network with [-1, 0, 1] values for both weights and activation functions that has simultaneously achieved a higher level of accuracy and less computational load. In fact, first, TOT-Net introduces a simple bitwise logic for convolution computations to reduce the cost of multiply operations. To improve the accuracy, selecting proper activation function and learning rate are influential, but also difficult. As the second contribution, we propose a novel piece-wise activation function, and optimized learning rate for different datasets. Our findings first reveal that 0.01 is a preferable learning rate for the studied datasets. Third, by using an evolutionary optimization approach, we found novel piece-wise activation functions customized for TOT-Net. According to the experimental results, TOT-Net achieves 2.15%, 8.77%, and 5.7/5.52% better accuracy compared to XNOR-Net on CIFAR-10, CIFAR-100, and ImageNet top-5/top-1 datasets, respectively.","PeriodicalId":217233,"journal":{"name":"2019 22nd Euromicro Conference on Digital System Design (DSD)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 22nd Euromicro Conference on Digital System Design (DSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSD.2019.00052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

High computation demands and big memory resources are the major implementation challenges of Convolutional Neural Networks (CNNs) especially for low-power and resource-limited embedded devices. Many binarized neural networks are recently proposed to address these issues. Although they have significantly decreased computation and memory footprint, they have suffered from accuracy loss especially for large datasets. In this paper, we propose TOT-Net, a ternarized neural network with [-1, 0, 1] values for both weights and activation functions that has simultaneously achieved a higher level of accuracy and less computational load. In fact, first, TOT-Net introduces a simple bitwise logic for convolution computations to reduce the cost of multiply operations. To improve the accuracy, selecting proper activation function and learning rate are influential, but also difficult. As the second contribution, we propose a novel piece-wise activation function, and optimized learning rate for different datasets. Our findings first reveal that 0.01 is a preferable learning rate for the studied datasets. Third, by using an evolutionary optimization approach, we found novel piece-wise activation functions customized for TOT-Net. According to the experimental results, TOT-Net achieves 2.15%, 8.77%, and 5.7/5.52% better accuracy compared to XNOR-Net on CIFAR-10, CIFAR-100, and ImageNet top-5/top-1 datasets, respectively.

查看原文本刊更多论文

TOT-Net:优化三元神经网络的努力

高计算需求和大内存资源是卷积神经网络(cnn)实现的主要挑战，特别是对于低功耗和资源有限的嵌入式设备。最近提出了许多二值化神经网络来解决这些问题。尽管它们显著地减少了计算和内存占用，但它们的准确性受到了影响，特别是对于大型数据集。在本文中，我们提出了TOT-Net，这是一种权值和激活函数值都为[- 1,0,1]的三元化神经网络，它同时达到了更高的精度和更少的计算负荷。事实上，首先，TOT-Net为卷积计算引入了一个简单的位逻辑，以减少乘法运算的成本。选择合适的激活函数和学习率是提高准确率的重要因素，也是难点。作为第二个贡献，我们提出了一个新的分段激活函数，并优化了不同数据集的学习率。我们的研究结果首先表明，对于所研究的数据集来说，0.01是一个较好的学习率。第三，通过采用进化优化方法，我们找到了针对TOT-Net定制的新的分段激活函数。实验结果表明，在CIFAR-10、CIFAR-100和ImageNet top-5/top-1数据集上，TOT-Net的准确率分别比XNOR-Net高2.15%、8.77%和5.7/5.52%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 22nd Euromicro Conference on Digital System Design (DSD)

自引率

0.00%

发文量