Efficient Unaligned Memory Access of Tightly Packed Weights for Deep Neural Network Inference on Edge Devices

2021 IEEE 27th International Symposium for Design and Technology in Electronic Packaging (SIITME) Pub Date : 2021-10-27 DOI:10.1109/SIITME53254.2021.9663723

Ciprian Seiculescu

{"title":"Efficient Unaligned Memory Access of Tightly Packed Weights for Deep Neural Network Inference on Edge Devices","authors":"Ciprian Seiculescu","doi":"10.1109/SIITME53254.2021.9663723","DOIUrl":null,"url":null,"abstract":"The increase in computational power enabled complex problems to be solved by employing new techniques from the field of Artificial Intelligence (AI) based on Deep Neural Networks (DNN) and Deep Learning (DL). A recent trend is to apply these techniques that have proven to generate excellent results to Edge computing. However, Edge computing is based on simple low power devices, which are severely restricted in terms of computational power and especially by the available memory size. Being able to pack the Neural Network parameters in the available memory efficiently is a must. Normally, memory systems expect transactions to be aligned to the bus size for maximum performance. This can result in inefficient memory utilization, as the groups of parameters required to be read in parallel need to be stored aligned in memory. In this paper, I present a memory controller to provide unaligned memory transfers at full bus width. When using this controller, the memory efficiency can be increased by 25%, while preserving the memory access time.","PeriodicalId":426485,"journal":{"name":"2021 IEEE 27th International Symposium for Design and Technology in Electronic Packaging (SIITME)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 27th International Symposium for Design and Technology in Electronic Packaging (SIITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIITME53254.2021.9663723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The increase in computational power enabled complex problems to be solved by employing new techniques from the field of Artificial Intelligence (AI) based on Deep Neural Networks (DNN) and Deep Learning (DL). A recent trend is to apply these techniques that have proven to generate excellent results to Edge computing. However, Edge computing is based on simple low power devices, which are severely restricted in terms of computational power and especially by the available memory size. Being able to pack the Neural Network parameters in the available memory efficiently is a must. Normally, memory systems expect transactions to be aligned to the bus size for maximum performance. This can result in inefficient memory utilization, as the groups of parameters required to be read in parallel need to be stored aligned in memory. In this paper, I present a memory controller to provide unaligned memory transfers at full bus width. When using this controller, the memory efficiency can be increased by 25%, while preserving the memory access time.

查看原文本刊更多论文

边缘设备上深度神经网络推理中紧包权的高效非对齐内存访问

计算能力的提高使得基于深度神经网络(DNN)和深度学习(DL)的人工智能(AI)领域的新技术能够解决复杂的问题。最近的一个趋势是将这些技术应用于边缘计算，这些技术已被证明可以产生出色的结果。然而，边缘计算基于简单的低功耗设备，这在计算能力方面受到严重限制，特别是受可用内存大小的限制。能够有效地在可用内存中打包神经网络参数是必须的。通常，内存系统希望事务与总线大小保持一致，以获得最大性能。这可能导致内存利用率低下，因为需要并行读取的参数组需要在内存中进行对齐存储。在本文中，我提出了一个内存控制器，以提供全总线宽度的非对齐内存传输。当使用该控制器时，内存效率可提高25%，同时保留内存访问时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 27th International Symposium for Design and Technology in Electronic Packaging (SIITME)

自引率

0.00%

发文量