多核神经网络芯片上无死锁的高效映射

2019 IEEE 15th International Conference on Control and Automation (ICCA) Pub Date : 2019-07-16 DOI:10.1109/ICCA.2019.8899911

Qi Zhao, Lei Deng, Guoqi Li, Guanrui Wang, Cheng Ma

{"title":"多核神经网络芯片上无死锁的高效映射","authors":"Qi Zhao, Lei Deng, Guoqi Li, Guanrui Wang, Cheng Ma","doi":"10.1109/ICCA.2019.8899911","DOIUrl":null,"url":null,"abstract":"Many-core neural network chip is widely developed for the deep learning. Many-core architecture brings high parallelism while makes the model-to-core mapping intractable. Previous work focus on the functionality of the entire system, whereas, the mapping quality and deadlock issues have yet to be addressed well. In this paper, we present an algorithm which automatically maps a given neural network model onto the generic many-core chip architecture. Experimental results show that the proposed algorithm is quite efficient, and significant saving of the routing time can be achieved. Specifically, compared to the baseline of zigzag mapping, our solution is able to realize deadlock-free routing with 40.9% and 30.4% routing time saving for multi-layer perceptron and convolutional neural network applications, respectively.","PeriodicalId":130891,"journal":{"name":"2019 IEEE 15th International Conference on Control and Automation (ICCA)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Mapping without Deadlock on the Many-core Neural Network Chip\",\"authors\":\"Qi Zhao, Lei Deng, Guoqi Li, Guanrui Wang, Cheng Ma\",\"doi\":\"10.1109/ICCA.2019.8899911\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many-core neural network chip is widely developed for the deep learning. Many-core architecture brings high parallelism while makes the model-to-core mapping intractable. Previous work focus on the functionality of the entire system, whereas, the mapping quality and deadlock issues have yet to be addressed well. In this paper, we present an algorithm which automatically maps a given neural network model onto the generic many-core chip architecture. Experimental results show that the proposed algorithm is quite efficient, and significant saving of the routing time can be achieved. Specifically, compared to the baseline of zigzag mapping, our solution is able to realize deadlock-free routing with 40.9% and 30.4% routing time saving for multi-layer perceptron and convolutional neural network applications, respectively.\",\"PeriodicalId\":130891,\"journal\":{\"name\":\"2019 IEEE 15th International Conference on Control and Automation (ICCA)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 15th International Conference on Control and Automation (ICCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCA.2019.8899911\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 15th International Conference on Control and Automation (ICCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCA.2019.8899911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

多核神经网络芯片在深度学习领域得到了广泛的发展。多核架构带来了高并行性，但也使得模型到核心的映射难以处理。以前的工作集中在整个系统的功能上，然而，映射质量和死锁问题还没有得到很好的解决。本文提出了一种将给定的神经网络模型自动映射到通用多核芯片架构上的算法。实验结果表明，该算法具有较高的效率，可以显著节省路由时间。具体而言，与之形映射基线相比，我们的解决方案能够实现无死锁路由，在多层感知器和卷积神经网络应用中分别节省40.9%和30.4%的路由时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Efficient Mapping without Deadlock on the Many-core Neural Network Chip

Many-core neural network chip is widely developed for the deep learning. Many-core architecture brings high parallelism while makes the model-to-core mapping intractable. Previous work focus on the functionality of the entire system, whereas, the mapping quality and deadlock issues have yet to be addressed well. In this paper, we present an algorithm which automatically maps a given neural network model onto the generic many-core chip architecture. Experimental results show that the proposed algorithm is quite efficient, and significant saving of the routing time can be achieved. Specifically, compared to the baseline of zigzag mapping, our solution is able to realize deadlock-free routing with 40.9% and 30.4% routing time saving for multi-layer perceptron and convolutional neural network applications, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE 15th International Conference on Control and Automation (ICCA)

自引率

0.00%

发文量