Qi Zhao, Lei Deng, Guoqi Li, Guanrui Wang, Cheng Ma
{"title":"多核神经网络芯片上无死锁的高效映射","authors":"Qi Zhao, Lei Deng, Guoqi Li, Guanrui Wang, Cheng Ma","doi":"10.1109/ICCA.2019.8899911","DOIUrl":null,"url":null,"abstract":"Many-core neural network chip is widely developed for the deep learning. Many-core architecture brings high parallelism while makes the model-to-core mapping intractable. Previous work focus on the functionality of the entire system, whereas, the mapping quality and deadlock issues have yet to be addressed well. In this paper, we present an algorithm which automatically maps a given neural network model onto the generic many-core chip architecture. Experimental results show that the proposed algorithm is quite efficient, and significant saving of the routing time can be achieved. Specifically, compared to the baseline of zigzag mapping, our solution is able to realize deadlock-free routing with 40.9% and 30.4% routing time saving for multi-layer perceptron and convolutional neural network applications, respectively.","PeriodicalId":130891,"journal":{"name":"2019 IEEE 15th International Conference on Control and Automation (ICCA)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Mapping without Deadlock on the Many-core Neural Network Chip\",\"authors\":\"Qi Zhao, Lei Deng, Guoqi Li, Guanrui Wang, Cheng Ma\",\"doi\":\"10.1109/ICCA.2019.8899911\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many-core neural network chip is widely developed for the deep learning. Many-core architecture brings high parallelism while makes the model-to-core mapping intractable. Previous work focus on the functionality of the entire system, whereas, the mapping quality and deadlock issues have yet to be addressed well. In this paper, we present an algorithm which automatically maps a given neural network model onto the generic many-core chip architecture. Experimental results show that the proposed algorithm is quite efficient, and significant saving of the routing time can be achieved. Specifically, compared to the baseline of zigzag mapping, our solution is able to realize deadlock-free routing with 40.9% and 30.4% routing time saving for multi-layer perceptron and convolutional neural network applications, respectively.\",\"PeriodicalId\":130891,\"journal\":{\"name\":\"2019 IEEE 15th International Conference on Control and Automation (ICCA)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 15th International Conference on Control and Automation (ICCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCA.2019.8899911\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 15th International Conference on Control and Automation (ICCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCA.2019.8899911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Efficient Mapping without Deadlock on the Many-core Neural Network Chip
Many-core neural network chip is widely developed for the deep learning. Many-core architecture brings high parallelism while makes the model-to-core mapping intractable. Previous work focus on the functionality of the entire system, whereas, the mapping quality and deadlock issues have yet to be addressed well. In this paper, we present an algorithm which automatically maps a given neural network model onto the generic many-core chip architecture. Experimental results show that the proposed algorithm is quite efficient, and significant saving of the routing time can be achieved. Specifically, compared to the baseline of zigzag mapping, our solution is able to realize deadlock-free routing with 40.9% and 30.4% routing time saving for multi-layer perceptron and convolutional neural network applications, respectively.