{"title":"Optimization on parametric model","authors":"Fenfen Huang, Wenbin Yao","doi":"10.1109/NOMS.2018.8406287","DOIUrl":null,"url":null,"abstract":"Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems with large number of weights consume considerable storage and memory bandwidth. To address this limitation, pruning is an effective way to compress neural networks with high accuracy. To address this limitation, we proposed a method for optimization on parametric model. This method contains three steps. First, we train the network like conventional training. Next, we prune the unimportant connections and retrain the network to get the sparse weight matrix. Finally, we use Singularly Valuable Decomposition (SVD) to do further compression on the sparse weight matrix. Our experiments on MNIST dataset show that our method has the ability on reducing the model size by 4 times and the accuracy could still be kept over 90%.","PeriodicalId":19331,"journal":{"name":"NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium","volume":"31 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NOMS.2018.8406287","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems with large number of weights consume considerable storage and memory bandwidth. To address this limitation, pruning is an effective way to compress neural networks with high accuracy. To address this limitation, we proposed a method for optimization on parametric model. This method contains three steps. First, we train the network like conventional training. Next, we prune the unimportant connections and retrain the network to get the sparse weight matrix. Finally, we use Singularly Valuable Decomposition (SVD) to do further compression on the sparse weight matrix. Our experiments on MNIST dataset show that our method has the ability on reducing the model size by 4 times and the accuracy could still be kept over 90%.