{"title":"神经网络中的最优修剪。","authors":"D M Barbato, O Kinouchi","doi":"10.1103/physreve.62.8387","DOIUrl":null,"url":null,"abstract":"<p><p>We study pruning strategies in simple perceptrons subjected to supervised learning. Our analytical results, obtained through the statistical mechanics approach to learning theory, are independent of the learning algorithm used in the training process. We calculate the post-training distribution P(J) of synaptic weights, which depends only on the overlap rho(0) achieved by the learning algorithm before pruning and the fraction kappa of relevant weights in the teacher network. From this distribution, we calculate the optimal pruning strategy for deleting small weights. The optimal pruning threshold grows from zero as straight theta(opt)(rho(0), kappa) approximately [rho(0)-rho(c)(kappa)](1/2) above some critical value rho(c)(kappa). Thus, the elimination of weak synapses enhances the network performance only after a critical learning period. Possible implications for biological pruning phenomena are discussed.</p>","PeriodicalId":20079,"journal":{"name":"Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics","volume":"62 6 Pt B","pages":"8387-94"},"PeriodicalIF":0.0000,"publicationDate":"2000-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1103/physreve.62.8387","citationCount":"11","resultStr":"{\"title\":\"Optimal pruning in neural networks.\",\"authors\":\"D M Barbato, O Kinouchi\",\"doi\":\"10.1103/physreve.62.8387\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>We study pruning strategies in simple perceptrons subjected to supervised learning. Our analytical results, obtained through the statistical mechanics approach to learning theory, are independent of the learning algorithm used in the training process. We calculate the post-training distribution P(J) of synaptic weights, which depends only on the overlap rho(0) achieved by the learning algorithm before pruning and the fraction kappa of relevant weights in the teacher network. From this distribution, we calculate the optimal pruning strategy for deleting small weights. The optimal pruning threshold grows from zero as straight theta(opt)(rho(0), kappa) approximately [rho(0)-rho(c)(kappa)](1/2) above some critical value rho(c)(kappa). Thus, the elimination of weak synapses enhances the network performance only after a critical learning period. Possible implications for biological pruning phenomena are discussed.</p>\",\"PeriodicalId\":20079,\"journal\":{\"name\":\"Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics\",\"volume\":\"62 6 Pt B\",\"pages\":\"8387-94\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1103/physreve.62.8387\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical review. 
E, Statistical physics, plasmas, fluids, and related interdisciplinary topics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1103/physreve.62.8387\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1103/physreve.62.8387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
We study pruning strategies in simple perceptrons subjected to supervised learning. Our analytical results, obtained through the statistical mechanics approach to learning theory, are independent of the learning algorithm used in the training process. We calculate the post-training distribution $P(J)$ of synaptic weights, which depends only on the overlap $\rho_0$ achieved by the learning algorithm before pruning and the fraction $\kappa$ of relevant weights in the teacher network. From this distribution, we calculate the optimal pruning strategy for deleting small weights. The optimal pruning threshold grows from zero as $\theta_{\mathrm{opt}}(\rho_0, \kappa) \approx [\rho_0 - \rho_c(\kappa)]^{1/2}$ above some critical value $\rho_c(\kappa)$. Thus, the elimination of weak synapses enhances the network performance only after a critical learning period. Possible implications for biological pruning phenomena are discussed.
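As a rough illustration of the pruning rule the abstract describes, the following Python sketch zeroes out weights whose magnitude falls below a threshold that scales as $[\rho_0 - \rho_c(\kappa)]^{1/2}$. This is a minimal sketch under stated assumptions, not the paper's method: the function name `prune_small_weights`, the prefactor `scale`, and the use of NumPy are illustrative choices, and the paper derives the exact optimal threshold analytically from the post-training distribution $P(J)$, which is not reproduced here.

```python
# Illustrative sketch only: magnitude-based pruning of perceptron weights.
# The threshold theta_opt ~ sqrt(rho0 - rho_c) mimics the scaling law quoted
# in the abstract; the prefactor `scale` is a hypothetical placeholder.
import numpy as np

def prune_small_weights(J, rho0, rho_c, scale=1.0):
    """Zero out weights below a threshold growing as sqrt(rho0 - rho_c).

    J     : array of trained synaptic weights
    rho0  : teacher-student overlap reached by training before pruning
    rho_c : critical overlap below which pruning does not help
    scale : assumed prefactor (not specified by the abstract)
    """
    if rho0 <= rho_c:
        # Below the critical overlap the optimal threshold is zero:
        # deleting weak synapses would not improve performance.
        return J.copy()
    theta_opt = scale * np.sqrt(rho0 - rho_c)
    pruned = J.copy()
    pruned[np.abs(pruned) < theta_opt] = 0.0
    return pruned

# Usage example with a random weight vector after "sufficient" learning.
rng = np.random.default_rng(0)
J = rng.normal(size=1000)
J_pruned = prune_small_weights(J, rho0=0.8, rho_c=0.5)
print(f"kept {np.count_nonzero(J_pruned)} of {J.size} weights")
```

The guard clause mirrors the abstract's central claim: for $\rho_0 \le \rho_c(\kappa)$ the sketch returns the weights unchanged, since eliminating weak synapses pays off only after the critical learning period.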