{"title":"神经网络中的最优修剪。","authors":"D M Barbato, O Kinouchi","doi":"10.1103/physreve.62.8387","DOIUrl":null,"url":null,"abstract":"<p><p>We study pruning strategies in simple perceptrons subjected to supervised learning. Our analytical results, obtained through the statistical mechanics approach to learning theory, are independent of the learning algorithm used in the training process. We calculate the post-training distribution P(J) of synaptic weights, which depends only on the overlap rho(0) achieved by the learning algorithm before pruning and the fraction kappa of relevant weights in the teacher network. From this distribution, we calculate the optimal pruning strategy for deleting small weights. The optimal pruning threshold grows from zero as straight theta(opt)(rho(0), kappa) approximately [rho(0)-rho(c)(kappa)](1/2) above some critical value rho(c)(kappa). Thus, the elimination of weak synapses enhances the network performance only after a critical learning period. Possible implications for biological pruning phenomena are discussed.</p>","PeriodicalId":20079,"journal":{"name":"Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics","volume":"62 6 Pt B","pages":"8387-94"},"PeriodicalIF":0.0000,"publicationDate":"2000-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1103/physreve.62.8387","citationCount":"11","resultStr":"{\"title\":\"Optimal pruning in neural networks.\",\"authors\":\"D M Barbato, O Kinouchi\",\"doi\":\"10.1103/physreve.62.8387\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>We study pruning strategies in simple perceptrons subjected to supervised learning. Our analytical results, obtained through the statistical mechanics approach to learning theory, are independent of the learning algorithm used in the training process. We calculate the post-training distribution P(J) of synaptic weights, which depends only on the overlap rho(0) achieved by the learning algorithm before pruning and the fraction kappa of relevant weights in the teacher network. From this distribution, we calculate the optimal pruning strategy for deleting small weights. The optimal pruning threshold grows from zero as straight theta(opt)(rho(0), kappa) approximately [rho(0)-rho(c)(kappa)](1/2) above some critical value rho(c)(kappa). Thus, the elimination of weak synapses enhances the network performance only after a critical learning period. Possible implications for biological pruning phenomena are discussed.</p>\",\"PeriodicalId\":20079,\"journal\":{\"name\":\"Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics\",\"volume\":\"62 6 Pt B\",\"pages\":\"8387-94\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1103/physreve.62.8387\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical review. 
E, Statistical physics, plasmas, fluids, and related interdisciplinary topics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1103/physreve.62.8387\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1103/physreve.62.8387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
We study pruning strategies in simple perceptrons subjected to supervised learning. Our analytical results, obtained through the statistical mechanics approach to learning theory, are independent of the learning algorithm used in the training process. We calculate the post-training distribution $P(J)$ of synaptic weights, which depends only on the overlap $\rho_0$ achieved by the learning algorithm before pruning and the fraction $\kappa$ of relevant weights in the teacher network. From this distribution, we calculate the optimal pruning strategy for deleting small weights. The optimal pruning threshold grows from zero as $\theta_{\mathrm{opt}}(\rho_0, \kappa) \approx [\rho_0 - \rho_c(\kappa)]^{1/2}$ above some critical value $\rho_c(\kappa)$. Thus, the elimination of weak synapses enhances the network performance only after a critical learning period. Possible implications for biological pruning phenomena are discussed.
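As a rough illustration of the pruning rule the abstract describes, the following Python sketch zeroes out weights whose magnitude falls below a threshold that scales as $[\rho_0 - \rho_c(\kappa)]^{1/2}$. This is a minimal sketch under stated assumptions, not the paper's method: the function name `prune_small_weights`, the prefactor `scale`, and the use of NumPy are illustrative choices, and the paper derives the exact optimal threshold analytically from the post-training distribution $P(J)$, which is not reproduced here.

```python
# Illustrative sketch only: magnitude-based pruning of perceptron weights.
# The threshold theta_opt ~ sqrt(rho0 - rho_c) mimics the scaling law quoted
# in the abstract; the prefactor `scale` is a hypothetical placeholder.
import numpy as np

def prune_small_weights(J, rho0, rho_c, scale=1.0):
    """Zero out weights below a threshold growing as sqrt(rho0 - rho_c).

    J     : array of trained synaptic weights
    rho0  : teacher-student overlap reached by training before pruning
    rho_c : critical overlap below which pruning does not help
    scale : assumed prefactor (not specified by the abstract)
    """
    if rho0 <= rho_c:
        # Below the critical overlap the optimal threshold is zero:
        # deleting weak synapses would not improve performance.
        return J.copy()
    theta_opt = scale * np.sqrt(rho0 - rho_c)
    pruned = J.copy()
    pruned[np.abs(pruned) < theta_opt] = 0.0
    return pruned

# Usage example with a random weight vector after "sufficient" learning.
rng = np.random.default_rng(0)
J = rng.normal(size=1000)
J_pruned = prune_small_weights(J, rho0=0.8, rho_c=0.5)
print(f"kept {np.count_nonzero(J_pruned)} of {J.size} weights")
```

The guard clause mirrors the abstract's central claim: for $\rho_0 \le \rho_c(\kappa)$ the sketch returns the weights unchanged, since eliminating weak synapses pays off only after the critical learning period.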