Jose Gallego-Posada, Juan Ramirez de los Rios, Akram Erraqabi
{"title":"Flexible Learning of Sparse Neural Networks via Constrained L0 Regularizations","authors":"Jose Gallego-Posada, Juan Ramirez de los Rios, Akram Erraqabi","doi":"10.52591/lxai202112071","DOIUrl":null,"url":null,"abstract":"We propose to approach the problem of learning L 0 -sparse networks using a constrained formulation of the optimization problem. This is in contrast to commonly used penalized approaches, which combine the regularization terms additively with the (surrogate) empirical risk. Our experiments demonstrate that we can obtain approximate solutions to the constrained optimization problem with comparable performance to state-of-the art methods for L 0 -sparse training. Finally, we discuss how this constrained approach provides greater (hyper-)parameter interpretability and accountability from a practitioner’s point of view.","PeriodicalId":355096,"journal":{"name":"LatinX in AI at Neural Information Processing Systems Conference 2021","volume":"122 7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"LatinX in AI at Neural Information Processing Systems Conference 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52591/lxai202112071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
We propose to approach the problem of learning L 0 -sparse networks using a constrained formulation of the optimization problem. This is in contrast to commonly used penalized approaches, which combine the regularization terms additively with the (surrogate) empirical risk. Our experiments demonstrate that we can obtain approximate solutions to the constrained optimization problem with comparable performance to state-of-the art methods for L 0 -sparse training. Finally, we discuss how this constrained approach provides greater (hyper-)parameter interpretability and accountability from a practitioner’s point of view.