{"title":"贝叶斯对自适应低函数适应的解释","authors":"Haolin Chen, Philip N. Garner","doi":"arxiv-2409.10673","DOIUrl":null,"url":null,"abstract":"Motivated by the sensitivity-based importance score of the adaptive low-rank\nadaptation (AdaLoRA), we utilize more theoretically supported metrics,\nincluding the signal-to-noise ratio (SNR), along with the Improved Variational\nOnline Newton (IVON) optimizer, for adaptive parameter budget allocation. The\nresulting Bayesian counterpart not only has matched or surpassed the\nperformance of using the sensitivity-based importance metric but is also a\nfaster alternative to AdaLoRA with Adam. Our theoretical analysis reveals a\nsignificant connection between the two metrics, providing a Bayesian\nperspective on the efficacy of sensitivity as an importance score. Furthermore,\nour findings suggest that the magnitude, rather than the variance, is the\nprimary indicator of the importance of parameters.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Bayesian Interpretation of Adaptive Low-Rank Adaptation\",\"authors\":\"Haolin Chen, Philip N. Garner\",\"doi\":\"arxiv-2409.10673\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Motivated by the sensitivity-based importance score of the adaptive low-rank\\nadaptation (AdaLoRA), we utilize more theoretically supported metrics,\\nincluding the signal-to-noise ratio (SNR), along with the Improved Variational\\nOnline Newton (IVON) optimizer, for adaptive parameter budget allocation. The\\nresulting Bayesian counterpart not only has matched or surpassed the\\nperformance of using the sensitivity-based importance metric but is also a\\nfaster alternative to AdaLoRA with Adam. Our theoretical analysis reveals a\\nsignificant connection between the two metrics, providing a Bayesian\\nperspective on the efficacy of sensitivity as an importance score. Furthermore,\\nour findings suggest that the magnitude, rather than the variance, is the\\nprimary indicator of the importance of parameters.\",\"PeriodicalId\":501340,\"journal\":{\"name\":\"arXiv - STAT - Machine Learning\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.10673\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10673","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
受自适应低秩适应(AdaLoRA)基于灵敏度的重要性评分的启发,我们利用更多理论支持的指标,包括信噪比(SNR),以及改进的变异在线牛顿(IVON)优化器,进行自适应参数预算分配。由此产生的贝叶斯对应算法不仅与使用基于灵敏度的重要性度量的性能相当,甚至超过了后者,而且比使用 Adam 的 AdaLoRA 更快。我们的理论分析揭示了这两个指标之间的重要联系,为灵敏度作为重要性评分的有效性提供了贝叶斯视角。此外,我们的研究结果表明,幅度而非方差是衡量参数重要性的主要指标。
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
Motivated by the sensitivity-based importance score of the adaptive low-rank
adaptation (AdaLoRA), we utilize more theoretically supported metrics,
including the signal-to-noise ratio (SNR), along with the Improved Variational
Online Newton (IVON) optimizer, for adaptive parameter budget allocation. The
resulting Bayesian counterpart not only has matched or surpassed the
performance of using the sensitivity-based importance metric but is also a
faster alternative to AdaLoRA with Adam. Our theoretical analysis reveals a
significant connection between the two metrics, providing a Bayesian
perspective on the efficacy of sensitivity as an importance score. Furthermore,
our findings suggest that the magnitude, rather than the variance, is the
primary indicator of the importance of parameters.