Estimating sparse regression models in multi-task learning and transfer learning through adaptive penalisation

Armin Rauschenberger, Petr V Nazarov, Enrico Glaab

Bioinformatics (Oxford, England), published 2025-10-02
DOI: 10.1093/bioinformatics/btaf406
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12502914/pdf/
Citations: 0
Abstract
Method: Here, we propose a simple two-stage procedure for sharing information between related high-dimensional prediction or classification problems. In both stages, we perform sparse regression separately for each problem. The first stage uses no prior information; the second stage uses the coefficients from the first stage as prior information. Specifically, we design feature-specific and sign-specific adaptive weights to share information on feature selection, effect directions, and effect sizes between different problems.
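The two-stage idea can be sketched in a few lines. The following is an illustrative Python example using scikit-learn's Lasso, not the authors' R implementation ("sparselink"): the weighting formula, the regularisation strength, and the simulated data are all assumptions for demonstration, and the sign-specific part of the weights is omitted for brevity. It shows the core mechanism: stage-1 coefficients from the other task are turned into feature-specific penalty weights for stage 2, implemented via the standard column-rescaling trick for the adaptive lasso.

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n, p, n_tasks = 100, 50, 2

# Two related tasks sharing the same 5 active features (simulated data)
beta_true = np.zeros(p)
beta_true[:5] = 1.0
X = [rng.normal(size=(n, p)) for _ in range(n_tasks)]
y = [x @ beta_true + rng.normal(scale=0.5, size=n) for x in X]

# Stage 1: independent sparse regression per task, no prior information
stage1 = [Lasso(alpha=0.1).fit(X[k], y[k]).coef_ for k in range(n_tasks)]

# Adaptive weights (hypothetical formula, not the paper's exact scheme):
# a feature's penalty for task k shrinks when the OTHER tasks assign it
# a large coefficient, encouraging shared feature selection
eps = 1e-3
weights = []
for k in range(n_tasks):
    external = sum(np.abs(stage1[j]) for j in range(n_tasks) if j != k)
    weights.append(1.0 / (external + eps))

# Stage 2: adaptive lasso via column rescaling
# (dividing column j by w_j makes the effective penalty w_j * |beta_j|)
stage2 = []
for k in range(n_tasks):
    Xs = X[k] / weights[k]                 # rescale each column by 1/w_j
    fit = Lasso(alpha=0.1).fit(Xs, y[k])
    stage2.append(fit.coef_ / weights[k])  # map back to the original scale
```

Features unsupported by the other task receive a large weight (here about 1/eps), so stage 2 tends to drop them, while features selected across tasks keep a weight near one; this is one way to obtain the sparser second-stage models the abstract describes.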
Results: The proposed approach is applicable to multi-task learning as well as transfer learning. It provides sparse models (i.e. with few non-zero coefficients for each problem) that are easy to interpret. We show by simulation and application that it tends to select fewer features while achieving predictive performance similar to that of existing methods.
Availability and implementation: An implementation is available in the R package "sparselink" (https://github.com/rauschenberger/sparselink, https://cran.r-project.org/package=sparselink).