药物协同和敏感性预测多任务学习中参数共享策略的系统研究

IF 3.1 3区生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY

Journal of Computer-Aided Molecular Design Pub Date : 2026-04-25 DOI:10.1007/s10822-026-00808-3

C. A. Hafsath, A. S. Jereesh

{"title":"药物协同和敏感性预测多任务学习中参数共享策略的系统研究","authors":"C. A. Hafsath, A. S. Jereesh","doi":"10.1007/s10822-026-00808-3","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Multi-task learning (MTL) is a useful approach for modeling related prediction tasks by sharing representations while allowing each task to keep its unique features. In computational drug discovery, drug synergy and drug sensitivity predictions are closely linked, but it is still unclear where and how to share model parameters between these tasks. In this study, we introduce a multi-task learning framework that predicts both drug synergy and drug sensitivity, with a main goal of improving synergy prediction by using sensitivity prediction as a supporting task. Our model uses molecular descriptors and drug induced gene expression signatures to describe pharmaceuticals, while untreated cancer cell lines are described by their baseline gene expression profiles. We use a mutual attention mechanism to capture complex interactions between drugs and cells. With this shared feature extraction base, we systematically test different parameter sharing strategies, looking at both hard and soft sharing at various network depths. Our experiments show that the success of parameter sharing depends on both the sharing strategy and the network layer where sharing happens. Hard parameter sharing works best at deeper layers, while soft parameter sharing allows for more stable and flexible knowledge transfer between tasks. Among the soft sharing methods we tested, dense sharing based on Neural Discriminative Dimensionality Reduction (NDDR) consistently outperforms cross stitch and sluice networks, especially when used in early network layers. Extending dense sharing to multiple layers further improves shared representations and leads to better performance. An updated NDDR version with nonlinear bottleneck transformations and residual connections achieves the highest overall performance. In summary, this work gives clear insight about where and how information should be shared in multi-task learning for drug synergy and sensitivity prediction. Our results show that both the sharing level and the way sharing is done strongly affect performance. Multi-level and depth-aware sharing strategies lead to better predictions. These findings give practical guidance for building effective multi-task models in computational drug discovery.</p>\n </div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"40 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2026-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A systematic study of parameter sharing strategies in multi-task learning for drug synergy and sensitivity prediction\",\"authors\":\"C. A. Hafsath, A. S. Jereesh\",\"doi\":\"10.1007/s10822-026-00808-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Multi-task learning (MTL) is a useful approach for modeling related prediction tasks by sharing representations while allowing each task to keep its unique features. In computational drug discovery, drug synergy and drug sensitivity predictions are closely linked, but it is still unclear where and how to share model parameters between these tasks. In this study, we introduce a multi-task learning framework that predicts both drug synergy and drug sensitivity, with a main goal of improving synergy prediction by using sensitivity prediction as a supporting task. Our model uses molecular descriptors and drug induced gene expression signatures to describe pharmaceuticals, while untreated cancer cell lines are described by their baseline gene expression profiles. We use a mutual attention mechanism to capture complex interactions between drugs and cells. With this shared feature extraction base, we systematically test different parameter sharing strategies, looking at both hard and soft sharing at various network depths. Our experiments show that the success of parameter sharing depends on both the sharing strategy and the network layer where sharing happens. Hard parameter sharing works best at deeper layers, while soft parameter sharing allows for more stable and flexible knowledge transfer between tasks. Among the soft sharing methods we tested, dense sharing based on Neural Discriminative Dimensionality Reduction (NDDR) consistently outperforms cross stitch and sluice networks, especially when used in early network layers. Extending dense sharing to multiple layers further improves shared representations and leads to better performance. An updated NDDR version with nonlinear bottleneck transformations and residual connections achieves the highest overall performance. In summary, this work gives clear insight about where and how information should be shared in multi-task learning for drug synergy and sensitivity prediction. Our results show that both the sharing level and the way sharing is done strongly affect performance. Multi-level and depth-aware sharing strategies lead to better predictions. These findings give practical guidance for building effective multi-task models in computational drug discovery.</p>\\n </div>\",\"PeriodicalId\":621,\"journal\":{\"name\":\"Journal of Computer-Aided Molecular Design\",\"volume\":\"40 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2026-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer-Aided Molecular Design\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10822-026-00808-3\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer-Aided Molecular Design","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10822-026-00808-3","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

多任务学习（Multi-task learning， MTL）是一种有用的建模方法，它通过共享表示来实现相关预测任务的建模，同时允许每个任务保持其独特的特征。在计算药物发现中，药物协同作用和药物敏感性预测密切相关，但尚不清楚这些任务之间在何处以及如何共享模型参数。在本研究中，我们引入了一个预测药物协同作用和药物敏感性的多任务学习框架，其主要目标是通过使用敏感性预测作为支持任务来改进协同作用预测。我们的模型使用分子描述符和药物诱导的基因表达特征来描述药物，而未经治疗的癌细胞系则通过其基线基因表达谱来描述。我们使用一种相互注意机制来捕捉药物和细胞之间复杂的相互作用。有了这个共享的特征提取库，我们系统地测试了不同的参数共享策略，研究了不同网络深度下的硬共享和软共享。我们的实验表明，参数共享的成功取决于共享策略和共享发生的网络层。硬参数共享在更深的层中效果最好，而软参数共享允许更稳定和灵活的任务之间的知识传递。在我们测试的软共享方法中，基于神经判别降维（NDDR）的密集共享始终优于十字绣和水闸网络，特别是在早期网络层中使用时。将密集共享扩展到多个层进一步改进了共享表示并带来更好的性能。具有非线性瓶颈转换和残余连接的更新NDDR版本实现了最高的整体性能。总之，这项工作提供了清晰的见解，在药物协同作用和敏感性预测的多任务学习中，信息应该在哪里以及如何共享。我们的研究结果表明，共享水平和共享方式对性能都有很大的影响。多层次和深度感知共享策略导致更好的预测。这些发现为在计算药物发现中建立有效的多任务模型提供了实际指导。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A systematic study of parameter sharing strategies in multi-task learning for drug synergy and sensitivity prediction

Multi-task learning (MTL) is a useful approach for modeling related prediction tasks by sharing representations while allowing each task to keep its unique features. In computational drug discovery, drug synergy and drug sensitivity predictions are closely linked, but it is still unclear where and how to share model parameters between these tasks. In this study, we introduce a multi-task learning framework that predicts both drug synergy and drug sensitivity, with a main goal of improving synergy prediction by using sensitivity prediction as a supporting task. Our model uses molecular descriptors and drug induced gene expression signatures to describe pharmaceuticals, while untreated cancer cell lines are described by their baseline gene expression profiles. We use a mutual attention mechanism to capture complex interactions between drugs and cells. With this shared feature extraction base, we systematically test different parameter sharing strategies, looking at both hard and soft sharing at various network depths. Our experiments show that the success of parameter sharing depends on both the sharing strategy and the network layer where sharing happens. Hard parameter sharing works best at deeper layers, while soft parameter sharing allows for more stable and flexible knowledge transfer between tasks. Among the soft sharing methods we tested, dense sharing based on Neural Discriminative Dimensionality Reduction (NDDR) consistently outperforms cross stitch and sluice networks, especially when used in early network layers. Extending dense sharing to multiple layers further improves shared representations and leads to better performance. An updated NDDR version with nonlinear bottleneck transformations and residual connections achieves the highest overall performance. In summary, this work gives clear insight about where and how information should be shared in multi-task learning for drug synergy and sensitivity prediction. Our results show that both the sharing level and the way sharing is done strongly affect performance. Multi-level and depth-aware sharing strategies lead to better predictions. These findings give practical guidance for building effective multi-task models in computational drug discovery.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Computer-Aided Molecular Design 生物-计算机：跨学科应用

CiteScore

8.00

自引率

8.60%

发文量

审稿时长

3 months

期刊介绍： The Journal of Computer-Aided Molecular Design provides a form for disseminating information on both the theory and the application of computer-based methods in the analysis and design of molecules. The scope of the journal encompasses papers which report new and original research and applications in the following areas: - theoretical chemistry; - computational chemistry; - computer and molecular graphics; - molecular modeling; - protein engineering; - drug design; - expert systems; - general structure-property relationships; - molecular dynamics; - chemical database development and usage.