{"title":"药物协同和敏感性预测多任务学习中参数共享策略的系统研究","authors":"C. A. Hafsath, A. S. Jereesh","doi":"10.1007/s10822-026-00808-3","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Multi-task learning (MTL) is a useful approach for modeling related prediction tasks by sharing representations while allowing each task to keep its unique features. In computational drug discovery, drug synergy and drug sensitivity predictions are closely linked, but it is still unclear where and how to share model parameters between these tasks. In this study, we introduce a multi-task learning framework that predicts both drug synergy and drug sensitivity, with a main goal of improving synergy prediction by using sensitivity prediction as a supporting task. Our model uses molecular descriptors and drug induced gene expression signatures to describe pharmaceuticals, while untreated cancer cell lines are described by their baseline gene expression profiles. We use a mutual attention mechanism to capture complex interactions between drugs and cells. With this shared feature extraction base, we systematically test different parameter sharing strategies, looking at both hard and soft sharing at various network depths. Our experiments show that the success of parameter sharing depends on both the sharing strategy and the network layer where sharing happens. Hard parameter sharing works best at deeper layers, while soft parameter sharing allows for more stable and flexible knowledge transfer between tasks. Among the soft sharing methods we tested, dense sharing based on Neural Discriminative Dimensionality Reduction (NDDR) consistently outperforms cross stitch and sluice networks, especially when used in early network layers. Extending dense sharing to multiple layers further improves shared representations and leads to better performance. An updated NDDR version with nonlinear bottleneck transformations and residual connections achieves the highest overall performance. In summary, this work gives clear insight about where and how information should be shared in multi-task learning for drug synergy and sensitivity prediction. Our results show that both the sharing level and the way sharing is done strongly affect performance. Multi-level and depth-aware sharing strategies lead to better predictions. These findings give practical guidance for building effective multi-task models in computational drug discovery.</p>\n </div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"40 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2026-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A systematic study of parameter sharing strategies in multi-task learning for drug synergy and sensitivity prediction\",\"authors\":\"C. A. Hafsath, A. S. Jereesh\",\"doi\":\"10.1007/s10822-026-00808-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Multi-task learning (MTL) is a useful approach for modeling related prediction tasks by sharing representations while allowing each task to keep its unique features. In computational drug discovery, drug synergy and drug sensitivity predictions are closely linked, but it is still unclear where and how to share model parameters between these tasks. In this study, we introduce a multi-task learning framework that predicts both drug synergy and drug sensitivity, with a main goal of improving synergy prediction by using sensitivity prediction as a supporting task. Our model uses molecular descriptors and drug induced gene expression signatures to describe pharmaceuticals, while untreated cancer cell lines are described by their baseline gene expression profiles. We use a mutual attention mechanism to capture complex interactions between drugs and cells. With this shared feature extraction base, we systematically test different parameter sharing strategies, looking at both hard and soft sharing at various network depths. Our experiments show that the success of parameter sharing depends on both the sharing strategy and the network layer where sharing happens. Hard parameter sharing works best at deeper layers, while soft parameter sharing allows for more stable and flexible knowledge transfer between tasks. Among the soft sharing methods we tested, dense sharing based on Neural Discriminative Dimensionality Reduction (NDDR) consistently outperforms cross stitch and sluice networks, especially when used in early network layers. Extending dense sharing to multiple layers further improves shared representations and leads to better performance. An updated NDDR version with nonlinear bottleneck transformations and residual connections achieves the highest overall performance. In summary, this work gives clear insight about where and how information should be shared in multi-task learning for drug synergy and sensitivity prediction. Our results show that both the sharing level and the way sharing is done strongly affect performance. Multi-level and depth-aware sharing strategies lead to better predictions. These findings give practical guidance for building effective multi-task models in computational drug discovery.</p>\\n </div>\",\"PeriodicalId\":621,\"journal\":{\"name\":\"Journal of Computer-Aided Molecular Design\",\"volume\":\"40 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2026-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer-Aided Molecular Design\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10822-026-00808-3\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer-Aided Molecular Design","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10822-026-00808-3","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
A systematic study of parameter sharing strategies in multi-task learning for drug synergy and sensitivity prediction
Multi-task learning (MTL) is a useful approach for modeling related prediction tasks by sharing representations while allowing each task to keep its unique features. In computational drug discovery, drug synergy and drug sensitivity predictions are closely linked, but it is still unclear where and how to share model parameters between these tasks. In this study, we introduce a multi-task learning framework that predicts both drug synergy and drug sensitivity, with a main goal of improving synergy prediction by using sensitivity prediction as a supporting task. Our model uses molecular descriptors and drug induced gene expression signatures to describe pharmaceuticals, while untreated cancer cell lines are described by their baseline gene expression profiles. We use a mutual attention mechanism to capture complex interactions between drugs and cells. With this shared feature extraction base, we systematically test different parameter sharing strategies, looking at both hard and soft sharing at various network depths. Our experiments show that the success of parameter sharing depends on both the sharing strategy and the network layer where sharing happens. Hard parameter sharing works best at deeper layers, while soft parameter sharing allows for more stable and flexible knowledge transfer between tasks. Among the soft sharing methods we tested, dense sharing based on Neural Discriminative Dimensionality Reduction (NDDR) consistently outperforms cross stitch and sluice networks, especially when used in early network layers. Extending dense sharing to multiple layers further improves shared representations and leads to better performance. An updated NDDR version with nonlinear bottleneck transformations and residual connections achieves the highest overall performance. In summary, this work gives clear insight about where and how information should be shared in multi-task learning for drug synergy and sensitivity prediction. Our results show that both the sharing level and the way sharing is done strongly affect performance. Multi-level and depth-aware sharing strategies lead to better predictions. These findings give practical guidance for building effective multi-task models in computational drug discovery.
期刊介绍:
The Journal of Computer-Aided Molecular Design provides a form for disseminating information on both the theory and the application of computer-based methods in the analysis and design of molecules. The scope of the journal encompasses papers which report new and original research and applications in the following areas:
- theoretical chemistry;
- computational chemistry;
- computer and molecular graphics;
- molecular modeling;
- protein engineering;
- drug design;
- expert systems;
- general structure-property relationships;
- molecular dynamics;
- chemical database development and usage.