Shuhui Wang, Alexandre Allauzen, Philippe Nghe, Vaitea Opuu
{"title":"协同药物发现中的主动学习指南","authors":"Shuhui Wang, Alexandre Allauzen, Philippe Nghe, Vaitea Opuu","doi":"10.1101/2024.09.13.612819","DOIUrl":null,"url":null,"abstract":"Synergistic drug combination screening is a promising strategy in drug discovery, but it involves navigating a costly and complex search space. While AI, particularly deep learning, has advanced synergy predictions, its effectiveness is limited by the low occurrence of synergistic drug pairs. Active learning, which integrates experimental testing into the learning process, has been proposed to address this challenge. In this work, we explore the key components of active learning to provide recommendations for its implementation. We find that molecular encoding has a limited impact on performance, while the cellular environment features significantly enhance predictions. Additionally, active learning can discover 60% of synergistic drug pairs with only exploring 10% of combinatorial space. The synergy yield ratio is observed to be even higher with smaller batch sizes, where dynamic tuning of the exploration-exploitation strategy can further enhance performance.","PeriodicalId":501307,"journal":{"name":"bioRxiv - Bioinformatics","volume":"37 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Guide for Active Learning in Synergistic Drug Discovery\",\"authors\":\"Shuhui Wang, Alexandre Allauzen, Philippe Nghe, Vaitea Opuu\",\"doi\":\"10.1101/2024.09.13.612819\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Synergistic drug combination screening is a promising strategy in drug discovery, but it involves navigating a costly and complex search space. While AI, particularly deep learning, has advanced synergy predictions, its effectiveness is limited by the low occurrence of synergistic drug pairs. Active learning, which integrates experimental testing into the learning process, has been proposed to address this challenge. In this work, we explore the key components of active learning to provide recommendations for its implementation. We find that molecular encoding has a limited impact on performance, while the cellular environment features significantly enhance predictions. Additionally, active learning can discover 60% of synergistic drug pairs with only exploring 10% of combinatorial space. The synergy yield ratio is observed to be even higher with smaller batch sizes, where dynamic tuning of the exploration-exploitation strategy can further enhance performance.\",\"PeriodicalId\":501307,\"journal\":{\"name\":\"bioRxiv - Bioinformatics\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"bioRxiv - Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.09.13.612819\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.13.612819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Guide for Active Learning in Synergistic Drug Discovery
Synergistic drug combination screening is a promising strategy in drug discovery, but it involves navigating a costly and complex search space. While AI, particularly deep learning, has advanced synergy predictions, its effectiveness is limited by the low occurrence of synergistic drug pairs. Active learning, which integrates experimental testing into the learning process, has been proposed to address this challenge. In this work, we explore the key components of active learning to provide recommendations for its implementation. We find that molecular encoding has a limited impact on performance, while the cellular environment features significantly enhance predictions. Additionally, active learning can discover 60% of synergistic drug pairs with only exploring 10% of combinatorial space. The synergy yield ratio is observed to be even higher with smaller batch sizes, where dynamic tuning of the exploration-exploitation strategy can further enhance performance.