{"title":"Exploiting similarity in drug molecular effects for drug repurposing.","authors":"Katie Huang, Panagiotis Nikolaos Lalagkas, Beftu Sultan, Rachel Melamed","doi":"10.1186/s40246-025-00808-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Using large data to propose new uses of drugs has potential to rapidly prioritize new treatments for major diseases of public health importance. One comprehensive data set, LINCS L1000 Connectivity Map, profiles gene expression associated with thousands of compounds, including many with known clinical uses. But, some recent studies have questioned the reliability of this data, and the best approach to use this resource for drug repositioning is not well established.</p><p><strong>Methods: </strong>Here, we develop a novel generalizable approach by hypothesizing that new treatments for a disease should induce similar gene expression to existing treatments for a disease. Using the Drug Repurposing Hub compendium of known treatments, we formulate a combined logistic regression model to predict new drug indications, and we assess generalizability of our findings using independent clinical trials on experimental drug uses.</p><p><strong>Results: </strong>We support the hypothesis that drugs sharing an indication induce more similar gene expression, additionally demonstrating that the simpler Spearman correlation (p = 7.71e-38), outperforms the popular Connectivity Score (p = 5.2e-6). Our final model, combining predicted drug indications across three diverse cell lines, generalizes to predict experimental clinical trials with AUC of 0.708.</p><p><strong>Conclusions: </strong>By developing a new approach to using LINCS L1000 data for drug repositioning, we both propose plausible new disease treatments and provide an interpretable rationale for predictions. Our findings not only put forward new drug repositioning candidates, browseable at https://bsultan.shinyapps.io/web-app , but they also provide guidelines for future researchers employing L1000 data for drug repurposing.</p>","PeriodicalId":13183,"journal":{"name":"Human Genomics","volume":"19 1","pages":"110"},"PeriodicalIF":4.3000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12487472/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Genomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40246-025-00808-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Using large data to propose new uses of drugs has potential to rapidly prioritize new treatments for major diseases of public health importance. One comprehensive data set, LINCS L1000 Connectivity Map, profiles gene expression associated with thousands of compounds, including many with known clinical uses. But, some recent studies have questioned the reliability of this data, and the best approach to use this resource for drug repositioning is not well established.
Methods: Here, we develop a novel generalizable approach by hypothesizing that new treatments for a disease should induce similar gene expression to existing treatments for a disease. Using the Drug Repurposing Hub compendium of known treatments, we formulate a combined logistic regression model to predict new drug indications, and we assess generalizability of our findings using independent clinical trials on experimental drug uses.
Results: We support the hypothesis that drugs sharing an indication induce more similar gene expression, additionally demonstrating that the simpler Spearman correlation (p = 7.71e-38), outperforms the popular Connectivity Score (p = 5.2e-6). Our final model, combining predicted drug indications across three diverse cell lines, generalizes to predict experimental clinical trials with AUC of 0.708.
Conclusions: By developing a new approach to using LINCS L1000 data for drug repositioning, we both propose plausible new disease treatments and provide an interpretable rationale for predictions. Our findings not only put forward new drug repositioning candidates, browseable at https://bsultan.shinyapps.io/web-app , but they also provide guidelines for future researchers employing L1000 data for drug repurposing.
期刊介绍:
Human Genomics is a peer-reviewed, open access, online journal that focuses on the application of genomic analysis in all aspects of human health and disease, as well as genomic analysis of drug efficacy and safety, and comparative genomics.
Topics covered by the journal include, but are not limited to: pharmacogenomics, genome-wide association studies, genome-wide sequencing, exome sequencing, next-generation deep-sequencing, functional genomics, epigenomics, translational genomics, expression profiling, proteomics, bioinformatics, animal models, statistical genetics, genetic epidemiology, human population genetics and comparative genomics.