Adam Yasgar, , , Sankalp Jain, , , Marissa Davies, , , Carina Danchik, , , Taylor Niehoff, , , Jing Ran, , , Ganesha Rai, , , Shyh-Ming Yang, , , Anton Simeonov*, , , Alexey V. Zakharov*, , and , Natalia J. Martinez*,
{"title":"Integrated Approach of Machine Learning and High-Throughput Screening to Identify Chemical Probe Candidates Targeting Aldehyde Dehydrogenases","authors":"Adam Yasgar, , , Sankalp Jain, , , Marissa Davies, , , Carina Danchik, , , Taylor Niehoff, , , Jing Ran, , , Ganesha Rai, , , Shyh-Ming Yang, , , Anton Simeonov*, , , Alexey V. Zakharov*, , and , Natalia J. Martinez*, ","doi":"10.1021/acsptsci.5c00395","DOIUrl":null,"url":null,"abstract":"<p >Selective chemical probes are essential for dissecting biological pathways and advancing drug discovery, yet developing high-quality probes for targets such as the aldehyde dehydrogenase (ALDH) family remains challenging. Here, we present a novel integrated approach combining experimental quantitative high-throughput screening (qHTS) with advanced machine learning (ML) and pharmacophore (PH4) modeling to rapidly identify selective inhibitors across multiple ALDH isoforms. We screened ∼13,000 annotated compounds against biochemical and cellular assays. We then utilized the data set to build ML and PH4 models to virtually screen a larger set of 174,000 compounds to enhance the chemical diversity of hits. This approach led to the expansion of chemically diverse isoform-selective inhibitors that are potent in both biochemical and cell-based assays. Validation through cellular target engagement assays further confirmed the selective activity of these compounds, leading to the discovery of ALDH1A2, ALDH1A3, ALDH2, and ALDH3A1 chemical probe candidates. Remarkably, this was achieved by employing just a single iteration of quantitative structure–activity relationship (QSAR) and PH4 modeling for virtual screening. This combined <i>in vitro</i> and <i>in silico</i> strategy not only enhances the discovery of biologically relevant chemical probe candidates but also significantly expands the chemical diversity accessible for probe development, establishing a new platform for the rapid and resource-efficient identification of chemical probes against the ALDH enzyme family. The data set generated, including hundreds of compounds thoroughly characterized across a spectrum of assays, is publicly available and can serve as a high-quality training set for future research initiatives and probe development efforts.</p>","PeriodicalId":36426,"journal":{"name":"ACS Pharmacology and Translational Science","volume":"8 10","pages":"3568–3584"},"PeriodicalIF":3.7000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Pharmacology and Translational Science","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsptsci.5c00395","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
Selective chemical probes are essential for dissecting biological pathways and advancing drug discovery, yet developing high-quality probes for targets such as the aldehyde dehydrogenase (ALDH) family remains challenging. Here, we present a novel integrated approach combining experimental quantitative high-throughput screening (qHTS) with advanced machine learning (ML) and pharmacophore (PH4) modeling to rapidly identify selective inhibitors across multiple ALDH isoforms. We screened ∼13,000 annotated compounds against biochemical and cellular assays. We then utilized the data set to build ML and PH4 models to virtually screen a larger set of 174,000 compounds to enhance the chemical diversity of hits. This approach led to the expansion of chemically diverse isoform-selective inhibitors that are potent in both biochemical and cell-based assays. Validation through cellular target engagement assays further confirmed the selective activity of these compounds, leading to the discovery of ALDH1A2, ALDH1A3, ALDH2, and ALDH3A1 chemical probe candidates. Remarkably, this was achieved by employing just a single iteration of quantitative structure–activity relationship (QSAR) and PH4 modeling for virtual screening. This combined in vitro and in silico strategy not only enhances the discovery of biologically relevant chemical probe candidates but also significantly expands the chemical diversity accessible for probe development, establishing a new platform for the rapid and resource-efficient identification of chemical probes against the ALDH enzyme family. The data set generated, including hundreds of compounds thoroughly characterized across a spectrum of assays, is publicly available and can serve as a high-quality training set for future research initiatives and probe development efforts.
期刊介绍:
ACS Pharmacology & Translational Science publishes high quality, innovative, and impactful research across the broad spectrum of biological sciences, covering basic and molecular sciences through to translational preclinical studies. Clinical studies that address novel mechanisms of action, and methodological papers that provide innovation, and advance translation, will also be considered. We give priority to studies that fully integrate basic pharmacological and/or biochemical findings into physiological processes that have translational potential in a broad range of biomedical disciplines. Therefore, studies that employ a complementary blend of in vitro and in vivo systems are of particular interest to the journal. Nonetheless, all innovative and impactful research that has an articulated translational relevance will be considered.
ACS Pharmacology & Translational Science does not publish research on biological extracts that have unknown concentration or unknown chemical composition.
Authors are encouraged to use the pre-submission inquiry mechanism to ensure relevance and appropriateness of research.