Nada J Daood, Sean R Carey, Elena Chung, Tong Wang, Anna Kreutz, Mounika Girireddy, Suman Chakravarti, Nicole C Kleinstreuer, Jacqueline B Tiley, Lauren M Aleksunes, Hao Zhu
{"title":"Machine Learning Modeling for ABC Transporter Efflux and Inhibition: Data Curation, Model Development, and New Compound Interaction Predictions.","authors":"Nada J Daood, Sean R Carey, Elena Chung, Tong Wang, Anna Kreutz, Mounika Girireddy, Suman Chakravarti, Nicole C Kleinstreuer, Jacqueline B Tiley, Lauren M Aleksunes, Hao Zhu","doi":"10.1021/acs.molpharmaceut.5c01065","DOIUrl":null,"url":null,"abstract":"<p><p>In recent years, multiple computational studies have used machine learning models to predict substrate binding and inhibition of ATP-binding cassette (ABC) transporters. However, many of these studies relied on relatively small training sets with limited applicability. In this study, we manually curated over 24,000 bioactivity records (i.e., inhibition, binding affinity, permeability) for the ABC transporters P-gp, BCRP, MRP1, and MRP2 from more than 900 literature sources in ChEMBL, with additional data from PubChem and Metrabase. This effort yielded eight data sets, comprising around 8800 unique chemicals with one or more substrate binding or inhibition activities for these four efflux transporters. Quantitative structure-activity relationship (QSAR) models were developed for each of the eight data sets using combinations of four machine learning algorithms and three sets of chemical descriptors. The resulting models demonstrated excellent performance by 5-fold cross-validation, achieving an average correct classification rate (CCR) of 0.764 for the substrate binding models and 0.839 for the inhibition models. Models were validated with additional compounds from DrugBank that were known substrates or inhibitors. We further analyzed how model predictions for efflux transporter activity could estimate exposure of the brain to xenobiotics. Notably, compounds predicted as P-gp and BCRP substrates were twice or more likely to have low brain exposure compared to compounds with high brain exposure. This study provides a large and curated drug transporter binding and inhibition database for computational modeling. Applicable models based on this large database for predicting transporter substrate binding and inhibition can be used to evaluate more complex drug bioactivities, such as exposure of protected tissues to chemicals.</p>","PeriodicalId":52,"journal":{"name":"Molecular Pharmaceutics","volume":" ","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Pharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.molpharmaceut.5c01065","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, multiple computational studies have used machine learning models to predict substrate binding and inhibition of ATP-binding cassette (ABC) transporters. However, many of these studies relied on relatively small training sets with limited applicability. In this study, we manually curated over 24,000 bioactivity records (i.e., inhibition, binding affinity, permeability) for the ABC transporters P-gp, BCRP, MRP1, and MRP2 from more than 900 literature sources in ChEMBL, with additional data from PubChem and Metrabase. This effort yielded eight data sets, comprising around 8800 unique chemicals with one or more substrate binding or inhibition activities for these four efflux transporters. Quantitative structure-activity relationship (QSAR) models were developed for each of the eight data sets using combinations of four machine learning algorithms and three sets of chemical descriptors. The resulting models demonstrated excellent performance by 5-fold cross-validation, achieving an average correct classification rate (CCR) of 0.764 for the substrate binding models and 0.839 for the inhibition models. Models were validated with additional compounds from DrugBank that were known substrates or inhibitors. We further analyzed how model predictions for efflux transporter activity could estimate exposure of the brain to xenobiotics. Notably, compounds predicted as P-gp and BCRP substrates were twice or more likely to have low brain exposure compared to compounds with high brain exposure. This study provides a large and curated drug transporter binding and inhibition database for computational modeling. Applicable models based on this large database for predicting transporter substrate binding and inhibition can be used to evaluate more complex drug bioactivities, such as exposure of protected tissues to chemicals.
期刊介绍:
Molecular Pharmaceutics publishes the results of original research that contributes significantly to the molecular mechanistic understanding of drug delivery and drug delivery systems. The journal encourages contributions describing research at the interface of drug discovery and drug development.
Scientific areas within the scope of the journal include physical and pharmaceutical chemistry, biochemistry and biophysics, molecular and cellular biology, and polymer and materials science as they relate to drug and drug delivery system efficacy. Mechanistic Drug Delivery and Drug Targeting research on modulating activity and efficacy of a drug or drug product is within the scope of Molecular Pharmaceutics. Theoretical and experimental peer-reviewed research articles, communications, reviews, and perspectives are welcomed.