ABC转运体外排和抑制的机器学习建模:数据管理,模型开发和新的化合物相互作用预测。

IF 4.5 2区 医学 Q2 MEDICINE, RESEARCH & EXPERIMENTAL
Nada J Daood, Sean R Carey, Elena Chung, Tong Wang, Anna Kreutz, Mounika Girireddy, Suman Chakravarti, Nicole C Kleinstreuer, Jacqueline B Tiley, Lauren M Aleksunes, Hao Zhu
{"title":"ABC转运体外排和抑制的机器学习建模:数据管理,模型开发和新的化合物相互作用预测。","authors":"Nada J Daood, Sean R Carey, Elena Chung, Tong Wang, Anna Kreutz, Mounika Girireddy, Suman Chakravarti, Nicole C Kleinstreuer, Jacqueline B Tiley, Lauren M Aleksunes, Hao Zhu","doi":"10.1021/acs.molpharmaceut.5c01065","DOIUrl":null,"url":null,"abstract":"<p><p>In recent years, multiple computational studies have used machine learning models to predict substrate binding and inhibition of ATP-binding cassette (ABC) transporters. However, many of these studies relied on relatively small training sets with limited applicability. In this study, we manually curated over 24,000 bioactivity records (i.e., inhibition, binding affinity, permeability) for the ABC transporters P-gp, BCRP, MRP1, and MRP2 from more than 900 literature sources in ChEMBL, with additional data from PubChem and Metrabase. This effort yielded eight data sets, comprising around 8800 unique chemicals with one or more substrate binding or inhibition activities for these four efflux transporters. Quantitative structure-activity relationship (QSAR) models were developed for each of the eight data sets using combinations of four machine learning algorithms and three sets of chemical descriptors. The resulting models demonstrated excellent performance by 5-fold cross-validation, achieving an average correct classification rate (CCR) of 0.764 for the substrate binding models and 0.839 for the inhibition models. Models were validated with additional compounds from DrugBank that were known substrates or inhibitors. We further analyzed how model predictions for efflux transporter activity could estimate exposure of the brain to xenobiotics. Notably, compounds predicted as P-gp and BCRP substrates were twice or more likely to have low brain exposure compared to compounds with high brain exposure. This study provides a large and curated drug transporter binding and inhibition database for computational modeling. Applicable models based on this large database for predicting transporter substrate binding and inhibition can be used to evaluate more complex drug bioactivities, such as exposure of protected tissues to chemicals.</p>","PeriodicalId":52,"journal":{"name":"Molecular Pharmaceutics","volume":" ","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Modeling for ABC Transporter Efflux and Inhibition: Data Curation, Model Development, and New Compound Interaction Predictions.\",\"authors\":\"Nada J Daood, Sean R Carey, Elena Chung, Tong Wang, Anna Kreutz, Mounika Girireddy, Suman Chakravarti, Nicole C Kleinstreuer, Jacqueline B Tiley, Lauren M Aleksunes, Hao Zhu\",\"doi\":\"10.1021/acs.molpharmaceut.5c01065\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In recent years, multiple computational studies have used machine learning models to predict substrate binding and inhibition of ATP-binding cassette (ABC) transporters. However, many of these studies relied on relatively small training sets with limited applicability. In this study, we manually curated over 24,000 bioactivity records (i.e., inhibition, binding affinity, permeability) for the ABC transporters P-gp, BCRP, MRP1, and MRP2 from more than 900 literature sources in ChEMBL, with additional data from PubChem and Metrabase. This effort yielded eight data sets, comprising around 8800 unique chemicals with one or more substrate binding or inhibition activities for these four efflux transporters. Quantitative structure-activity relationship (QSAR) models were developed for each of the eight data sets using combinations of four machine learning algorithms and three sets of chemical descriptors. The resulting models demonstrated excellent performance by 5-fold cross-validation, achieving an average correct classification rate (CCR) of 0.764 for the substrate binding models and 0.839 for the inhibition models. Models were validated with additional compounds from DrugBank that were known substrates or inhibitors. We further analyzed how model predictions for efflux transporter activity could estimate exposure of the brain to xenobiotics. Notably, compounds predicted as P-gp and BCRP substrates were twice or more likely to have low brain exposure compared to compounds with high brain exposure. This study provides a large and curated drug transporter binding and inhibition database for computational modeling. Applicable models based on this large database for predicting transporter substrate binding and inhibition can be used to evaluate more complex drug bioactivities, such as exposure of protected tissues to chemicals.</p>\",\"PeriodicalId\":52,\"journal\":{\"name\":\"Molecular Pharmaceutics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Pharmaceutics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.molpharmaceut.5c01065\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Pharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.molpharmaceut.5c01065","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

近年来,许多计算研究使用机器学习模型来预测底物结合和atp结合盒(ABC)转运体的抑制。然而,这些研究中的许多依赖于相对较小的训练集,适用性有限。在这项研究中,我们从ChEMBL中900多篇文献中手动整理了24000多条ABC转运蛋白P-gp、BCRP、MRP1和MRP2的生物活性记录(即抑制、结合亲和力、渗透性),并从PubChem和Metrabase中获取了额外的数据。这项工作产生了8个数据集,包括大约8800种独特的化学物质,对这四种外排转运蛋白具有一种或多种底物结合或抑制活性。使用四种机器学习算法和三组化学描述符的组合,为八个数据集中的每个数据集开发了定量结构-活性关系(QSAR)模型。经5倍交叉验证,所得模型表现出优异的性能,底物结合模型的平均正确分类率(CCR)为0.764,抑制模型的平均正确分类率为0.839。使用来自DrugBank的已知底物或抑制剂的其他化合物验证模型。我们进一步分析了外排转运体活动的模型预测如何能够估计大脑对外源性药物的暴露。值得注意的是,预测为P-gp和BCRP底物的化合物与高脑暴露化合物相比,低脑暴露的可能性是其两倍或更多。这项研究为计算建模提供了一个大的和精心策划的药物转运体结合和抑制数据库。基于这个大型数据库预测转运体底物结合和抑制的适用模型可用于评估更复杂的药物生物活性,例如受保护组织暴露于化学品。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning Modeling for ABC Transporter Efflux and Inhibition: Data Curation, Model Development, and New Compound Interaction Predictions.

In recent years, multiple computational studies have used machine learning models to predict substrate binding and inhibition of ATP-binding cassette (ABC) transporters. However, many of these studies relied on relatively small training sets with limited applicability. In this study, we manually curated over 24,000 bioactivity records (i.e., inhibition, binding affinity, permeability) for the ABC transporters P-gp, BCRP, MRP1, and MRP2 from more than 900 literature sources in ChEMBL, with additional data from PubChem and Metrabase. This effort yielded eight data sets, comprising around 8800 unique chemicals with one or more substrate binding or inhibition activities for these four efflux transporters. Quantitative structure-activity relationship (QSAR) models were developed for each of the eight data sets using combinations of four machine learning algorithms and three sets of chemical descriptors. The resulting models demonstrated excellent performance by 5-fold cross-validation, achieving an average correct classification rate (CCR) of 0.764 for the substrate binding models and 0.839 for the inhibition models. Models were validated with additional compounds from DrugBank that were known substrates or inhibitors. We further analyzed how model predictions for efflux transporter activity could estimate exposure of the brain to xenobiotics. Notably, compounds predicted as P-gp and BCRP substrates were twice or more likely to have low brain exposure compared to compounds with high brain exposure. This study provides a large and curated drug transporter binding and inhibition database for computational modeling. Applicable models based on this large database for predicting transporter substrate binding and inhibition can be used to evaluate more complex drug bioactivities, such as exposure of protected tissues to chemicals.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Molecular Pharmaceutics
Molecular Pharmaceutics 医学-药学
CiteScore
8.00
自引率
6.10%
发文量
391
审稿时长
2 months
期刊介绍: Molecular Pharmaceutics publishes the results of original research that contributes significantly to the molecular mechanistic understanding of drug delivery and drug delivery systems. The journal encourages contributions describing research at the interface of drug discovery and drug development. Scientific areas within the scope of the journal include physical and pharmaceutical chemistry, biochemistry and biophysics, molecular and cellular biology, and polymer and materials science as they relate to drug and drug delivery system efficacy. Mechanistic Drug Delivery and Drug Targeting research on modulating activity and efficacy of a drug or drug product is within the scope of Molecular Pharmaceutics. Theoretical and experimental peer-reviewed research articles, communications, reviews, and perspectives are welcomed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信