A novel ligand-based convolutional neural network for identification of P-glycoprotein ligands in drug discovery.

IF 3.8 2区化学 Q2 CHEMISTRY, APPLIED

Molecular Diversity Pub Date : 2025-07-25 DOI:10.1007/s11030-025-11301-8

Mary Margarat Valentine A Neela, Subbarao Peram

{"title":"A novel ligand-based convolutional neural network for identification of P-glycoprotein ligands in drug discovery.","authors":"Mary Margarat Valentine A Neela, Subbarao Peram","doi":"10.1007/s11030-025-11301-8","DOIUrl":null,"url":null,"abstract":"<p><p>P-glycoprotein (P-gp) is a crucial drug transporter in several drug-resistant cases that are serious challenges in drug delivery and cancer treatment. Existing computational approaches mostly depended on small datasets for predicting P-gp interactions. To overcome these limitations, this paper proposes a Novel Ligand-based Convolutional Neural Network (NLCNN) framework to classify and predict P-gp substrates with high accuracy. The model is trained on a curated dataset of 197 P-gp substrates, integrating molecular docking and ligand-based deep learning methods for further predictive improvement. Experimental evaluations show that the NLCNN, on average, achieves prediction accuracy of 80%, which is 19-24% higher in precision and recall metrics compared to conventional Convolutional Neural Networks (CNNs) and Support Vector Machines (SVMs). Here, CNN is considered as the major model, whereas the SVM is emphasized as a baseline classifier. The proposed FLCNN algorithm obtains a noticeable accuracy, thus outperforming conventional SVM with a Gaussian RBF kernel. Moreover, by using the X-ray structure of mouse P-gp as a template, a homology model of human P-gp permits accurate molecular docking analysis. The proposed model is implemented in drug discovery and personalized medicine for P-gp interaction prediction. This is a landmark achievement in computational pharmacology as it puts out a powerful, accurate, and simple tool for determining P-gp inhibitors and substrates.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-025-11301-8","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}

引用次数: 0

Abstract

P-glycoprotein (P-gp) is a crucial drug transporter in several drug-resistant cases that are serious challenges in drug delivery and cancer treatment. Existing computational approaches mostly depended on small datasets for predicting P-gp interactions. To overcome these limitations, this paper proposes a Novel Ligand-based Convolutional Neural Network (NLCNN) framework to classify and predict P-gp substrates with high accuracy. The model is trained on a curated dataset of 197 P-gp substrates, integrating molecular docking and ligand-based deep learning methods for further predictive improvement. Experimental evaluations show that the NLCNN, on average, achieves prediction accuracy of 80%, which is 19-24% higher in precision and recall metrics compared to conventional Convolutional Neural Networks (CNNs) and Support Vector Machines (SVMs). Here, CNN is considered as the major model, whereas the SVM is emphasized as a baseline classifier. The proposed FLCNN algorithm obtains a noticeable accuracy, thus outperforming conventional SVM with a Gaussian RBF kernel. Moreover, by using the X-ray structure of mouse P-gp as a template, a homology model of human P-gp permits accurate molecular docking analysis. The proposed model is implemented in drug discovery and personalized medicine for P-gp interaction prediction. This is a landmark achievement in computational pharmacology as it puts out a powerful, accurate, and simple tool for determining P-gp inhibitors and substrates.

查看原文本刊更多论文

一种新的基于配体的卷积神经网络用于药物发现中p -糖蛋白配体的识别。

p -糖蛋白（P-gp）是一些耐药病例中至关重要的药物转运体，是药物传递和癌症治疗的重大挑战。现有的计算方法大多依赖于小数据集来预测P-gp相互作用。为了克服这些限制，本文提出了一种新的基于配体的卷积神经网络（NLCNN）框架，以高精度地分类和预测P-gp底物。该模型在197个P-gp底物的精选数据集上进行训练，整合了分子对接和基于配体的深度学习方法，以进一步提高预测能力。实验评估表明，NLCNN平均达到80%的预测准确率，与传统的卷积神经网络（cnn）和支持向量机（svm）相比，在精度和召回率指标上提高了19-24%。在这里，CNN被认为是主要的模型，而SVM被强调为基线分类器。本文提出的FLCNN算法具有明显的精度，优于传统的基于高斯RBF核的支持向量机。此外，通过使用小鼠P-gp的x射线结构作为模板，建立了人类P-gp的同源模型，可以进行精确的分子对接分析。该模型可用于药物发现和个性化医疗中P-gp相互作用的预测。这是计算药理学的一个里程碑式的成就，因为它提出了一个强大的，准确的，简单的工具来确定P-gp抑制剂和底物。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Molecular Diversity 化学-化学综合

CiteScore

7.30

自引率

7.90%

发文量

219

审稿时长

2.7 months

期刊介绍： Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including: combinatorial chemistry and parallel synthesis; small molecule libraries; microwave synthesis; flow synthesis; fluorous synthesis; diversity oriented synthesis (DOS); nanoreactors; click chemistry; multiplex technologies; fragment- and ligand-based design; structure/function/SAR; computational chemistry and molecular design; chemoinformatics; screening techniques and screening interfaces; analytical and purification methods; robotics, automation and miniaturization; targeted libraries; display libraries; peptides and peptoids; proteins; oligonucleotides; carbohydrates; natural diversity; new methods of library formulation and deconvolution; directed evolution, origin of life and recombination; search techniques, landscapes, random chemistry and more;