Aga Basit Iqbal , Assif Assad , Basharat Bhat , Muzafar A. Macha , Syed Zubair Ahmad Shah
{"title":"MPNN-CWExplainer: An enhanced deep learning framework for HIV drug bioactivity prediction with class-weighted loss and explainability","authors":"Aga Basit Iqbal , Assif Assad , Basharat Bhat , Muzafar A. Macha , Syed Zubair Ahmad Shah","doi":"10.1016/j.lfs.2025.123835","DOIUrl":null,"url":null,"abstract":"<div><h3>Aims</h3><div>Human Immunodeficiency Virus (HIV) remains a critical global health concern due to its impact on the immune system and its progression to Acquired Immunodeficiency Syndrome (AIDS) if untreated. While antiretroviral therapy has advanced significantly, challenges such as drug resistance, adverse effects, and viral mutation necessitate the development of novel therapeutic strategies. This study aims to improve HIV bioactivity prediction and provide interpretable insights into molecular determinants influencing bioactivity.</div></div><div><h3>Materials and methods</h3><div>We propose MPNN-CWExplainer, a novel graph-based deep learning framework for molecular property prediction. The model integrates a Message Passing Neural Network (MPNN) with a class-weighted loss function to effectively address class imbalance in HIV datasets. Furthermore, GNNExplainer is incorporated to provide post-hoc interpretability by identifying key atom- and bond-level substructures contributing to model predictions. Model robustness is ensured through Bayesian hyperparameter optimization and multiple independent runs.</div></div><div><h3>Key findings</h3><div>MPNN-CWExplainer achieved state-of-the-art predictive performance on the HIV dataset, with an AUC-ROC of 87.631 % and AUC-PRC of 86.02 %, surpassing existing baseline models. The class-weighted approach enhanced minority class representation, and GNNExplainer successfully highlighted chemically meaningful substructures correlating with bioactivity.</div></div><div><h3>Significance</h3><div>The proposed framework not only improves prediction accuracy for HIV bioactivity but also enhances transparency and interpretability, crucial for medicinal chemists in understanding model behaviour. MPNN-CWExplainer serves as a robust and interpretable tool for computational drug discovery, supporting informed decision-making in lead optimization and molecular design.</div></div>","PeriodicalId":18122,"journal":{"name":"Life sciences","volume":"378 ","pages":"Article 123835"},"PeriodicalIF":5.2000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Life sciences","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0024320525004709","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Aims
Human Immunodeficiency Virus (HIV) remains a critical global health concern due to its impact on the immune system and its progression to Acquired Immunodeficiency Syndrome (AIDS) if untreated. While antiretroviral therapy has advanced significantly, challenges such as drug resistance, adverse effects, and viral mutation necessitate the development of novel therapeutic strategies. This study aims to improve HIV bioactivity prediction and provide interpretable insights into molecular determinants influencing bioactivity.
Materials and methods
We propose MPNN-CWExplainer, a novel graph-based deep learning framework for molecular property prediction. The model integrates a Message Passing Neural Network (MPNN) with a class-weighted loss function to effectively address class imbalance in HIV datasets. Furthermore, GNNExplainer is incorporated to provide post-hoc interpretability by identifying key atom- and bond-level substructures contributing to model predictions. Model robustness is ensured through Bayesian hyperparameter optimization and multiple independent runs.
Key findings
MPNN-CWExplainer achieved state-of-the-art predictive performance on the HIV dataset, with an AUC-ROC of 87.631 % and AUC-PRC of 86.02 %, surpassing existing baseline models. The class-weighted approach enhanced minority class representation, and GNNExplainer successfully highlighted chemically meaningful substructures correlating with bioactivity.
Significance
The proposed framework not only improves prediction accuracy for HIV bioactivity but also enhances transparency and interpretability, crucial for medicinal chemists in understanding model behaviour. MPNN-CWExplainer serves as a robust and interpretable tool for computational drug discovery, supporting informed decision-making in lead optimization and molecular design.
期刊介绍:
Life Sciences is an international journal publishing articles that emphasize the molecular, cellular, and functional basis of therapy. The journal emphasizes the understanding of mechanism that is relevant to all aspects of human disease and translation to patients. All articles are rigorously reviewed.
The Journal favors publication of full-length papers where modern scientific technologies are used to explain molecular, cellular and physiological mechanisms. Articles that merely report observations are rarely accepted. Recommendations from the Declaration of Helsinki or NIH guidelines for care and use of laboratory animals must be adhered to. Articles should be written at a level accessible to readers who are non-specialists in the topic of the article themselves, but who are interested in the research. The Journal welcomes reviews on topics of wide interest to investigators in the life sciences. We particularly encourage submission of brief, focused reviews containing high-quality artwork and require the use of mechanistic summary diagrams.