Dushuo Feng, Lulu Guan, Yunxiang Sun, Bote Qi, Yu Zou
{"title":"Interpretable Multimodal Deep Ensemble Framework Dissecting Bloodbrain Barrier Permeability with Molecular Features.","authors":"Dushuo Feng, Lulu Guan, Yunxiang Sun, Bote Qi, Yu Zou","doi":"10.1021/acs.jpclett.5c01077","DOIUrl":null,"url":null,"abstract":"<p><p>Blood-brain barrier permeability (BBBP) prediction plays a critical role in the drug discovery process, particularly for compounds targeting the central nervous system. While machine learning (ML) has significantly advanced the prediction of BBBP, there remains an urgent need for interpretable ML models that can reveal the physicochemical principles governing BBB permeability. In this study, we propose a multimodal ML framework that integrates molecular fingerprints (Morgan, MACCS, RDK) and image features to improve BBBP prediction. The classification task (BBB-permeable vs nonpermeable) is addressed with a stacking ensemble model combining multiple base classifiers. The proposed framework demonstrates competitive predictive stability, generalization ability, and feature interpretability compared with recent approaches, under comparable evaluation settings. Beyond predictive performance, our framework incorporates Principal Component Analysis (PCA) and Shapley Additive Explanations (SHAP) analysis to highlight key fingerprint features contributing to predictions. The regression task (logBB value prediction) is tackled by a multi-input deep learning framework, incorporating a Transformer encoder for fingerprint processing, a convolutional neural network (CNN) for image feature extraction, and a Multi-Head Attention fusion mechanism to enhance feature interactions. Attention maps derived from the multimodal features reveal token-level relationships within molecular representations. This work provides an interpretable framework for modeling BBBP with enhanced transparency and mechanistic insight and lays the foundation for future studies incorporating transparent descriptors and physics-informed features.</p>","PeriodicalId":62,"journal":{"name":"The Journal of Physical Chemistry Letters","volume":" ","pages":"5806-5819"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Physical Chemistry Letters","FirstCategoryId":"1","ListUrlMain":"https://doi.org/10.1021/acs.jpclett.5c01077","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/3 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Blood-brain barrier permeability (BBBP) prediction plays a critical role in the drug discovery process, particularly for compounds targeting the central nervous system. While machine learning (ML) has significantly advanced the prediction of BBBP, there remains an urgent need for interpretable ML models that can reveal the physicochemical principles governing BBB permeability. In this study, we propose a multimodal ML framework that integrates molecular fingerprints (Morgan, MACCS, RDK) and image features to improve BBBP prediction. The classification task (BBB-permeable vs nonpermeable) is addressed with a stacking ensemble model combining multiple base classifiers. The proposed framework demonstrates competitive predictive stability, generalization ability, and feature interpretability compared with recent approaches, under comparable evaluation settings. Beyond predictive performance, our framework incorporates Principal Component Analysis (PCA) and Shapley Additive Explanations (SHAP) analysis to highlight key fingerprint features contributing to predictions. The regression task (logBB value prediction) is tackled by a multi-input deep learning framework, incorporating a Transformer encoder for fingerprint processing, a convolutional neural network (CNN) for image feature extraction, and a Multi-Head Attention fusion mechanism to enhance feature interactions. Attention maps derived from the multimodal features reveal token-level relationships within molecular representations. This work provides an interpretable framework for modeling BBBP with enhanced transparency and mechanistic insight and lays the foundation for future studies incorporating transparent descriptors and physics-informed features.
期刊介绍:
The Journal of Physical Chemistry (JPC) Letters is devoted to reporting new and original experimental and theoretical basic research of interest to physical chemists, biophysical chemists, chemical physicists, physicists, material scientists, and engineers. An important criterion for acceptance is that the paper reports a significant scientific advance and/or physical insight such that rapid publication is essential. Two issues of JPC Letters are published each month.