{"title":"Uncertainty-Aware Deep Learning and Structural Feature Analysis for Reliable Nephrotoxicity Prediction","authors":"Jian-Wang Liu, Ke-Yi Liu, You-Chao Deng, Shao-Hua Shi, Xiang-Zheng Fu, Yue-Ping Jiang, Jing Fang, Qing Zhang, De-Jun Jiang*, Shao Liu* and Dong-Sheng Cao*, ","doi":"10.1021/acs.jcim.5c01532","DOIUrl":null,"url":null,"abstract":"<p >Nephrotoxicity remains a critical safety concern in drug development and clinical practice. Despite their significance, existing computational models for nephrotoxicity prediction face challenges related to limited precision and reliability. To address these challenges, this study constructed the largest publicly available database to date, comprising 1831 high-quality nephrotoxicity-related compounds. Using this dataset, we developed classification models employing both traditional machine learning algorithms and graph-based deep learning methods. Our results demonstrate that the Directed Message Passing Neural Network model, combined with molecular graphs and ChemoPy2D descriptors, outperformed other models, achieving a mean Kappa value of 70.3%. To improve the reliability of the model, we implemented an uncertainty quantification method to define the model’s applicability domain and quantify the confidence of prediction. On the representative model, this approach significantly enhanced predictive performance within the applicability domain, yielding a Kappa metric of 90.4%. Notably, our model also achieved the highest performance on an external test set compared to existing models. Finally, we performed multiscale feature analysis to provide actionable insights for safer drug design. This analysis integrated dataset -centric methods to identify structural alerts and beneficial transformations, alongside model-centric techniques to reveal the key global and local features driving nephrotoxicity. The established prediction models, combined with uncertainty quantification and structural feature insights derived in this study, offer valuable tools for the development of safer and more effective drugs.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"65 17","pages":"9082–9096"},"PeriodicalIF":5.3000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jcim.5c01532","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
Nephrotoxicity remains a critical safety concern in drug development and clinical practice. Despite their significance, existing computational models for nephrotoxicity prediction face challenges related to limited precision and reliability. To address these challenges, this study constructed the largest publicly available database to date, comprising 1831 high-quality nephrotoxicity-related compounds. Using this dataset, we developed classification models employing both traditional machine learning algorithms and graph-based deep learning methods. Our results demonstrate that the Directed Message Passing Neural Network model, combined with molecular graphs and ChemoPy2D descriptors, outperformed other models, achieving a mean Kappa value of 70.3%. To improve the reliability of the model, we implemented an uncertainty quantification method to define the model’s applicability domain and quantify the confidence of prediction. On the representative model, this approach significantly enhanced predictive performance within the applicability domain, yielding a Kappa metric of 90.4%. Notably, our model also achieved the highest performance on an external test set compared to existing models. Finally, we performed multiscale feature analysis to provide actionable insights for safer drug design. This analysis integrated dataset -centric methods to identify structural alerts and beneficial transformations, alongside model-centric techniques to reveal the key global and local features driving nephrotoxicity. The established prediction models, combined with uncertainty quantification and structural feature insights derived in this study, offer valuable tools for the development of safer and more effective drugs.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.