Machine learning-driven bioavailability prediction in early-stage drug development: a KNIME-based computational workflow for digital health applications.

IF 1.2 4区医学 Q4 PHARMACOLOGY & PHARMACY

Xenobiotica Pub Date : 2025-05-01 Epub Date: 2025-05-28 DOI:10.1080/00498254.2025.2508804

Majdi Hammami, Walid Yeddes, Hamza Gadhoumi, Raghda Yazidi, Moufida Saidani Tounsi, Kamel Msaada

{"title":"Machine learning-driven bioavailability prediction in early-stage drug development: a KNIME-based computational workflow for digital health applications.","authors":"Majdi Hammami, Walid Yeddes, Hamza Gadhoumi, Raghda Yazidi, Moufida Saidani Tounsi, Kamel Msaada","doi":"10.1080/00498254.2025.2508804","DOIUrl":null,"url":null,"abstract":"Bioavailability prediction remains a significant challenge in early-stage drug development, where conventional experimental approaches are time-consuming and resource-intensive. This study explores the application of machine learning techniques to enhance the efficiency of bioavailability prediction. By leveraging computational workflows within the KNIME Analytics Platform, we aim to automate bioavailability assessment and reduce dependence on costly in vitro and in vivo studies.A dataset comprising 475 drug-like compounds characterised by key molecular descriptors was analysed using multiple machine learning models, including Random Forest, Gradient Boosting, Decision Trees, k-Nearest Neighbours, and neural networks. Model performance was assessed through 5-fold cross-validation, with ensemble models outperforming linear and neural network-based approaches. Random Forest demonstrated the highest predictive performance (R2 = 0.87, RMSE = 0.08). Feature importance analysis identified topological polar surface area and solubility as the most influential factors in bioavailability prediction.The findings underscore the potential of integrating open-source tools and machine learning methodologies in pharmaceutical research, improving workflow efficiency while adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. This approach facilitates rapid and cost-effective bioavailability assessment, supporting AI-driven predictive modelling and digital health applications in drug development.","PeriodicalId":23812,"journal":{"name":"Xenobiotica","volume":" ","pages":"339-348"},"PeriodicalIF":1.2000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Xenobiotica","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/00498254.2025.2508804","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/28 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

Abstract

Bioavailability prediction remains a significant challenge in early-stage drug development, where conventional experimental approaches are time-consuming and resource-intensive. This study explores the application of machine learning techniques to enhance the efficiency of bioavailability prediction. By leveraging computational workflows within the KNIME Analytics Platform, we aim to automate bioavailability assessment and reduce dependence on costly in vitro and in vivo studies.

A dataset comprising 475 drug-like compounds characterised by key molecular descriptors was analysed using multiple machine learning models, including Random Forest, Gradient Boosting, Decision Trees, k-Nearest Neighbours, and neural networks. Model performance was assessed through 5-fold cross-validation, with ensemble models outperforming linear and neural network-based approaches. Random Forest demonstrated the highest predictive performance (R² = 0.87, RMSE = 0.08). Feature importance analysis identified topological polar surface area and solubility as the most influential factors in bioavailability prediction.

The findings underscore the potential of integrating open-source tools and machine learning methodologies in pharmaceutical research, improving workflow efficiency while adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. This approach facilitates rapid and cost-effective bioavailability assessment, supporting AI-driven predictive modelling and digital health applications in drug development.

查看原文本刊更多论文

早期药物开发中机器学习驱动的生物利用度预测：用于数字健康应用的基于knime的计算工作流。

生物利用度预测仍然是早期药物开发的重大挑战，传统的实验方法耗时且资源密集。本研究探讨了应用机器学习技术来提高生物利用度预测的效率。通过利用KNIME分析平台内的计算工作流程，我们的目标是自动化生物利用度评估，减少对昂贵的体外和体内研究的依赖。使用多种机器学习模型，包括随机森林、梯度增强、决策树、k近邻和神经网络，分析了包含475种以关键分子描述符为特征的类药物化合物的数据集。通过5倍交叉验证评估模型性能，集成模型优于线性和基于神经网络的方法。随机森林显示出最高的预测性能（R2 = 0.87, RMSE = 0.08）。特征重要性分析发现拓扑极性表面积和溶解度是影响生物利用度预测的最重要因素。研究结果强调了在制药研究中集成开源工具和机器学习方法的潜力，在坚持FAIR（可查找、可访问、可互操作和可重用）数据原则的同时提高工作流程效率。这种方法促进了快速和具有成本效益的生物利用度评估，支持人工智能驱动的预测建模和药物开发中的数字健康应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Xenobiotica 医学-毒理学

CiteScore

3.80

自引率

5.60%

发文量

审稿时长

2 months

期刊介绍： Xenobiotica covers seven main areas, including:General Xenobiochemistry, including in vitro studies concerned with the metabolism, disposition and excretion of drugs, and other xenobiotics, as well as the structure, function and regulation of associated enzymesClinical Pharmacokinetics and Metabolism, covering the pharmacokinetics and absorption, distribution, metabolism and excretion of drugs and other xenobiotics in manAnimal Pharmacokinetics and Metabolism, covering the pharmacokinetics, and absorption, distribution, metabolism and excretion of drugs and other xenobiotics in animalsPharmacogenetics, defined as the identification and functional characterisation of polymorphic genes that encode xenobiotic metabolising enzymes and transporters that may result in altered enzymatic, cellular and clinical responses to xenobioticsMolecular Toxicology, concerning the mechanisms of toxicity and the study of toxicology of xenobiotics at the molecular levelXenobiotic Transporters, concerned with all aspects of the carrier proteins involved in the movement of xenobiotics into and out of cells, and their impact on pharmacokinetic behaviour in animals and manTopics in Xenobiochemistry, in the form of reviews and commentaries are primarily intended to be a critical analysis of the issue, wherein the author offers opinions on the relevance of data or of a particular experimental approach or methodology