Oral bioavailability property prediction based on task similarity transfer learning.

IF 3.8 2区 化学 Q2 CHEMISTRY, APPLIED
Chen Zeng, Chengcheng Xu, Yingxu Liu, Yunya Jiang, Lidan Zheng, Yang Liu, Yanmin Zhang, Yadong Chen, Haichun Liu, Rui Gu
{"title":"Oral bioavailability property prediction based on task similarity transfer learning.","authors":"Chen Zeng, Chengcheng Xu, Yingxu Liu, Yunya Jiang, Lidan Zheng, Yang Liu, Yanmin Zhang, Yadong Chen, Haichun Liu, Rui Gu","doi":"10.1007/s11030-025-11345-w","DOIUrl":null,"url":null,"abstract":"<p><p>Drug absorption significantly influences pharmacokinetics. Accurately predicting human oral bioavailability (HOB) is essential for optimizing drug candidates and improving clinical success rates. The traditional method based on experiment is a common way to obtain HOB, but the experimental method is time-consuming and costly. Recently, using AI models to predict ADMET properties has become a new and effective method. However, this method has some data dependence problems. To address this issue, we combine physicochemical properties with graph-based deep learning methods to improve HOB prediction, providing an efficient and interpretable alternative to traditional experimental and computational approaches for ADMET property studies in data-scarce scenarios. We propose a similarity-guided transfer learning framework, Task Similarity-guided Transfer Learning based on Molecular Graphs (TS-GTL), which includes a deep learning model, PGnT (pKa Graph-based Knowledge-driven Transformer). PGnT incorporates common molecular descriptors as external knowledge to guide molecular graph representation, leveraging GNNs and Transformer encoders to enhance feature extraction. Additionally, we introduce MoTSE to quantify the similarity between physicochemical properties and HOB. Notably, training with data pretrained model on logD properties showed the best performance in transfer learning. TS-GTL also outperformed machine learning algorithms and deep learning predictive tools, underscoring the critical role of task similarity in transfer learning.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-025-11345-w","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

Drug absorption significantly influences pharmacokinetics. Accurately predicting human oral bioavailability (HOB) is essential for optimizing drug candidates and improving clinical success rates. The traditional method based on experiment is a common way to obtain HOB, but the experimental method is time-consuming and costly. Recently, using AI models to predict ADMET properties has become a new and effective method. However, this method has some data dependence problems. To address this issue, we combine physicochemical properties with graph-based deep learning methods to improve HOB prediction, providing an efficient and interpretable alternative to traditional experimental and computational approaches for ADMET property studies in data-scarce scenarios. We propose a similarity-guided transfer learning framework, Task Similarity-guided Transfer Learning based on Molecular Graphs (TS-GTL), which includes a deep learning model, PGnT (pKa Graph-based Knowledge-driven Transformer). PGnT incorporates common molecular descriptors as external knowledge to guide molecular graph representation, leveraging GNNs and Transformer encoders to enhance feature extraction. Additionally, we introduce MoTSE to quantify the similarity between physicochemical properties and HOB. Notably, training with data pretrained model on logD properties showed the best performance in transfer learning. TS-GTL also outperformed machine learning algorithms and deep learning predictive tools, underscoring the critical role of task similarity in transfer learning.

基于任务相似性迁移学习的口服生物利用度预测。
药物吸收显著影响药代动力学。准确预测人体口服生物利用度(HOB)对于优化候选药物和提高临床成功率至关重要。传统的基于实验的方法是获得滚边线的常用方法,但实验方法耗时长,成本高。近年来,利用人工智能模型预测ADMET性质已成为一种新的有效方法。但该方法存在数据依赖问题。为了解决这一问题,我们将物理化学性质与基于图的深度学习方法相结合,以改进HOB预测,为数据稀缺情况下ADMET性质研究提供了一种有效且可解释的替代方法,可以替代传统的实验和计算方法。我们提出了一个相似度引导迁移学习框架,基于分子图的任务相似度引导迁移学习(TS-GTL),其中包括一个深度学习模型PGnT (pKa Graph-based Knowledge-driven Transformer)。PGnT将常见的分子描述符作为外部知识来指导分子图表示,利用gnn和Transformer编码器来增强特征提取。此外,我们引入mose来量化物理化学性质与HOB之间的相似性。值得注意的是,基于logD属性的数据预训练模型在迁移学习中表现出最好的性能。TS-GTL还优于机器学习算法和深度学习预测工具,强调了任务相似性在迁移学习中的关键作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular Diversity
Molecular Diversity 化学-化学综合
CiteScore
7.30
自引率
7.90%
发文量
219
审稿时长
2.7 months
期刊介绍: Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including: combinatorial chemistry and parallel synthesis; small molecule libraries; microwave synthesis; flow synthesis; fluorous synthesis; diversity oriented synthesis (DOS); nanoreactors; click chemistry; multiplex technologies; fragment- and ligand-based design; structure/function/SAR; computational chemistry and molecular design; chemoinformatics; screening techniques and screening interfaces; analytical and purification methods; robotics, automation and miniaturization; targeted libraries; display libraries; peptides and peptoids; proteins; oligonucleotides; carbohydrates; natural diversity; new methods of library formulation and deconvolution; directed evolution, origin of life and recombination; search techniques, landscapes, random chemistry and more;
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信