{"title":"利用机器学习方法、最小实验值和物理化学描述符开发高精度组织到等离子体分配系数值的2D-QSAR模型。","authors":"Koichi Handa, Seishiro Sakamoto, Michiharu Kageyama, Takeshi Iijima","doi":"10.1007/s13318-023-00832-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The demand for physiologically based pharmacokinetic (PBPK) model is increasing currently. New drug application (NDA) of many compounds is submitted with PBPK models for efficient drug development. Tissue-to-plasma partition coefficient (K<sub>p</sub>) is a key parameter for the PBPK model to describe differential equations. However, it is difficult to obtain the K<sub>p</sub> value experimentally because the measurement of drug concentration in the tissue is much harder than that in plasma.</p><p><strong>Objective: </strong>Instead of experiments, many researchers have sought in silico methods. Today, most of the models for K<sub>p</sub> prediction are using in vitro and in vivo parameters as explanatory variables. We thought of physicochemical descriptors that could improve the predictability. Therefore, we aimed to develop the two-dimensional quantitative structure-activity relationship (2D-QSAR) model for K<sub>p</sub> using physicochemical descriptors instead of in vivo experimental data as explanatory variables.</p><p><strong>Methods: </strong>We compared our model with the conventional models using 20-fold cross-validation according to the published method (Yun et al. J Pharmacokinet Pharmacodyn 41:1-14, 2014). We used random forest algorithm, which is known to be one of the best predictors for the 2D-QSAR model. Finally, we combined minimum in vitro experimental values and physiochemical descriptors. Thus, the prediction method for K<sub>p</sub> value using a few in vitro parameters and physicochemical descriptors was developed; this is a multimodal model.</p><p><strong>Results: </strong>Its accuracy was found to be superior to that of the conventional models. Results of this research suggest that multimodality is useful for the 2D-QSAR model [RMSE and % of two-fold error: 0.66 and 42.2% (Berezohkovsky), 0.52 and 52.2% (Rodgers), 0.65 and 34.6% (Schmitt), 0.44 and 61.1% (published model), 0.41 and 62.1% (traditional model), 0.39 and 64.5% (multimodal model)].</p><p><strong>Conclusion: </strong>We could develop a 2D-QSAR model for K<sub>p</sub> value with the highest accuracy using a few in vitro experimental data and physicochemical descriptors.</p>","PeriodicalId":11939,"journal":{"name":"European Journal of Drug Metabolism and Pharmacokinetics","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Development of a 2D-QSAR Model for Tissue-to-Plasma Partition Coefficient Value with High Accuracy Using Machine Learning Method, Minimum Required Experimental Values, and Physicochemical Descriptors.\",\"authors\":\"Koichi Handa, Seishiro Sakamoto, Michiharu Kageyama, Takeshi Iijima\",\"doi\":\"10.1007/s13318-023-00832-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The demand for physiologically based pharmacokinetic (PBPK) model is increasing currently. New drug application (NDA) of many compounds is submitted with PBPK models for efficient drug development. Tissue-to-plasma partition coefficient (K<sub>p</sub>) is a key parameter for the PBPK model to describe differential equations. However, it is difficult to obtain the K<sub>p</sub> value experimentally because the measurement of drug concentration in the tissue is much harder than that in plasma.</p><p><strong>Objective: </strong>Instead of experiments, many researchers have sought in silico methods. Today, most of the models for K<sub>p</sub> prediction are using in vitro and in vivo parameters as explanatory variables. We thought of physicochemical descriptors that could improve the predictability. Therefore, we aimed to develop the two-dimensional quantitative structure-activity relationship (2D-QSAR) model for K<sub>p</sub> using physicochemical descriptors instead of in vivo experimental data as explanatory variables.</p><p><strong>Methods: </strong>We compared our model with the conventional models using 20-fold cross-validation according to the published method (Yun et al. J Pharmacokinet Pharmacodyn 41:1-14, 2014). We used random forest algorithm, which is known to be one of the best predictors for the 2D-QSAR model. Finally, we combined minimum in vitro experimental values and physiochemical descriptors. Thus, the prediction method for K<sub>p</sub> value using a few in vitro parameters and physicochemical descriptors was developed; this is a multimodal model.</p><p><strong>Results: </strong>Its accuracy was found to be superior to that of the conventional models. Results of this research suggest that multimodality is useful for the 2D-QSAR model [RMSE and % of two-fold error: 0.66 and 42.2% (Berezohkovsky), 0.52 and 52.2% (Rodgers), 0.65 and 34.6% (Schmitt), 0.44 and 61.1% (published model), 0.41 and 62.1% (traditional model), 0.39 and 64.5% (multimodal model)].</p><p><strong>Conclusion: </strong>We could develop a 2D-QSAR model for K<sub>p</sub> value with the highest accuracy using a few in vitro experimental data and physicochemical descriptors.</p>\",\"PeriodicalId\":11939,\"journal\":{\"name\":\"European Journal of Drug Metabolism and Pharmacokinetics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Drug Metabolism and Pharmacokinetics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13318-023-00832-w\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Drug Metabolism and Pharmacokinetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13318-023-00832-w","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 1
摘要
背景:目前对基于生理的药代动力学(PBPK)模型的需求日益增加。许多化合物的新药申请(NDA)都是通过PBPK模型提交的,以实现高效的药物开发。组织-等离子体分配系数(Kp)是PBPK模型描述微分方程的关键参数。然而,由于在组织中测量药物浓度比在血浆中测量困难得多,因此很难通过实验获得Kp值。目的:许多研究人员都在寻找计算机方法来代替实验。目前,大多数Kp预测模型都使用体外和体内参数作为解释变量。我们想到了可以提高可预测性的物理化学描述符。因此,我们的目标是建立Kp的二维定量构效关系(2D-QSAR)模型,使用物理化学描述符而不是体内实验数据作为解释变量。方法:根据已发表的方法(Yun et al.),使用20倍交叉验证将我们的模型与传统模型进行比较。药理学杂志,2014(1):1- 4。我们使用随机森林算法,这是已知的2D-QSAR模型的最佳预测器之一。最后,我们结合了最小的体外实验值和理化描述符。在此基础上,建立了利用少量体外参数和理化描述符预测Kp值的方法;这是一个多模态模型。结果:该模型的准确性优于常规模型。本研究结果表明,多模态对2D-QSAR模型是有用的[RMSE和双重误差%:0.66和42.2% (Berezohkovsky), 0.52和52.2% (Rodgers), 0.65和34.6% (Schmitt), 0.44和61.1%(已发表模型),0.41和62.1%(传统模型),0.39和64.5%(多模态模型)]。结论:利用少量体外实验数据和理化描述符,可以建立精度最高的Kp值2D-QSAR模型。
Development of a 2D-QSAR Model for Tissue-to-Plasma Partition Coefficient Value with High Accuracy Using Machine Learning Method, Minimum Required Experimental Values, and Physicochemical Descriptors.
Background: The demand for physiologically based pharmacokinetic (PBPK) model is increasing currently. New drug application (NDA) of many compounds is submitted with PBPK models for efficient drug development. Tissue-to-plasma partition coefficient (Kp) is a key parameter for the PBPK model to describe differential equations. However, it is difficult to obtain the Kp value experimentally because the measurement of drug concentration in the tissue is much harder than that in plasma.
Objective: Instead of experiments, many researchers have sought in silico methods. Today, most of the models for Kp prediction are using in vitro and in vivo parameters as explanatory variables. We thought of physicochemical descriptors that could improve the predictability. Therefore, we aimed to develop the two-dimensional quantitative structure-activity relationship (2D-QSAR) model for Kp using physicochemical descriptors instead of in vivo experimental data as explanatory variables.
Methods: We compared our model with the conventional models using 20-fold cross-validation according to the published method (Yun et al. J Pharmacokinet Pharmacodyn 41:1-14, 2014). We used random forest algorithm, which is known to be one of the best predictors for the 2D-QSAR model. Finally, we combined minimum in vitro experimental values and physiochemical descriptors. Thus, the prediction method for Kp value using a few in vitro parameters and physicochemical descriptors was developed; this is a multimodal model.
Results: Its accuracy was found to be superior to that of the conventional models. Results of this research suggest that multimodality is useful for the 2D-QSAR model [RMSE and % of two-fold error: 0.66 and 42.2% (Berezohkovsky), 0.52 and 52.2% (Rodgers), 0.65 and 34.6% (Schmitt), 0.44 and 61.1% (published model), 0.41 and 62.1% (traditional model), 0.39 and 64.5% (multimodal model)].
Conclusion: We could develop a 2D-QSAR model for Kp value with the highest accuracy using a few in vitro experimental data and physicochemical descriptors.
期刊介绍:
Hepatology International is a peer-reviewed journal featuring articles written by clinicians, clinical researchers and basic scientists is dedicated to research and patient care issues in hepatology. This journal focuses mainly on new and emerging diagnostic and treatment options, protocols and molecular and cellular basis of disease pathogenesis, new technologies, in liver and biliary sciences.
Hepatology International publishes original research articles related to clinical care and basic research; review articles; consensus guidelines for diagnosis and treatment; invited editorials, and controversies in contemporary issues. The journal does not publish case reports.