Classification of Potent and Weak Penetration Enhancers Using Multiple Feature Selection Methods and Machine Learning Models

IF 2.7 4区 医学 Q2 PHARMACOLOGY & PHARMACY
Baddipadige Raju, Neha Verma, Gera Narendra, Om Silakari, Bharti Sapra
{"title":"Classification of Potent and Weak Penetration Enhancers Using Multiple Feature Selection Methods and Machine Learning Models","authors":"Baddipadige Raju,&nbsp;Neha Verma,&nbsp;Gera Narendra,&nbsp;Om Silakari,&nbsp;Bharti Sapra","doi":"10.1007/s12247-023-09757-y","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>Chemical penetration enhancers (CPEs) are important in transdermal drug delivery (TDDD) formulations because they assist drugs in moving across the stratum corneum. Hydrocortisone (0.1% hydrocortisone, propylene glycol), oestradiol (0.045 mg estradiol/0.015 mg levonorgestrel, propylene glycol), and testosterone (2% testosterone, propylene glycol) are some examples of marketing TDDD formulations. As the transdermal route for drug administration becomes a safer and more appealing alternative to hypodermic needles, the search for new CPEs and their development becomes more important. Thus, the current work was directed toward the rapid identification of potent CPEs through the development of robust machine learning (ML) classification models.</p><h3>Methods</h3><p>Two large penetration enhancer (PE) data sets reported to date such as hydrocortisone (139 PEs) and theophylline (101 PEs) were used to build classification models. In the present investigation, a combination of feature selection methods, i.e., Boruta and Recursive Feature Elimination (RFE), and machine learning (ML) algorithms such as support vector machine (SVM), random forest (RF), and artificial neural network (ANN) were employed to classify the potent and weak penetration enhancers of hydrocortisone and theophylline. The tenfold cross-validation and Y-randomization methods were used to evaluate the prediction performance of the developed models.</p><h3>Results</h3><p>Significant classification models were built for both data sets when the RFE method and RF algorithm were used. RF classifiers outperformed hydrocortisone and theophylline data sets with test set accuracy and Matthew’s correlation coefficient (MCC) greater than 0.78. Simultaneously, four important features required for the accurate classification of potent and weak PEs were identified, i.e., nHCsatu, minHCsatu, AATS4p, and GATS4e.</p><h3>Conclusion</h3><p>Our approach produced robust ML classification models that can be applied to prioritize PEs from large databases. Utilization of these ML models in virtual screening experiments could save time and effort in the identification of potential PEs.</p><h3>Graphical Abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":656,"journal":{"name":"Journal of Pharmaceutical Innovation","volume":"18 4","pages":"1778 - 1797"},"PeriodicalIF":2.7000,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pharmaceutical Innovation","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s12247-023-09757-y","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose

Chemical penetration enhancers (CPEs) are important in transdermal drug delivery (TDDD) formulations because they assist drugs in moving across the stratum corneum. Hydrocortisone (0.1% hydrocortisone, propylene glycol), oestradiol (0.045 mg estradiol/0.015 mg levonorgestrel, propylene glycol), and testosterone (2% testosterone, propylene glycol) are some examples of marketing TDDD formulations. As the transdermal route for drug administration becomes a safer and more appealing alternative to hypodermic needles, the search for new CPEs and their development becomes more important. Thus, the current work was directed toward the rapid identification of potent CPEs through the development of robust machine learning (ML) classification models.

Methods

Two large penetration enhancer (PE) data sets reported to date such as hydrocortisone (139 PEs) and theophylline (101 PEs) were used to build classification models. In the present investigation, a combination of feature selection methods, i.e., Boruta and Recursive Feature Elimination (RFE), and machine learning (ML) algorithms such as support vector machine (SVM), random forest (RF), and artificial neural network (ANN) were employed to classify the potent and weak penetration enhancers of hydrocortisone and theophylline. The tenfold cross-validation and Y-randomization methods were used to evaluate the prediction performance of the developed models.

Results

Significant classification models were built for both data sets when the RFE method and RF algorithm were used. RF classifiers outperformed hydrocortisone and theophylline data sets with test set accuracy and Matthew’s correlation coefficient (MCC) greater than 0.78. Simultaneously, four important features required for the accurate classification of potent and weak PEs were identified, i.e., nHCsatu, minHCsatu, AATS4p, and GATS4e.

Conclusion

Our approach produced robust ML classification models that can be applied to prioritize PEs from large databases. Utilization of these ML models in virtual screening experiments could save time and effort in the identification of potential PEs.

Graphical Abstract

Abstract Image

使用多种特征选择方法和机器学习模型对强渗透增强器和弱渗透增强器进行分类
目的化学渗透促进剂(CPE)在透皮给药(TDDD)配方中非常重要,因为它们有助于药物穿过角质层。氢化可的松(0.1% 氢化可的松,丙二醇)、雌二醇(0.045 毫克雌二醇/0.015 毫克左炔诺孕酮,丙二醇)和睾酮(2% 睾酮,丙二醇)是一些市场上销售的 TDDD 制剂。随着透皮给药途径成为皮下注射针头更安全、更有吸引力的替代品,寻找新的氯化聚乙烯及其开发变得更加重要。因此,当前工作的方向是通过开发稳健的机器学习(ML)分类模型来快速识别强效 CPE。方法使用迄今为止报道的两个大型渗透增强剂(PE)数据集,如氢化可的松(139 个 PE)和茶碱(101 个 PE)来建立分类模型。本研究结合了特征选择方法,即 Boruta 和递归特征消除(RFE),以及机器学习(ML)算法,如支持向量机(SVM)、随机森林(RF)和人工神经网络(ANN),对氢化可的松和茶碱的强效和弱效渗透增强剂进行了分类。结果当使用 RFE 方法和 RF 算法时,两个数据集都建立了显著的分类模型。RF 分类器的表现优于氢化可的松和茶碱数据集,其测试集准确率和马修相关系数 (MCC) 均大于 0.78。同时,还确定了准确分类强效和弱效聚乙烯所需的四个重要特征,即 nHCsatu、minHCsatu、AATS4p 和 GATS4e。在虚拟筛选实验中利用这些 ML 模型可以节省鉴定潜在 PE 的时间和精力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Pharmaceutical Innovation
Journal of Pharmaceutical Innovation PHARMACOLOGY & PHARMACY-
CiteScore
3.70
自引率
3.80%
发文量
90
审稿时长
>12 weeks
期刊介绍: The Journal of Pharmaceutical Innovation (JPI), is an international, multidisciplinary peer-reviewed scientific journal dedicated to publishing high quality papers emphasizing innovative research and applied technologies within the pharmaceutical and biotechnology industries. JPI''s goal is to be the premier communication vehicle for the critical body of knowledge that is needed for scientific evolution and technical innovation, from R&D to market. Topics will fall under the following categories: Materials science, Product design, Process design, optimization, automation and control, Facilities; Information management, Regulatory policy and strategy, Supply chain developments , Education and professional development, Journal of Pharmaceutical Innovation publishes four issues a year.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信