Initial Development of Automated Machine Learning-Assisted Prediction Tools for Aryl Hydrocarbon Receptor Activators.

IF 4.9 3区 医学 Q1 PHARMACOLOGY & PHARMACY
Paulina Anna Wojtyło, Natalia Łapińska, Lucia Bellagamba, Emidio Camaioni, Aleksander Mendyk, Stefano Giovagnoli
{"title":"Initial Development of Automated Machine Learning-Assisted Prediction Tools for Aryl Hydrocarbon Receptor Activators.","authors":"Paulina Anna Wojtyło, Natalia Łapińska, Lucia Bellagamba, Emidio Camaioni, Aleksander Mendyk, Stefano Giovagnoli","doi":"10.3390/pharmaceutics16111456","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background</b>: The aryl hydrocarbon receptor (AhR) plays a crucial role in immune and metabolic processes. The large molecular diversity of ligands capable of activating AhR makes it impossible to determine the structural features useful for the design of new potent modulators. Thus, in the field of drug discovery, the intricate nature of AhR activation necessitates the development of novel tools to address related challenges. <b>Methods</b>: In this study, quantitative structure-activity relationship (QSAR) models of classification and regression were developed with the objective of identifying the most effective method for predicting AhR activity. The initial dataset was obtained by combining the ChEMBL and WIPO databases which contained 978 molecules with EC<sub>50</sub> values. The predictive models were developed using the automated machine learning platform mljar according to a 10-fold cross validation (10-CV) testing procedure. <b>Results</b>: The classification model demonstrated an accuracy value of 0.760 and F1 value of 0.789 for the test set. The root-mean-squared error (RMSE) was 5444, and the coefficient of determination (R<sup>2</sup>) was 0.208 for the regression model. The Shapley Additive Explanations (SHAP) method was then employed for a deeper comprehension of the impact of the variables on the model's predictions. As a practical application for scientific purposes, the best performing classification model was then used to develop an AhR web application. This application is accessible online and has been implemented in Streamlit. <b>Conclusions</b>: The findings may serve as a foundation in prompting further research into the development of a QSAR model, which could enhance comprehension of the influence of ligand structure on the modulation of AhR activity.</p>","PeriodicalId":19894,"journal":{"name":"Pharmaceutics","volume":"16 11","pages":""},"PeriodicalIF":4.9000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11597659/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/pharmaceutics16111456","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The aryl hydrocarbon receptor (AhR) plays a crucial role in immune and metabolic processes. The large molecular diversity of ligands capable of activating AhR makes it impossible to determine the structural features useful for the design of new potent modulators. Thus, in the field of drug discovery, the intricate nature of AhR activation necessitates the development of novel tools to address related challenges. Methods: In this study, quantitative structure-activity relationship (QSAR) models of classification and regression were developed with the objective of identifying the most effective method for predicting AhR activity. The initial dataset was obtained by combining the ChEMBL and WIPO databases which contained 978 molecules with EC50 values. The predictive models were developed using the automated machine learning platform mljar according to a 10-fold cross validation (10-CV) testing procedure. Results: The classification model demonstrated an accuracy value of 0.760 and F1 value of 0.789 for the test set. The root-mean-squared error (RMSE) was 5444, and the coefficient of determination (R2) was 0.208 for the regression model. The Shapley Additive Explanations (SHAP) method was then employed for a deeper comprehension of the impact of the variables on the model's predictions. As a practical application for scientific purposes, the best performing classification model was then used to develop an AhR web application. This application is accessible online and has been implemented in Streamlit. Conclusions: The findings may serve as a foundation in prompting further research into the development of a QSAR model, which could enhance comprehension of the influence of ligand structure on the modulation of AhR activity.

针对芳基烃受体激活剂的机器学习辅助自动预测工具的初步开发。
背景:芳基烃受体(AhR芳基烃受体(AhR)在免疫和代谢过程中发挥着至关重要的作用。能够激活 AhR 的配体分子种类繁多,因此无法确定其结构特征,从而无法设计出新的强效调节剂。因此,在药物发现领域,由于 AhR 激活的复杂性,有必要开发新型工具来应对相关挑战。方法:本研究开发了分类和回归的定量结构-活性关系(QSAR)模型,目的是找出预测 AhR 活性的最有效方法。最初的数据集是结合 ChEMBL 和 WIPO 数据库获得的,其中包含 978 个具有 EC50 值的分子。根据 10 倍交叉验证(10-CV)测试程序,使用自动机器学习平台 mljar 开发了预测模型。结果显示分类模型的准确度为 0.760,测试集的 F1 值为 0.789。回归模型的均方根误差(RMSE)为 5444,判定系数(R2)为 0.208。然后,为了更深入地理解变量对模型预测的影响,采用了夏普利加法解释(SHAP)方法。作为科学目的的实际应用,性能最佳的分类模型随后被用于开发 AhR 网络应用程序。该应用程序可在线访问,并已在 Streamlit 中实施。结论研究结果可作为进一步研究开发 QSAR 模型的基础,从而加深理解配体结构对 AhR 活性调节的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pharmaceutics
Pharmaceutics Pharmacology, Toxicology and Pharmaceutics-Pharmaceutical Science
CiteScore
7.90
自引率
11.10%
发文量
2379
审稿时长
16.41 days
期刊介绍: Pharmaceutics (ISSN 1999-4923) is an open access journal which provides an advanced forum for the science and technology of pharmaceutics and biopharmaceutics. It publishes reviews, regular research papers, communications,  and short notes. Covered topics include pharmacokinetics, toxicokinetics, pharmacodynamics, pharmacogenetics and pharmacogenomics, and pharmaceutical formulation. Our aim is to encourage scientists to publish their experimental and theoretical details in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信