The use of machine learning-based sequential virtual screening in the search of new ligands of 5-HT6 receptor

Q4 Pharmacology, Toxicology and Pharmaceutics
Michał Sapa, Alicja Gawalska, M. Kołaczkowski, Adam Bucki
{"title":"The use of machine learning-based sequential virtual screening in the search of new ligands of 5-HT6 receptor","authors":"Michał Sapa, Alicja Gawalska, M. Kołaczkowski, Adam Bucki","doi":"10.32383/farmpol/161474","DOIUrl":null,"url":null,"abstract":"5-HT6 receptor takes part in learning and memory processes. For this reason, the use of ligands of this receptor in the treatment of neurodegenerative diseases such as Alzheimer's disease, depression or autism is being investigated. The development of machine learning (ML) and access to large compound databases allow for the increasing use of these methods in search of new drugs. The use of ML in pre-clinical tests allows for a reduction in time and costs of drug discovery.\nIn this study, we used a sequential virtual screening approach in search of new structures with potential high affinity for the 5-HT6 receptor. Data from the ChEMBL database containing ligand binding affinities, measured as an inhibition constant (Ki), was used as the training dataset. Each step of the screening was based on machine learning models, the task of which was to classify compounds as potentially active and inactive. The first step included a ligand-based drug discovery (LBDD) approach, in which, using Klekota-Roth fingerprints and descriptors describing the chemical structure of the ligands, a classification model was developed to select a preliminary group of candidates from the Otava chemical compound database. In the second step, a structure-based drug discovery (SBDD) approach was used. For this purpose, compounds were docked to the homology model of the 5-HT6 receptor, developed using the AlphaFold algorithm and optimized by Induced-Fit Docking tool and molecular dynamics. Docking poses were scored by a trained Extra Trees classifier. Interactions of a reference ligand with 14 binding site residues were used as features for the trained model.\nThe use of machine learning as a scoring function allowed to improve the virtual screening parameters compared to the Glide GScore scoring function. Based on the obtained model, it was also confirmed that the location of a ligand near the Ser5.43 and Phe5.38 residues is important for binding the compound to the receptor. The procedure has allowed to select 20 candidates with new chemical structures compared to known ligands. In addition, the obtained compounds had a relatively low basic pKa compared to known ligands and thus may be suspected to have a low affinity for hERG channels and good brain penetration.","PeriodicalId":34025,"journal":{"name":"Farmacja Polska","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Farmacja Polska","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32383/farmpol/161474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
引用次数: 0

Abstract

5-HT6 receptor takes part in learning and memory processes. For this reason, the use of ligands of this receptor in the treatment of neurodegenerative diseases such as Alzheimer's disease, depression or autism is being investigated. The development of machine learning (ML) and access to large compound databases allow for the increasing use of these methods in search of new drugs. The use of ML in pre-clinical tests allows for a reduction in time and costs of drug discovery. In this study, we used a sequential virtual screening approach in search of new structures with potential high affinity for the 5-HT6 receptor. Data from the ChEMBL database containing ligand binding affinities, measured as an inhibition constant (Ki), was used as the training dataset. Each step of the screening was based on machine learning models, the task of which was to classify compounds as potentially active and inactive. The first step included a ligand-based drug discovery (LBDD) approach, in which, using Klekota-Roth fingerprints and descriptors describing the chemical structure of the ligands, a classification model was developed to select a preliminary group of candidates from the Otava chemical compound database. In the second step, a structure-based drug discovery (SBDD) approach was used. For this purpose, compounds were docked to the homology model of the 5-HT6 receptor, developed using the AlphaFold algorithm and optimized by Induced-Fit Docking tool and molecular dynamics. Docking poses were scored by a trained Extra Trees classifier. Interactions of a reference ligand with 14 binding site residues were used as features for the trained model. The use of machine learning as a scoring function allowed to improve the virtual screening parameters compared to the Glide GScore scoring function. Based on the obtained model, it was also confirmed that the location of a ligand near the Ser5.43 and Phe5.38 residues is important for binding the compound to the receptor. The procedure has allowed to select 20 candidates with new chemical structures compared to known ligands. In addition, the obtained compounds had a relatively low basic pKa compared to known ligands and thus may be suspected to have a low affinity for hERG channels and good brain penetration.
利用基于机器学习的序贯虚拟筛选技术寻找5-HT6受体的新配体
5-HT6受体参与学习和记忆过程。出于这个原因,正在研究这种受体的配体在治疗神经退行性疾病如阿尔茨海默病、抑郁症或自闭症中的应用。机器学习(ML)的发展和对大型化合物数据库的访问使得这些方法在寻找新药方面的使用越来越多。在临床前测试中使用ML可以减少药物发现的时间和成本。在这项研究中,我们使用顺序虚拟筛选方法来寻找对5-HT6受体具有潜在高亲和力的新结构。使用来自ChEMBL数据库的包含配体结合亲和力的数据(以抑制常数(Ki)测量)作为训练数据集。筛选的每一步都基于机器学习模型,其任务是将化合物分为潜在活性和非活性。第一步包括基于配体的药物发现(LBDD)方法,其中使用Klekota Roth指纹和描述配体化学结构的描述符,开发了一个分类模型,以从Otava化合物数据库中选择一组初步的候选药物。在第二步中,使用了基于结构的药物发现(SBDD)方法。为此,将化合物对接到5-HT6受体的同源性模型上,该模型使用AlphaFold算法开发,并通过诱导拟合对接工具和分子动力学进行优化。对接姿势由经过训练的Extra Trees分类器进行评分。参考配体与14个结合位点残基的相互作用被用作训练模型的特征。与Glide GScore评分功能相比,使用机器学习作为评分功能可以改进虚拟筛选参数。基于所获得的模型,还证实了配体在Ser5.43和Phe5.38残基附近的位置对于将化合物结合到受体是重要的。与已知配体相比,该程序可以选择20种具有新化学结构的候选者。此外,与已知配体相比,所获得的化合物具有相对低的碱性pKa,因此可能被怀疑对hERG通道具有低亲和力和良好的脑渗透性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Farmacja Polska
Farmacja Polska Pharmacology, Toxicology and Pharmaceutics-Pharmacology, Toxicology and Pharmaceutics (miscellaneous)
CiteScore
0.40
自引率
0.00%
发文量
54
审稿时长
2 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信