Identifying Molecular Properties of Ataxin-2 Inhibitors for Spinocerebellar Ataxia Type 2 Utilizing High-Throughput Screening and Machine Learning.

IF 3.6 3区 生物学 Q1 BIOLOGY
Smita Sahay, Jingran Wen, Daniel R Scoles, Anton Simeonov, Thomas S Dexheimer, Ajit Jadhav, Stephen C Kales, Hongmao Sun, Stefan M Pulst, Julio C Facelli, David E Jones
{"title":"Identifying Molecular Properties of Ataxin-2 Inhibitors for Spinocerebellar Ataxia Type 2 Utilizing High-Throughput Screening and Machine Learning.","authors":"Smita Sahay, Jingran Wen, Daniel R Scoles, Anton Simeonov, Thomas S Dexheimer, Ajit Jadhav, Stephen C Kales, Hongmao Sun, Stefan M Pulst, Julio C Facelli, David E Jones","doi":"10.3390/biology14050522","DOIUrl":null,"url":null,"abstract":"<p><p>Spinocerebellar ataxia type 2 (SCA2) is an autosomal dominant neurodegenerative disorder marked by cerebellar dysfunction, ataxic gait, and progressive motor impairments. SCA2 is caused by the pathologic expansion of CAG repeats in the ataxin-2 (<i>ATXN2</i>) gene, leading to a toxic gain-of-function mutation of the ataxin-2 protein. Currently, SCA2 therapeutic efforts are expanding beyond symptomatic relief to include disease-modifying approaches such as antisense oligonucleotides (ASOs), high-throughput screening (HTS) for small molecule inhibitors, and gene therapy aimed at reducing <i>ATXN2</i> expression. In the present study, data mining and machine learning techniques were employed to analyze HTS data and identify robust molecular properties of potential inhibitors of <i>ATXN2</i>. Three HTS datasets were selected for analysis: <i>ATXN2</i> gene expression, CMV promoter expression, and biochemical control (luciferase) gene expression. Compounds displaying significant <i>ATXN2</i> inhibition with minimal impact on control assays were deciphered based on effectiveness (E) values (<i>n</i> = 1321). Molecular descriptors associated with these compounds were calculated using MarvinSketch (<i>n</i> = 82). The molecular descriptor data (MD model) was analyzed separately from the experimentally determined screening data (S model) as well as together (MD-S model). Compounds were clustered based on structural similarity independently for the three models using the SimpleKMeans algorithm into the optimal number of clusters (<i>n</i> = 26). For each model, the maximum response assay values were analyzed, and E values and total rank values were applied. The S clusters were further subclustered, and the molecular properties of compounds in the top candidate subcluster were compared to those from the bottom candidate subcluster. Six compounds with high <i>ATXN2</i> inhibiting potential and 16 molecular descriptors were identified as significantly unique to those compounds (<i>p</i> < 0.05). These results are consistent with a quantitative HTS study that identified and validated similar small-molecule compounds, like cardiac glycosides, that reduce endogenous ATXN2 in a dose-dependent manner. Overall, these findings demonstrate that the integration of HTS analysis with data mining and machine learning is a promising approach for discovering chemical properties of candidate drugs for SCA2.</p>","PeriodicalId":48624,"journal":{"name":"Biology-Basel","volume":"14 5","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12108740/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biology-Basel","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3390/biology14050522","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Spinocerebellar ataxia type 2 (SCA2) is an autosomal dominant neurodegenerative disorder marked by cerebellar dysfunction, ataxic gait, and progressive motor impairments. SCA2 is caused by the pathologic expansion of CAG repeats in the ataxin-2 (ATXN2) gene, leading to a toxic gain-of-function mutation of the ataxin-2 protein. Currently, SCA2 therapeutic efforts are expanding beyond symptomatic relief to include disease-modifying approaches such as antisense oligonucleotides (ASOs), high-throughput screening (HTS) for small molecule inhibitors, and gene therapy aimed at reducing ATXN2 expression. In the present study, data mining and machine learning techniques were employed to analyze HTS data and identify robust molecular properties of potential inhibitors of ATXN2. Three HTS datasets were selected for analysis: ATXN2 gene expression, CMV promoter expression, and biochemical control (luciferase) gene expression. Compounds displaying significant ATXN2 inhibition with minimal impact on control assays were deciphered based on effectiveness (E) values (n = 1321). Molecular descriptors associated with these compounds were calculated using MarvinSketch (n = 82). The molecular descriptor data (MD model) was analyzed separately from the experimentally determined screening data (S model) as well as together (MD-S model). Compounds were clustered based on structural similarity independently for the three models using the SimpleKMeans algorithm into the optimal number of clusters (n = 26). For each model, the maximum response assay values were analyzed, and E values and total rank values were applied. The S clusters were further subclustered, and the molecular properties of compounds in the top candidate subcluster were compared to those from the bottom candidate subcluster. Six compounds with high ATXN2 inhibiting potential and 16 molecular descriptors were identified as significantly unique to those compounds (p < 0.05). These results are consistent with a quantitative HTS study that identified and validated similar small-molecule compounds, like cardiac glycosides, that reduce endogenous ATXN2 in a dose-dependent manner. Overall, these findings demonstrate that the integration of HTS analysis with data mining and machine learning is a promising approach for discovering chemical properties of candidate drugs for SCA2.

利用高通量筛选和机器学习鉴定2型脊髓小脑共济失调的Ataxin-2抑制剂的分子特性。
脊髓小脑性共济失调2型(SCA2)是一种常染色体显性神经退行性疾病,以小脑功能障碍、共济失调步态和进行性运动障碍为特征。SCA2是由ataxin-2 (ATXN2)基因CAG重复序列的病理性扩增引起的,导致ataxin-2蛋白的毒性功能获得突变。目前,SCA2的治疗努力正在从症状缓解扩展到包括疾病改善方法,如反义寡核苷酸(ASOs)、小分子抑制剂的高通量筛选(HTS)和旨在降低ATXN2表达的基因治疗。在本研究中,采用数据挖掘和机器学习技术来分析HTS数据,并确定潜在的ATXN2抑制剂的稳健分子特性。选择3个HTS数据集进行分析:ATXN2基因表达、CMV启动子表达和生化控制(荧光素酶)基因表达。根据有效性(E)值(n = 1321)对具有显著ATXN2抑制作用且对对照试验影响最小的化合物进行了破译。使用marvinssketch (n = 82)计算与这些化合物相关的分子描述符。将分子描述符数据(MD模型)与实验确定的筛选数据(S模型)分开分析,也将分子描述符数据(MD-S模型)一起分析。采用SimpleKMeans算法,根据三种模型的结构相似性独立聚类到最优聚类数(n = 26)。对于每个模型,分析最大反应测定值,并应用E值和总秩值。将S簇进一步亚簇化,并比较了顶部候选子簇和底部候选子簇中化合物的分子性质。6个ATXN2抑制电位较高的化合物和16个分子描述符与这些化合物具有显著的独特性(p < 0.05)。这些结果与一项定量HTS研究一致,该研究确定并验证了类似的小分子化合物,如心糖苷,以剂量依赖的方式降低内源性ATXN2。总的来说,这些发现表明,HTS分析与数据挖掘和机器学习的集成是发现SCA2候选药物化学性质的一种很有前途的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biology-Basel
Biology-Basel Biological Science-Biological Science
CiteScore
5.70
自引率
4.80%
发文量
1618
审稿时长
11 weeks
期刊介绍: Biology (ISSN 2079-7737) is an international, peer-reviewed, quick-refereeing open access journal of Biological Science published by MDPI online. It publishes reviews, research papers and communications in all areas of biology and at the interface of related disciplines. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced. Electronic files regarding the full details of the experimental procedure, if unable to be published in a normal way, can be deposited as supplementary material.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信