SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes.

IF 2.9 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS
M Eremeyeva, Y Din, N Shirokii, N Serov
{"title":"SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes.","authors":"M Eremeyeva, Y Din, N Shirokii, N Serov","doi":"10.1186/s12859-024-06019-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided by catalytic cores able to perform site-specific hydrolysis of the phosphodiester bond of an RNA substrate. However, the vast majority of existing DNAzyme catalytic cores have low efficacy in in vivo experiments, whereas SELEX based on in vitro screening offers long and expensive selection cycle with the average success rate of ~ 30%, moreover not allowing the direct selection of chemically modified DNAzymes, which were previously shown to demonstrate higher activity in vivo. Therefore, there is a huge need in in silico approach for exploratory analysis of RNA-cleaving DNAzyme cores to drastically ease the discovery of novel catalytic cores with superior activities.</p><p><strong>Results: </strong>In this work, we develop a machine learning based open-source platform SequenceCraft allowing experimental scientists to perform DNAzyme exploratory analysis via quantitative observed rate constant (k<sub>obs</sub>) estimation as well as statistical and clustering data analysis. This became possible with the development of a unique curated database of > 350 RNA-cleaving catalytic cores, property-based sequence representations allowing to work with both conventional and chemically modified nucleotides, and optimized k<sub>obs</sub> predicting algorithm achieving Q<sup>2</sup> > 0.9 on experimental data published to date.</p><p><strong>Conclusions: </strong>This work represents a significant advancement in DNAzyme research, providing a tool for more efficient discovery of RNA-cleaving DNAzymes. The SequenceCraft platform offers an in silico alternative to traditional experimental approaches, potentially accelerating the development of DNAzymes.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"2"},"PeriodicalIF":2.9000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11706003/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-024-06019-7","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided by catalytic cores able to perform site-specific hydrolysis of the phosphodiester bond of an RNA substrate. However, the vast majority of existing DNAzyme catalytic cores have low efficacy in in vivo experiments, whereas SELEX based on in vitro screening offers long and expensive selection cycle with the average success rate of ~ 30%, moreover not allowing the direct selection of chemically modified DNAzymes, which were previously shown to demonstrate higher activity in vivo. Therefore, there is a huge need in in silico approach for exploratory analysis of RNA-cleaving DNAzyme cores to drastically ease the discovery of novel catalytic cores with superior activities.

Results: In this work, we develop a machine learning based open-source platform SequenceCraft allowing experimental scientists to perform DNAzyme exploratory analysis via quantitative observed rate constant (kobs) estimation as well as statistical and clustering data analysis. This became possible with the development of a unique curated database of > 350 RNA-cleaving catalytic cores, property-based sequence representations allowing to work with both conventional and chemically modified nucleotides, and optimized kobs predicting algorithm achieving Q2 > 0.9 on experimental data published to date.

Conclusions: This work represents a significant advancement in DNAzyme research, providing a tool for more efficient discovery of RNA-cleaving DNAzymes. The SequenceCraft platform offers an in silico alternative to traditional experimental approaches, potentially accelerating the development of DNAzymes.

SequenceCraft:基于机器学习的rna切割脱氧核酶探索性分析资源。
背景:脱氧核酶或DNAzymes是具有许多催化性质的人工短DNA序列。特别是,能够切割RNA序列的DNAzymes在基因治疗和疾病标记物序列特异性分析检测方面具有巨大的潜力。这种活性是由能够对RNA底物的磷酸二酯键进行特定位点水解的催化核提供的。然而,现有的绝大多数DNAzyme催化核心在体内实验中的效率较低,而基于体外筛选的SELEX选择周期长且昂贵,平均成功率约为30%,而且不能直接选择化学修饰的DNAzymes,而这些DNAzymes在体内表现出更高的活性。因此,迫切需要采用计算机方法对rna切割DNAzyme核心进行探索性分析,从而大大简化具有优越活性的新型催化核心的发现。在这项工作中,我们开发了一个基于机器学习的开源平台SequenceCraft,允许实验科学家通过定量观察速率常数(kobs)估计以及统计和聚类数据分析来执行DNAzyme探索性分析。随着bbbb350 rna切割催化核心的独特策划数据库的开发,基于属性的序列表示允许使用传统和化学修饰的核苷酸,以及优化的kobs预测算法在迄今公布的实验数据上达到Q2 > 0.9,这成为可能。结论:这项工作代表了DNAzyme研究的重大进展,为更有效地发现rna切割DNAzyme提供了工具。SequenceCraft平台为传统的实验方法提供了一种硅替代方案,有可能加速DNAzymes的开发。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Bioinformatics
BMC Bioinformatics 生物-生化研究方法
CiteScore
5.70
自引率
3.30%
发文量
506
审稿时长
4.3 months
期刊介绍: BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信