{"title":"DeepFM-Crispr: Prediction of CRISPR On-Target Effects via Deep Learning","authors":"Condy Bao, Fuxiao Liu","doi":"arxiv-2409.05938","DOIUrl":null,"url":null,"abstract":"Since the advent of CRISPR-Cas9, a groundbreaking gene-editing technology\nthat enables precise genomic modifications via a short RNA guide sequence,\nthere has been a marked increase in the accessibility and application of this\ntechnology across various fields. The success of CRISPR-Cas9 has spurred\nfurther investment and led to the discovery of additional CRISPR systems,\nincluding CRISPR-Cas13. Distinct from Cas9, which targets DNA, Cas13 targets\nRNA, offering unique advantages for gene modulation. We focus on Cas13d, a\nvariant known for its collateral activity where it non-specifically cleaves\nadjacent RNA molecules upon activation, a feature critical to its function. We\nintroduce DeepFM-Crispr, a novel deep learning model developed to predict the\non-target efficiency and evaluate the off-target effects of Cas13d. This model\nharnesses a large language model to generate comprehensive representations rich\nin evolutionary and structural data, thereby enhancing predictions of RNA\nsecondary structures and overall sgRNA efficacy. A transformer-based\narchitecture processes these inputs to produce a predictive efficacy score.\nComparative experiments show that DeepFM-Crispr not only surpasses traditional\nmodels but also outperforms recent state-of-the-art deep learning methods in\nterms of prediction accuracy and reliability.","PeriodicalId":501266,"journal":{"name":"arXiv - QuanBio - Quantitative Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Quantitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Since the advent of CRISPR-Cas9, a groundbreaking gene-editing technology
that enables precise genomic modifications via a short RNA guide sequence,
there has been a marked increase in the accessibility and application of this
technology across various fields. The success of CRISPR-Cas9 has spurred
further investment and led to the discovery of additional CRISPR systems,
including CRISPR-Cas13. Distinct from Cas9, which targets DNA, Cas13 targets
RNA, offering unique advantages for gene modulation. We focus on Cas13d, a
variant known for its collateral activity where it non-specifically cleaves
adjacent RNA molecules upon activation, a feature critical to its function. We
introduce DeepFM-Crispr, a novel deep learning model developed to predict the
on-target efficiency and evaluate the off-target effects of Cas13d. This model
harnesses a large language model to generate comprehensive representations rich
in evolutionary and structural data, thereby enhancing predictions of RNA
secondary structures and overall sgRNA efficacy. A transformer-based
architecture processes these inputs to produce a predictive efficacy score.
Comparative experiments show that DeepFM-Crispr not only surpasses traditional
models but also outperforms recent state-of-the-art deep learning methods in
terms of prediction accuracy and reliability.