Deep Learning Based Models for CRISPR/Cas Off-Target Prediction.

IF 10.7 2区材料科学 Q1 CHEMISTRY, PHYSICAL

Small Methods Pub Date : 2025-06-04 DOI:10.1002/smtd.202500122

Mingming Cao, Alexander Brennan, Ciaran M Lee, So-Hyun Park, Gang Bao

{"title":"Deep Learning Based Models for CRISPR/Cas Off-Target Prediction.","authors":"Mingming Cao, Alexander Brennan, Ciaran M Lee, So-Hyun Park, Gang Bao","doi":"10.1002/smtd.202500122","DOIUrl":null,"url":null,"abstract":"<p><p>CRISPR/Cas genome editing technologies enable effective and controlled genetic modifications; however, off-target effects remain a significant concern, particularly in clinical applications. Experimental and in silico methods are developed to predict potential off-target sites (OTS), including deep learning based methods, which can automatically and comprehensively learn sequence features, offer a promising tool for OTS prediction. Here, this work reviews the existing OTS prediction tools with an emphasis on deep learning methods, characterizes datasets used for deep learning training and testing, and evaluates six deep learning models -CRISPR-Net, CRISPR-IP, R-CRISPR, CRISPR-M, CrisprDNT, and Crispr-SGRU -using six public datasets and validates OTS data from the CRISPRoffT database. Performance of these models is assessed using standardized metrics, such as Precision, Recall, F1 score, MCC, AUROC and PRAUC. This work finds that incorporating validated OTS datasets into model training enhanced overall model performance, and improved robustness of prediction, particularly with highly imbalanced datasets. While no model consistently outperforms other models across all scenarios, CRISPR-Net, R-CRISPR, and Crispr-SGRU show strong overall performance. This analysis demonstrates the importance of integrating high-quality validated OTS data with advanced deep learning architectures to improve CRISPR/Cas off-target site predictions, ensuring safer genome editing applications.</p>","PeriodicalId":229,"journal":{"name":"Small Methods","volume":" ","pages":"e2500122"},"PeriodicalIF":10.7000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Small Methods","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/smtd.202500122","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}

引用次数: 0

Abstract

CRISPR/Cas genome editing technologies enable effective and controlled genetic modifications; however, off-target effects remain a significant concern, particularly in clinical applications. Experimental and in silico methods are developed to predict potential off-target sites (OTS), including deep learning based methods, which can automatically and comprehensively learn sequence features, offer a promising tool for OTS prediction. Here, this work reviews the existing OTS prediction tools with an emphasis on deep learning methods, characterizes datasets used for deep learning training and testing, and evaluates six deep learning models -CRISPR-Net, CRISPR-IP, R-CRISPR, CRISPR-M, CrisprDNT, and Crispr-SGRU -using six public datasets and validates OTS data from the CRISPRoffT database. Performance of these models is assessed using standardized metrics, such as Precision, Recall, F1 score, MCC, AUROC and PRAUC. This work finds that incorporating validated OTS datasets into model training enhanced overall model performance, and improved robustness of prediction, particularly with highly imbalanced datasets. While no model consistently outperforms other models across all scenarios, CRISPR-Net, R-CRISPR, and Crispr-SGRU show strong overall performance. This analysis demonstrates the importance of integrating high-quality validated OTS data with advanced deep learning architectures to improve CRISPR/Cas off-target site predictions, ensuring safer genome editing applications.

查看原文本刊更多论文

基于深度学习的CRISPR/Cas脱靶预测模型。

CRISPR/Cas基因组编辑技术能够实现有效和可控的遗传修饰；然而，脱靶效应仍然值得关注，特别是在临床应用中。潜在脱靶点（OTS）预测的实验方法和计算机方法都得到了发展，其中基于深度学习的方法可以自动、全面地学习序列特征，为OTS预测提供了一种很有前途的工具。本文回顾了现有的深度学习预测工具，重点介绍了深度学习方法，描述了用于深度学习训练和测试的数据集，并使用六个公共数据集评估了六个深度学习模型——crispr - net、CRISPR-IP、R-CRISPR、CRISPR-M、CrisprDNT和Crispr-SGRU，并验证了来自CRISPRoffT数据库的OTS数据。这些模型的性能使用标准化指标进行评估，如精度、召回率、F1分数、MCC、AUROC和PRAUC。这项工作发现，将经过验证的OTS数据集纳入模型训练可以增强模型的整体性能，并提高预测的鲁棒性，特别是对于高度不平衡的数据集。虽然没有模型在所有情况下都能始终优于其他模型，但CRISPR-Net、R-CRISPR和Crispr-SGRU表现出强大的整体性能。这一分析证明了将高质量的经过验证的OTS数据与先进的深度学习架构相结合的重要性，以改善CRISPR/Cas脱靶位点预测，确保更安全的基因组编辑应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Small Methods Materials Science-General Materials Science

CiteScore

17.40

自引率

1.60%

发文量

347

期刊介绍： Small Methods is a multidisciplinary journal that publishes groundbreaking research on methods relevant to nano- and microscale research. It welcomes contributions from the fields of materials science, biomedical science, chemistry, and physics, showcasing the latest advancements in experimental techniques. With a notable 2022 Impact Factor of 12.4 (Journal Citation Reports, Clarivate Analytics, 2023), Small Methods is recognized for its significant impact on the scientific community. The online ISSN for Small Methods is 2366-9608.