Mingming Cao, Alexander Brennan, Ciaran M Lee, So-Hyun Park, Gang Bao
{"title":"Deep Learning Based Models for CRISPR/Cas Off-Target Prediction.","authors":"Mingming Cao, Alexander Brennan, Ciaran M Lee, So-Hyun Park, Gang Bao","doi":"10.1002/smtd.202500122","DOIUrl":null,"url":null,"abstract":"<p><p>CRISPR/Cas genome editing technologies enable effective and controlled genetic modifications; however, off-target effects remain a significant concern, particularly in clinical applications. Experimental and in silico methods are developed to predict potential off-target sites (OTS), including deep learning based methods, which can automatically and comprehensively learn sequence features, offer a promising tool for OTS prediction. Here, this work reviews the existing OTS prediction tools with an emphasis on deep learning methods, characterizes datasets used for deep learning training and testing, and evaluates six deep learning models -CRISPR-Net, CRISPR-IP, R-CRISPR, CRISPR-M, CrisprDNT, and Crispr-SGRU -using six public datasets and validates OTS data from the CRISPRoffT database. Performance of these models is assessed using standardized metrics, such as Precision, Recall, F1 score, MCC, AUROC and PRAUC. This work finds that incorporating validated OTS datasets into model training enhanced overall model performance, and improved robustness of prediction, particularly with highly imbalanced datasets. While no model consistently outperforms other models across all scenarios, CRISPR-Net, R-CRISPR, and Crispr-SGRU show strong overall performance. This analysis demonstrates the importance of integrating high-quality validated OTS data with advanced deep learning architectures to improve CRISPR/Cas off-target site predictions, ensuring safer genome editing applications.</p>","PeriodicalId":229,"journal":{"name":"Small Methods","volume":" ","pages":"e2500122"},"PeriodicalIF":10.7000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Small Methods","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/smtd.202500122","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
CRISPR/Cas genome editing technologies enable effective and controlled genetic modifications; however, off-target effects remain a significant concern, particularly in clinical applications. Experimental and in silico methods are developed to predict potential off-target sites (OTS), including deep learning based methods, which can automatically and comprehensively learn sequence features, offer a promising tool for OTS prediction. Here, this work reviews the existing OTS prediction tools with an emphasis on deep learning methods, characterizes datasets used for deep learning training and testing, and evaluates six deep learning models -CRISPR-Net, CRISPR-IP, R-CRISPR, CRISPR-M, CrisprDNT, and Crispr-SGRU -using six public datasets and validates OTS data from the CRISPRoffT database. Performance of these models is assessed using standardized metrics, such as Precision, Recall, F1 score, MCC, AUROC and PRAUC. This work finds that incorporating validated OTS datasets into model training enhanced overall model performance, and improved robustness of prediction, particularly with highly imbalanced datasets. While no model consistently outperforms other models across all scenarios, CRISPR-Net, R-CRISPR, and Crispr-SGRU show strong overall performance. This analysis demonstrates the importance of integrating high-quality validated OTS data with advanced deep learning architectures to improve CRISPR/Cas off-target site predictions, ensuring safer genome editing applications.
Small MethodsMaterials Science-General Materials Science
CiteScore
17.40
自引率
1.60%
发文量
347
期刊介绍:
Small Methods is a multidisciplinary journal that publishes groundbreaking research on methods relevant to nano- and microscale research. It welcomes contributions from the fields of materials science, biomedical science, chemistry, and physics, showcasing the latest advancements in experimental techniques.
With a notable 2022 Impact Factor of 12.4 (Journal Citation Reports, Clarivate Analytics, 2023), Small Methods is recognized for its significant impact on the scientific community.
The online ISSN for Small Methods is 2366-9608.