| Téléchargement | - Voir la version finale : Predicting CRISPR-Cas9 off-target effects in human primary cells using bidirectional LSTM with BERT embedding (PDF, 1.5 Mio)
- Voir les données supplémentaires : Predicting CRISPR-Cas9 off-target effects in human primary cells using bidirectional LSTM with BERT embedding (DOCX, 289 Kio)
|
|---|
| DOI | Trouver le DOI : https://doi.org/10.1093/bioadv/vbae184 |
|---|
| Auteur | Rechercher : Sari, Orhan; Rechercher : Liu, Ziying1Identifiant ORCID : https://orcid.org/0000-0001-7821-6723; Rechercher : Pan, Youlian1Identifiant ORCID : https://orcid.org/0000-0002-0158-0081; Rechercher : Shao, Xiaojian1Identifiant ORCID : https://orcid.org/0000-0002-3044-621X |
|---|
| Éditeur | Rechercher : Lengauer, Thomas |
|---|
| Affiliation | - Conseil national de recherches Canada. Technologies numériques
|
|---|
| Bailleur de fonds | Rechercher : National Research Council Canada |
|---|
| Format | Texte, Article |
|---|
| Résumé | Motivation:
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-Cas9 system is a ground-breaking genome editing tool, which has revolutionized cell and gene therapies. One of the essential components involved in this system that ensures its success is the design of an optimal single-guide RNA (sgRNA) with high on-target cleavage efficiency and low off-target effects. This is challenging as many conditions need to be considered, and empirically testing every design is time-consuming and costly. In silico prediction using machine learning models provides high-performance alternatives.
Results:
We present CrisprBERT, a deep learning model incorporating a Bidirectional Encoder Representations from Transformers (BERT) architecture to provide a high-dimensional embedding for paired sgRNA and DNA sequences and Bidirectional Long Short-term Memory networks for learning, to predict the off-target effects of sgRNAs utilizing only the sgRNAs and their paired DNA sequences. We proposed doublet stack encoding to capture the local energy configuration of the Cas9 binding and applied the BERT model to learn the contextual embedding of the doublet pairs. Our results showed that the new model achieved better performance than state-of-the-art deep learning models regarding single split and leave-one-sgRNA-out cross-validations as well as independent testing.
Availability and implementation:
The CrisprBERT is available at GitHub: https://github.com/OSsari/CrisprBERT. |
|---|
| Date de publication | 2024-12-30 |
|---|
| Maison d’édition | Oxford University Press |
|---|
| Licence | |
|---|
| Dans | |
|---|
| Langue | anglais |
|---|
| Publications évaluées par des pairs | Oui |
|---|
| Exporter la notice | Exporter en format RIS |
|---|
| Signaler une correction | Signaler une correction (s'ouvre dans un nouvel onglet) |
|---|
| Identificateur de l’enregistrement | 3d9ab978-c101-4d38-a75e-984f04eb3430 |
|---|
| Enregistrement créé | 2025-02-26 |
|---|
| Enregistrement modifié | 2025-11-03 |
|---|