🧬 Molecular Biology · Biotechnology
📅 Березень 2026 ⏱ ≈ 10 хв читання 🟡 Середній

How CRISPR-Cas9 Works

A bacterial immune system, discovered in 2012 and refined since, lets scientists rewrite the genome with the precision of a word processor: find any 20-letter sequence in three billion base pairs, cut both DNA strands, and let the cell repair the break however the researcher wants. This is CRISPR-Cas9 — the most transformative molecular biology tool in decades.

Bacterial Immune Memory: Where CRISPR Came From

In 1987, Japanese scientists noticed a strange pattern in bacterial genomes: repeated DNA sequences separated by unique "spacer" sequences. The acronym CRISPR — Clustered Regularly Interspaced Short Palindromic Repeats — was coined in 2002.

It took until 2007 to understand what these sequences do. When a bacterium survives a viral infection, it cuts a short fragment of the virus's DNA and inserts it as a new spacer in its CRISPR array. This is a molecular memory of past infections. During a future infection, the bacterium transcribes these spacers into short RNA molecules and turns them into a surveillance system that destroys any DNA matching the stored sequence.

2020 Nobel Prize in Chemistry: Awarded to Emmanuelle Charpentier and Jennifer Doudna for their 2012 paper demonstrating that the bacterial CRISPR-Cas9 system could be reprogrammed with a synthetic guide RNA to cut any target DNA sequence — turning an immune system into a programmable gene-editing tool.

The Two Components: gRNA and Cas9

The CRISPR-Cas9 editing system requires just two molecular components:

To design an edit, a researcher simply synthesises a 20-nucleotide guide sequence matching the target region and assembles it into the sgRNA scaffold. The same Cas9 protein is reused for every experiment — only the guide RNA changes.

The PAM Sequence: The Address Lock

Cas9 cannot cut DNA at an arbitrary location — the guide RNA must locate a specific sequence called the PAM (Protospacer Adjacent Motif). For the most common CRISPR tool (S. pyogenes Cas9), the PAM sequence is NGG (any nucleotide followed by two guanines), which must appear immediately 3′ of the target sequence on the non-template strand.

5'—[20-nt spacer matching gRNA]—NGG—3' ← PAM 3'—[complementary strand]—NCC—5' Cas9 cuts 3 bases upstream of the PAM (between positions 17 and 18), creating a blunt-ended double-strand break.

The PAM requirement actually has a biological purpose: it prevents Cas9 from cutting the CRISPR array itself (where the spacers are stored), because those sequences are flanked by repeat sequences, not NGG PAMs.

The NGG PAM appears roughly every 8–12 base pairs in a typical genome, giving many potential cut sites near any gene of interest. Newer Cas variants (Cas12a, SpRY) recognise different PAMs or near-PAMless sequences, greatly expanding the targetable genome.

The Cutting Mechanism Step-by-Step

1
gRNA–Cas9 complex forms
The guide RNA (sgRNA) folds and binds the Cas9 protein, loading it with the 20-nt targeting sequence. Cas9 begins in an inactive state.
2
PAM scanning
The complex slides along double-stranded DNA, pausing at each NGG PAM sequence. Encountering a PAM causes local DNA unwinding.
3
R-loop formation
The gRNA's 20-nt spacer invades the DNA and begins base-pairing with the complementary strand, forming an "R-loop". This displacement proceeds seed-to-PAM (3′→5′).
4
Conformational change
Full 20-nt complementarity triggers a conformational change in Cas9 that activates both nuclease domains (RuvC and HNH), repositioning them onto opposite DNA strands.
5
Double-strand cleavage
HNH cuts the strand complementary to the gRNA; RuvC cuts the non-complementary strand. Both cuts occur within 3 bp of the PAM, creating a blunt double-strand break (DSB).
6
Cell detects DSB and repairs it
The cell's own repair machinery takes over. The outcome depends on which pathway runs — NHEJ or HDR.

DNA Repair: NHEJ vs HDR

When Cas9 cuts the DNA, the cell's own repair mechanisms seal the break. The outcome of editing depends entirely on which repair pathway activates:

Pathway Mechanism Result Use for
NHEJ
Non-Homologous End Joining
Ligates the two ends directly without a template. Prone to insertions or deletions (indels) of 1–20+ nucleotides. Disrupted (frameshifted / truncated) gene — usually a knockout Disabling a gene; modelling loss-of-function disease
HDR
Homology-Directed Repair
Uses a provided DNA template with matching flanking sequences ("homology arms") to rewrite the cut site precisely. Precise edit: base change, insertion of sequence, correction of mutation Correcting a disease mutation; inserting a reporter gene

NHEJ is active in all cell types and most of the cell cycle. HDR is much more limited — it only operates in S and G2 phases (when a sister chromatid is available as a repair template) and is generally 10–100× less efficient than NHEJ. Researchers use various tricks (small molecules, cell synchronisation, modified DNA templates) to bias cells toward HDR when a precise edit is needed.

Knockouts are easy; corrections are hard: Disrupting a gene with NHEJ is routine and efficient. Correcting a pathogenic point mutation via HDR — exactly what gene therapy requires — typically achieves 1–5% efficiency in primary human cells. This is why base editing and prime editing (Section 9) are so important clinically.

Delivery: Getting CRISPR into Cells

The CRISPR components (Cas9 protein + gRNA, or their encoding DNA/RNA) must cross the cell membrane and reach the nucleus. Delivery is one of the biggest engineering challenges in the field:

Off-Target Effects and Specificity

Cas9 can tolerate mismatches between the guide RNA and the DNA target — particularly in the seed region (positions 1–12, distal from PAM), which is less critical for binding. A guide RNA can therefore cause unintended cuts at off-target sites with similar sequences elsewhere in the genome.

Mitigation strategies:

Applications

🩺
Gene Therapy
Casgevy (2023, FDA/EMA approved) corrects sickle-cell disease and β-thalassemia by reactivating fetal haemoglobin in the patient's own blood stem cells.
🌾
Agriculture
CRISPR-edited crops with enhanced disease resistance, drought tolerance, or reduced allergens (e.g. low-gluten wheat, non-browning mushrooms) reach markets without foreign gene insertion.
🦟
Gene Drive
Self-propagating edits that spread through a wild population within a few generations — proposed to eliminate malaria-carrying mosquitoes or invasive species. Requires extreme biosafety oversight.
🔬
Basic Research
Creating cell line and animal models of disease; functional screens (CRISPR libraries targeting every gene to find cancer dependencies); epigenome modulation with dead Cas9 (dCas9).
🧪
Diagnostics
SHERLOCK and DETECTR use Cas12/Cas13 collateral cleavage to detect pathogen nucleic acids (COVID-19, malaria) at attomolar sensitivity — fieldable with lateral-flow readout.
🐷
Xenotransplantation
Pig organs with 69 genomic edits (porcine endogenous retroviruses knocked out + human immune compatibility genes added) have been transplanted into human patients in 2023–2025 trials.

Next-Generation Tools: Base Editing and Prime Editing

Classical CRISPR-Cas9 causes double-strand breaks, which are repaired imprecisely or require an HDR template. Newer tools avoid cuts entirely:

Base Editing (2016 — David Liu's lab)

A catalytically-impaired Cas9 ("nickase" or "dead" Cas9) is fused to a deaminase enzyme. The complex localises to the target but instead of cutting, directly converts one base into another within a 4–8 nucleotide editing window:

Together they cover all four transition mutations (C→T, T→C, A→G, G→A), which account for about 30% of known pathogenic point mutations.

Prime Editing (2019 — David Liu's lab)

Prime editing uses a nicking Cas9 fused to a reverse transcriptase. A special guide RNA (pegRNA) encodes both the targeting sequence and the desired edit. The reverse transcriptase uses this as a template to write new sequence directly into the genome — without a DSB and without needing an external repair template. Prime editing can in principle make all 12 types of point mutations plus small insertions and deletions, with fewer off-target effects than Cas9.

Where we are now (2026): CRISPR-based medicines in clinical trials target sickle-cell disease, β-thalassemia, TTR amyloidosis, Duchenne muscular dystrophy, several cancers, and acute myeloid leukaemia. The first in vivo base editing therapy for a hereditary condition completed Phase 1 trials in 2025. The pace of translation from basic discovery to clinic has been unprecedented in the history of medicine.