In-silico Chromatin Boundary Engineering at the Unc5b Locus
Three AI-driven validation experiments probe the Unc5b CTCF cluster — a convergent insulator array on mouse chromosome 10 — using the AlphaGenome sequence-to-3D model (Mouse ESC context).
chr10:60,240,000–61,288,576
1,048,576 bp (2²⁰)
Mouse ESC
EFO:0004038
AlphaGenome
Mus musculus
9 total
1 WT + 3 del + 5 ins
A single WT prediction confirms AlphaGenome correctly identifies the Unc5b boundary. The three-panel figure shows the predicted contact heatmap, CTCF ChIP-seq signal, and insulation score track with peak annotations.
Three loss-of-function variants quantify the contribution of individual CTCF sites to boundary strength. The metric used is Δ insulation score at the boundary bin: a positive value means the score increased (more cross-boundary contacts leaked through), i.e. the boundary weakened.
| Variant | Description | Δ Insulation | Observed effect |
|---|---|---|---|
| del_2ctcf | Delete 2 weakest CTCF peaks | +0.006 | Negligible — boundary intact |
| del_4ctcf | Delete all 4 weakest peaks | +0.268 | Major collapse — TADs merge |
| flip_orient | Flip strongest peak to divergent | +0.000 | No detectable change |
Convergent CTCF pairs (FWD–REV) are inserted into the lowest-signal gap within the cluster. Five designs are ranked by Δ insulation at the boundary bin (negative = stronger than WT, since the insulation score is a cross-boundary contact mean and a lower value means better separation).
| Design | Motif | Position | Δ Insulation |
|---|---|---|---|
| 1pair_at_gap | 1× FWD–REV pair (38 bp) | Gap centre | ~0 |
| 2pair_at_gap | 2× FWD–REV pairs | Gap centre | ~0 |
| 3pair_at_gap | 3× FWD–REV pairs | Gap centre | ~0 |
| 1pair_5kb_left | 1× FWD–REV pair | Gap − 5 kb | ~0 |
| 1pair_5kb_right | 1× FWD–REV pair | Gap + 5 kb | ~0 |