Blog — Callum Young

April 15, 2026
Transformers, end-to-end from scratch
Re-deriving attention, KV cache, and positional encodings — and what each design choice actually costs.
May 1, 2026
Harness vs harness: why agent scaffolds differ so much at coding
Same model, very different results. Comparing planner/executor splits, tool grammars, and memory schemes — and what actually moves the score.
Upcoming
Speculative decoding and draft models
How draft-then-verify changes the latency/cost frontier, and where it stops paying off.
Upcoming
Tool-use traces as a training signal
What we can and can’t learn from agent trajectories — supervised, preference, and RL angles.