Speculative decoding and draft models
How draft-then-verify changes the latency/cost frontier, and where it stops paying off.
Tool-use traces as a training signal
What we can and can’t learn from agent trajectories — supervised, preference, and RL angles.