Learning timeline

Deep dives I'm working through to build deeper ML intuition. A recurring thread: how agent harnesses orchestrate models for programming, and why some setups produce dramatically better results than others using the same underlying model.

  1. Upcoming

    Speculative decoding and draft models

    How draft-then-verify changes the latency/cost frontier, and where it stops paying off.

  2. Upcoming

    Tool-use traces as a training signal

    What we can and can’t learn from agent trajectories — supervised, preference, and RL angles.