Chapter 2 · Models
Prefill is compute-bound. Decode is memory-bound. One model, two profiles.
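A rough back-of-the-envelope sketch of why the two phases differ, using a simplified roofline model. It assumes each weight costs about two FLOPs per token (one multiply, one add), that weights are stored in 2 bytes each, and that weight reads dominate memory traffic (KV-cache and activation traffic are ignored). The function names and the accelerator figures are hypothetical, chosen only for illustration.

```python
def arithmetic_intensity(tokens_per_pass: int, bytes_per_param: float = 2.0) -> float:
    """FLOPs performed per byte of weights read in one forward pass.

    Simplification: only weight traffic is counted, so the parameter
    count cancels out and intensity depends on tokens per pass alone.
    """
    flops_per_param = 2.0 * tokens_per_pass  # multiply + add per weight per token
    return flops_per_param / bytes_per_param


def bound(intensity: float, peak_flops: float, mem_bandwidth: float) -> str:
    """Classify a workload against the hardware's roofline ridge point."""
    ridge = peak_flops / mem_bandwidth  # FLOPs/byte needed to saturate compute
    return "compute-bound" if intensity >= ridge else "memory-bound"


# Hypothetical accelerator: 1 PFLOP/s of compute, 3 TB/s of memory bandwidth.
PEAK_FLOPS = 1.0e15
MEM_BANDWIDTH = 3.0e12

prefill = arithmetic_intensity(tokens_per_pass=2048)  # whole prompt in one pass
decode = arithmetic_intensity(tokens_per_pass=1)      # one new token per pass

print("prefill:", prefill, bound(prefill, PEAK_FLOPS, MEM_BANDWIDTH))
print("decode: ", decode, bound(decode, PEAK_FLOPS, MEM_BANDWIDTH))
```

Under these assumptions, prefill amortizes each weight read over thousands of prompt tokens, while single-sequence decode re-reads every weight to produce one token. Batching decode requests raises its intensity, which this sketch does not model.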