InfraLens

Lab 08: KV Cache Step-by-Step

This folder contains annotated reading material for the lab page.

Reading focus

Read prefill and decode as two phases of the same cache contract.

Source of truth

Running code is optional. The expected outcome is that you can explain the mechanism, the relevant state/shape, the common misunderstanding, and the interview answer.