This folder contains annotated reading material for the lab page.
Read tensor parallelism as splitting one layer’s matrix multiply across ranks.
Running code is optional. The expected outcome is that you can explain the mechanism, the relevant state/shape, the common misunderstanding, and the interview answer.