InfraLens

Lab 01: Tokenization and Embedding Lookup

This folder contains annotated reading material for the lab page.

Reading focus

Map raw text to token ids, then to embedding rows. Notice that model code sees integer ids, not words.

Source of truth

Running code is optional. The expected outcome is that you can explain the mechanism, the relevant state/shape, the common misunderstanding, and the interview answer.