This starter is annotated reading material and the source of truth for the lab preview. Running anything is optional; the reading goal is to explain the mechanism without hiding behind a framework call.
Read the whole pipeline as a trace from config to output media.
## End-to-end trace
1. Read config and load components.
2. Encode text/image/video conditions.
3. Initialize or encode latents.
4. Run scheduler loop with denoiser and guidance.
5. Decode latents and postprocess media.