# 3D Parallelism Plan
# Annotated reading material. This file is the source of truth for the lab preview.
# Running it is not required; use it to explain the configuration contract.

world_size: 64
node_shape: 8_nodes_x_8_gpus
parallelism:
  tp_size: 8   # matches the 8 GPUs per node; keep each TP group within one node when possible
  pp_size: 4
  dp_size: 2
reading_note: map each collective to the link it crosses
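#
# Placement sketch (illustrative only; nothing reads these lines). It assumes
# each TP group of 8 ranks fills exactly one node, so the 8 nodes enumerate
# the pp_size * dp_size = 4 * 2 = 8 (stage, replica) pairs. The key names and
# the node ordering are hypothetical; a real launcher may order ranks
# differently.
# placement_sketch:
#   node_0: {pp_stage: 0, dp_replica: 0}
#   node_1: {pp_stage: 0, dp_replica: 1}
#   node_2: {pp_stage: 1, dp_replica: 0}
#   node_3: {pp_stage: 1, dp_replica: 1}
#   node_4: {pp_stage: 2, dp_replica: 0}
#   node_5: {pp_stage: 2, dp_replica: 1}
#   node_6: {pp_stage: 3, dp_replica: 0}
#   node_7: {pp_stage: 3, dp_replica: 1}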

# Reading focus:
# Read a 3D plan as a mapping of DP/FSDP, TP, and PP groups onto the hardware topology.
#
# What to notice:
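# - The parallel degrees must multiply to the world size:
#   tp_size * pp_size * dp_size = 8 * 4 * 2 = 64 = world_size.
# - Placement on the topology matters as much as the degree numbers: the same
#   degrees perform differently depending on which groups share a node.
# - How often a collective runs determines how sensitive it is to the link it
#   crosses, so the most frequent collectives belong on the fastest links.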
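#
# Sketch of the mapping that reading_note asks for (illustrative only). The
# key names are hypothetical, and the link column assumes each TP group of 8
# ranks sits inside one 8-GPU node, which leaves PP and DP traffic to cross
# nodes. The frequency column describes typical transformer tensor
# parallelism, micro-batched pipelining, and plain DP; FSDP would add
# per-layer all-gathers on the DP row.
# collective_map:
#   tp_all_reduce:      {link: intra_node, frequency: per_layer}
#   pp_send_recv:       {link: inter_node, frequency: per_micro_batch}
#   dp_grad_all_reduce: {link: inter_node, frequency: per_step}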
