---
# edu-ujs5
title: 'Write §9: Exercise 3 — define the GPT-1-style model in candle'
status: todo
type: task
created_at: 2026-03-13T22:02:00Z
updated_at: 2026-03-13T22:02:00Z
parent: edu-u2w7
---

Full model struct in candle: embedding, N transformer blocks, layer norm, unembedding. Hyperparams close to GPT-1 mini (e.g. 2–4 layers, d_model=128). Reader assembles the forward pass.
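
A rough sizing sketch for the exercise, in plain Rust with no candle dependency, so it only counts parameters and does not define the actual model struct. Only `n_layers` and `d_model` come from the task; `MiniGptConfig`, `mini`, the byte-level `vocab_size`, `context_len`, the 4x MLP width, learned position embeddings, and an untied bias-free unembedding are all assumptions made for illustration.

```rust
/// Hypothetical hyperparameters for the mini model.
/// Only `d_model` and `n_layers` are taken from the task description.
#[derive(Debug, Clone, Copy)]
pub struct MiniGptConfig {
    pub vocab_size: usize,  // assumption: byte-level vocabulary
    pub context_len: usize, // assumption: learned position embeddings of this length
    pub d_model: usize,     // from the task: 128
    pub n_layers: usize,    // from the task: 2 to 4
    pub ffn_mult: usize,    // assumption: MLP hidden width = ffn_mult * d_model
}

impl MiniGptConfig {
    /// Parameters in one transformer block (weights and biases).
    pub fn block_params(&self) -> usize {
        let d = self.d_model;
        let h = self.ffn_mult * d;
        // Q, K, V projections plus the attention output projection.
        let attn = 3 * (d * d + d) + (d * d + d);
        // Two-layer MLP: d -> h -> d.
        let mlp = (d * h + h) + (h * d + d);
        // Two layer norms, each with a scale and a shift vector.
        let norms = 2 * (2 * d);
        attn + mlp + norms
    }

    /// Parameters in the whole model: embeddings, blocks, final norm, unembedding.
    pub fn total_params(&self) -> usize {
        let d = self.d_model;
        let tok_embed = self.vocab_size * d;
        let pos_embed = self.context_len * d;
        let final_norm = 2 * d;
        // Assumption: untied unembedding without bias.
        let unembed = d * self.vocab_size;
        tok_embed + pos_embed + self.n_layers * self.block_params() + final_norm + unembed
    }
}

/// Convenience constructor using the assumed defaults above.
pub fn mini(n_layers: usize) -> MiniGptConfig {
    MiniGptConfig {
        vocab_size: 256,
        context_len: 128,
        d_model: 128,
        n_layers,
        ffn_mult: 4,
    }
}
```

Under these assumed values, `mini(4).total_params()` comes to 875,264 and `mini(2).total_params()` to 478,720, which gives a sense of how small the exercise model is. Tying the unembedding to the token embedding would remove `d_model * vocab_size` parameters from the total.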