--- # edu-ujs5 title: 'Write §9: Exercise 3 — define the GPT-1-style model in candle' status: completed type: task priority: normal created_at: 2026-03-13T22:02:00Z updated_at: 2026-03-16T02:30:26Z parent: edu-u2w7 --- Full model struct in candle: embedding, N transformer blocks, layer norm, unembedding. Hyperparams close to GPT-1 mini (e.g. 2–4 layers, d_model=128). Reader assembles the forward pass.