You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
389 B
389 B
| title | status | type | created_at | updated_at | parent |
|---|---|---|---|---|---|
| Write §9: Exercise 3 — define the GPT-1-style model in candle | todo | task | 2026-03-13T22:02:00Z | 2026-03-13T22:02:00Z | edu-u2w7 |
Full model struct in candle: embedding, N transformer blocks, layer norm, unembedding. Hyperparams close to GPT-1 mini (e.g. 2–4 layers, d_model=128). Reader assembles the forward pass.