You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
411 B
411 B
| title | status | type | priority | created_at | updated_at | parent |
|---|---|---|---|---|---|---|
| Write §9: Exercise 3 — define the GPT-1-style model in candle | completed | task | normal | 2026-03-13T22:02:00Z | 2026-03-16T02:30:26Z | edu-u2w7 |
Full model struct in candle: embedding, N transformer blocks, layer norm, unembedding. Hyperparams close to GPT-1 mini (e.g. 2–4 layers, d_model=128). Reader assembles the forward pass.