You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
vibed/edu/.beans/edu-ujs5--write-9-exercise-...

411 B

title status type priority created_at updated_at parent
Write §9: Exercise 3 — define the GPT-1-style model in candle completed task normal 2026-03-13T22:02:00Z 2026-03-16T02:30:26Z edu-u2w7

Full model struct in candle: embedding, N transformer blocks, layer norm, unembedding. Hyperparams close to GPT-1 mini (e.g. 24 layers, d_model=128). Reader assembles the forward pass.