You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
332 B
332 B
| title | status | type | priority | created_at | updated_at | parent |
|---|---|---|---|---|---|---|
| Write §6: The Transformer block | completed | task | normal | 2026-03-13T22:01:55Z | 2026-03-16T02:30:26Z | edu-u2w7 |
Attention sublayer + 2-layer feed-forward network + residual connections + layer norm. Describe the GPT-1 block layout. Diagrams encouraged.