You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
| title |
status |
type |
created_at |
updated_at |
parent |
| Write §6: The Transformer block |
todo |
task |
2026-03-13T22:01:55Z |
2026-03-13T22:01:55Z |
edu-u2w7 |
Attention sublayer + 2-layer feed-forward network + residual connections + layer norm. Describe the GPT-1 block layout. Diagrams encouraged.