You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
12 lines
341 B
Markdown
12 lines
341 B
Markdown
---
|
|
# edu-s6mr
|
|
title: 'Write §5: Self-attention — queries, keys, and values'
|
|
status: todo
|
|
type: task
|
|
created_at: 2026-03-13T22:01:53Z
|
|
updated_at: 2026-03-13T22:01:53Z
|
|
parent: edu-u2w7
|
|
---
|
|
|
|
Derive the scaled dot-product attention formula from first principles. Single-head attention only (GPT-1 simplicity). Causal masking explained here.
|