| title | status | type | created_at | updated_at | parent |
|---|---|---|---|---|---|
| Write §2: Character-level tokenisation | todo | task | 2026-03-13T22:01:48Z | 2026-03-13T22:01:48Z | edu-u2w7 |
Explain BPE vs. byte-level vs. character-level tokenisation. Motivate character-level as the simplest choice for a from-scratch exercise. Show vocabulary construction.
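
A possible starting point for the vocabulary-construction part, as a minimal sketch only. The helper names (`build_char_vocab`, `encode`, `decode`, `stoi`, `itos`) and the toy corpus are assumptions for illustration, not something this task prescribes.

```python
def build_char_vocab(text):
    """Map each distinct character in `text` to an integer id, and back."""
    chars = sorted(set(text))  # sorted so id assignment is deterministic
    stoi = {ch: i for i, ch in enumerate(chars)}  # character -> id
    itos = {i: ch for ch, i in stoi.items()}      # id -> character
    return stoi, itos


def encode(text, stoi):
    """Turn a string into a list of ids (KeyError on unseen characters)."""
    return [stoi[ch] for ch in text]


def decode(ids, itos):
    """Turn a list of ids back into a string."""
    return "".join(itos[i] for i in ids)


corpus = "hello world"  # toy corpus, assumed for illustration
stoi, itos = build_char_vocab(corpus)
ids = encode("hello", stoi)

print(len(stoi))          # 8 distinct characters, including the space
print(ids)                # [3, 2, 4, 4, 5]
print(decode(ids, itos))  # hello
```

The vocabulary here is just the set of characters seen in the corpus, which is what makes this the simplest option: there are no merge rules to learn, unlike BPE. The cost is longer sequences and no handling of characters absent from the corpus.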