---
# edu-abdu
title: 'Write §10: Cross-entropy loss and the training loop'
status: completed
type: task
priority: normal
created_at: 2026-03-13T22:02:02Z
updated_at: 2026-03-16T02:30:26Z
parent: edu-u2w7
---

Next-token prediction loss: cross-entropy over the vocab. Adam optimiser. Training loop structure: batch → forward → loss → backward → step. No bells and whistles.
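
As a rough illustration of the scope described above, here is a minimal sketch of the loop in plain Python. The task does not name a model or framework, so everything here is an assumption for illustration: the bigram logit table `W`, the hypothetical `Adam` class and `train` function, and the hyperparameters. It is meant to show the shape of batch → forward → loss → backward → step, not the section's actual implementation.

```python
import math
import random

def softmax(logits):
    # Subtract the max for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target):
    # Negative log-probability of the target token under softmax(logits),
    # computed as log-sum-exp(logits) - logits[target].
    m = max(logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_z - logits[target]

class Adam:
    # Hypothetical minimal Adam for a 2-D list of parameters (assumed defaults).
    def __init__(self, n_rows, n_cols, lr=0.05, b1=0.9, b2=0.999, eps=1e-8):
        self.lr, self.b1, self.b2, self.eps = lr, b1, b2, eps
        self.m = [[0.0] * n_cols for _ in range(n_rows)]  # first-moment estimate
        self.v = [[0.0] * n_cols for _ in range(n_rows)]  # second-moment estimate
        self.t = 0

    def step(self, params, grads):
        self.t += 1
        for i, row in enumerate(grads):
            for j, g in enumerate(row):
                self.m[i][j] = self.b1 * self.m[i][j] + (1 - self.b1) * g
                self.v[i][j] = self.b2 * self.v[i][j] + (1 - self.b2) * g * g
                m_hat = self.m[i][j] / (1 - self.b1 ** self.t)  # bias correction
                v_hat = self.v[i][j] / (1 - self.b2 ** self.t)
                params[i][j] -= self.lr * m_hat / (math.sqrt(v_hat) + self.eps)

def train(tokens, vocab_size, steps=200, batch_size=8, seed=0):
    rng = random.Random(seed)
    # Assumed toy model: W[prev] holds the logits for the next token.
    W = [[0.0] * vocab_size for _ in range(vocab_size)]
    opt = Adam(vocab_size, vocab_size)
    pairs = list(zip(tokens[:-1], tokens[1:]))  # (current token, next token)
    losses = []
    for _ in range(steps):
        # batch: sample (prev, next) pairs with replacement
        batch = [rng.choice(pairs) for _ in range(batch_size)]
        grads = [[0.0] * vocab_size for _ in range(vocab_size)]
        loss = 0.0
        for prev, nxt in batch:
            logits = W[prev]                                  # forward
            loss += cross_entropy(logits, nxt) / batch_size   # loss (batch mean)
            probs = softmax(logits)
            for k in range(vocab_size):                       # backward
                # d(loss)/d(logit_k) = softmax_k - 1[k == target]
                onehot = 1.0 if k == nxt else 0.0
                grads[prev][k] += (probs[k] - onehot) / batch_size
        opt.step(W, grads)                                    # step
        losses.append(loss)
    return W, losses

# Toy data: the repeating sequence 0, 1, 2, 0, 1, 2, ...
tokens = [0, 1, 2] * 4
W, losses = train(tokens, vocab_size=3)
```

With all-zero logits the first batch loss equals `log(vocab_size)`, which is a handy sanity check before training any next-token model. In a real implementation the hand-written gradient and optimiser would normally be replaced by a framework's autodiff and built-in Adam.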