--- # edu-abdu title: 'Write §10: Cross-entropy loss and the training loop' status: todo type: task created_at: 2026-03-13T22:02:02Z updated_at: 2026-03-13T22:02:02Z parent: edu-u2w7 --- Next-token prediction loss: cross-entropy over the vocab. Adam optimiser. Training loop structure: batch → forward → loss → backward → step. No bells and whistles.