Learning LLMs

Mostly notes to my future self. The plan is to go deep enough on the fundamentals that the rest stops feeling like magic.

Phase 1 · Pre-training

The architectural foundations of a base model. Tokens, embeddings, associations, attention, the transformer.

How a base model's parameters actually get learned. Loss, gradient descent, backprop, scaling laws, training at scale.

How a base model becomes a useful assistant. SFT, RLHF, DPO, constitutional methods, reasoning-model RL.

Using a trained model as a developer. Sampling, system prompts, tool use, structured output, context management.

The protocol connecting LLMs to the world. What it is, how it works, building it, where it goes.

AI-native software development. The agentic landscape, sandboxing, niche tools, autonomous workflows.