A patient, step-by-step walk through the one idea behind every modern language model — from a single dot product all the way to the full Transformer block, with live math on every page.