Each one is a small film: a single idea, frame by frame. Most have a step-through viewer so you can stop, read the caption, and move on.
A detailed walkthrough of scaled dot-product self-attention, including Q/K/V projections, score scaling, softmax weights, and multi-head intuition.