Attention applies three learned transformations to each position's input embedding:
- Query (Q): "What am I looking for?" The current position's search query.
- Key (K): "What do I contain?" Each position's searchable label.
- Value (V): "What information do I provide?" The actual content to retrieve.
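Queries, keys, and values are linear projections of the input embeddings: each one is the embedding multiplied by its own weight matrix. The model learns these weight matrices during training, which is how it comes to produce useful queries, keys, and values.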
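To make the "linear projection" idea concrete, here is a minimal sketch in plain Python. The names `matmul`, `project_qkv`, `random_matrix`, `w_q`, `w_k`, `w_v`, and the sizes `seq_len`, `d_model`, `d_head` are illustrative assumptions, not part of any particular library. The weights are random stand-ins for what training would learn, and bias terms are omitted for brevity.

```python
import random

def matmul(a, b):
    """Multiply an (n x m) matrix by an (m x p) matrix, both given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def project_qkv(x, w_q, w_k, w_v):
    """Project input embeddings into queries, keys, and values.

    x has one row per position; each weight matrix maps an embedding
    of size d_model to a vector of size d_head.
    """
    return matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)

def random_matrix(rows, cols, rng):
    """Hypothetical helper: a matrix of Gaussian values standing in for learned weights."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
seq_len, d_model, d_head = 4, 8, 3  # assumed sizes, chosen only for illustration

x = random_matrix(seq_len, d_model, rng)    # input embeddings, one row per position
w_q = random_matrix(d_model, d_head, rng)   # query projection weights
w_k = random_matrix(d_model, d_head, rng)   # key projection weights
w_v = random_matrix(d_model, d_head, rng)   # value projection weights

q, k, v = project_qkv(x, w_q, w_k, w_v)

# Every position gets its own query, key, and value vector of length d_head.
print(len(q), len(q[0]))
```

The same input `x` feeds all three projections; only the weight matrices differ, which is why the three roles can specialize during training.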