Attention scores measure compatibility between query and key. Higher score means more relevant.
Softmax converts scores to weights between and that sum to . A weight of on position means that position contributes % of the information for the current output.
These weights are what you see in attention visualizations. They show what the model "looks at" when processing each token.