Each model family uses special tokens to mark structure:
- BOS (Beginning of Sequence): Marks sequence start
- EOS (End of Sequence): Signals when to stop generating
- PAD: Fills sequences to equal length in batches
Incorrect special tokens cause bizarre behavior. The model might not stop generating or ignore your formatting. Always match the base model's special tokens exactly.