Production models need safety measures:
- Input filtering: Block obviously harmful requests
- Output filtering: Catch harmful responses before serving
- Rate limiting: Prevent abuse
- User feedback: Let users report problems
Safety isn't just alignment. Layered defenses protect against edge cases the model misses.