When a system is overwhelmed, back-pressure signals upstream producers to slow down. Without it, queues grow without bound, latency spikes, and the system eventually crashes.
Common implementations: reject requests with HTTP 429 (Too Many Requests), return a busy signal, or use bounded queues that block producers when full. Reactive streams libraries (RxJava, Project Reactor) have built-in back-pressure. Design for overload from the start.
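
As a rough illustration of two of the strategies above, here is a minimal Python sketch built on the standard library's bounded `queue.Queue`. The names `CAPACITY`, `work_queue`, `try_submit`, and `submit_blocking` are hypothetical, and returning 202 for accepted work is an assumption made for the example, not something the text prescribes.

```python
import queue

# Hypothetical capacity, kept tiny so the overflow is easy to see.
CAPACITY = 2
work_queue = queue.Queue(maxsize=CAPACITY)


def try_submit(job):
    """Load shedding: reject immediately when the bounded queue is full."""
    try:
        work_queue.put_nowait(job)
    except queue.Full:
        return 429  # HTTP 429 Too Many Requests
    return 202  # HTTP 202 Accepted (assumed status for queued work)


def submit_blocking(job, timeout=None):
    """Blocking back-pressure: the producer waits for a free slot.

    With timeout=None the call blocks until a consumer makes room.
    With a timeout it gives up and returns False if no slot frees up.
    """
    try:
        work_queue.put(job, timeout=timeout)
    except queue.Full:
        return False
    return True


# Three submissions against a queue of capacity two: the third is rejected.
statuses = [try_submit(n) for n in range(3)]
print(statuses)
```

Rejecting pushes the retry decision back to the caller, while blocking slows the producer down in place. Which one fits depends on whether the producer can afford to wait.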