A single server has limits. It can handle only a finite number of concurrent connections before performance degrades. Add more users, and response times climb. Add even more, and the server crashes.
Load balancers solve this by spreading requests across multiple servers. If one server can handle N users, k servers behind a load balancer can handle roughly k × N. Need more capacity? Add more servers.
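To make the idea of spreading requests concrete, here is a minimal sketch of round-robin distribution, one common strategy among several. The server names and the `round_robin` helper are hypothetical, chosen only for illustration; a real load balancer works on network connections rather than Python lists.

```python
from collections import Counter
from itertools import cycle


def round_robin(servers, num_requests):
    """Assign each request to the next server in rotation."""
    rotation = cycle(servers)
    return [next(rotation) for _ in range(num_requests)]


# Hypothetical backend names, used only for this example.
servers = ["app-1", "app-2", "app-3"]

# Distribute nine requests and count how many each server received.
assignments = round_robin(servers, 9)
counts = Counter(assignments)
print(counts)
```

With nine requests and three servers, each server receives three requests, so no single machine carries the whole load.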
This approach, called horizontal scaling, is typically cheaper and more reliable than buying one massive server. If one server fails, the others keep running.
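The reliability benefit can be sketched the same way. The example below assumes the load balancer already knows which servers are healthy (in practice this usually comes from periodic health checks) and simply skips any that are not. The `pick_server` function and the server names are hypothetical.

```python
def pick_server(servers, healthy, start):
    """Return the first healthy server at or after index `start`, wrapping around."""
    n = len(servers)
    for offset in range(n):
        candidate = servers[(start + offset) % n]
        if candidate in healthy:
            return candidate
    raise RuntimeError("no healthy servers available")


# Hypothetical backend names; app-2 is assumed to have failed.
servers = ["app-1", "app-2", "app-3"]
healthy = {"app-1", "app-3"}

# Route six requests; each one starts its search at the next position.
routed = [pick_server(servers, healthy, i) for i in range(6)]
print(routed)
```

Requests that would have gone to the failed server fall through to the next healthy one, so the service stays up, although the remaining servers each absorb more traffic.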