A single database server has limits. Storage maxes out. CPU can't handle the query load. Writes become a bottleneck.
Read replicas help with reads. But they don't help with writes or storage. When you have TB of data and writes per second, you need to split the data across multiple databases.
This is sharding: dividing your data horizontally across multiple database servers. I'll cover sharding strategies and their trade-offs.