Dictionary
Scalability
Scalability is the ability of a system, application, or business to handle growing amounts of work, traffic, or demand without compromising performance. In web development, this means an application serving 100 users should be architecturally capable of serving 100,000 users without a complete rewrite, even if infrastructure must be added along the way.
There are two fundamental approaches: vertical scaling adds more power to existing servers, and horizontal scaling adds more servers behind a load balancer. Modern cloud-native architectures favor horizontal scaling because it avoids single points of failure. Techniques that support scalability include database read replicas, caching layers with Redis or Memcached, CDN distribution for static assets, stateless application design, and message queues for decoupling services. Designing for scalability from the outset prevents costly re-architecture later, though over-engineering for scale that may never arrive wastes resources equally.