Scalability & High Availability
Table of Contents
Scalability #
Scalability means that an application / infrastructure can handle greater loads by adapting.
Two kinds of scalability #
- Vertical Scalability
- Increasing the size of an instance
- Very common for non-distributed systems, i.e. Databases
- Hardware limits apply
- Horizontal Scalability (Elasticity)
- Increasing the number of instances
- Implies Distributed Systems
- Very common for web applications or modern applications
- Auto Scaling Groups
- For Horizontal Scaling (increasing the number of instances) we use Auto Scaling Groups and a Load Balancer
High Availability #
- High Availability usually goes hand in hand with horizontal scaling
- High Availability means running application / infrastructure in at least 2 Availability Zones
- Goal of High Availability is to survive a data center loss / disaster
High Availability is achieved by running Auto Scaling Groups (ASG) as well as Load Balancer in multi-AZ mode. #
» References « #
» Table of contents (CLF-C02) « #
» Disclaimer « #
Disclaimer: Content for educational purposes only, no rights reserved.
Most of the content in this series is coming from Stephane Maarek’s Ultimate AWS Certified Cloud Practitioner CLF-C02 2025 course on Udemy.
I highly encourage you to take the Stephane’s courses as they are awesome and really help understanding the subject.
More about Stephane Maarek:
This article is just a summary and has been published to help me learning and passing the practitioner exam.