High Availability

The ability for a system to remain available

When an AZ becomes unavailable eg. data-center flooded
When a Region becomes unavailable eg. meteor strike
When an web-application becomes unresponsive eg. too much traffic
When an instance becomes unavailable eg. instance failure
When a web application becomes unresponsive due to distance in
geographic location

We should run our instances in Multi-AZ, an Elastic Load Balancer can route traffic to operational AZs.
We should run instances in another region. We can route traffic to another Region via Route53
We should use Auto Scaling Groups to increase the amount of instances to meet the demand of traffic
We should use Auto Scaling Groups to ensure a minimum amount of instances are running and have ELB route traffic to healthy instances
We should use CloudFront to cache static content for faster delivery in nearby regions. We can also run our instances in nearby regions and route traffic using a geolocation policy in Route53

Scale Up vs Scale Out

When utilization increases and we are reaching capacity we can:

Increasing the size of instances
- Simpler to manage.
- Lower availability (if a single instance
fails service becomes unavailable)

Adding more of the same
- More complexity to manage.
- Higher availability (if a single instance
fail it doesn't matter)

You will generally want to scale out and then up to balance complexity vs availability