How to Set Up a Highly Available Multi-AZ Cassandra Cluster on AWS EC2

Thierry S.
Published August 1, 2016 Updated October 9, 2019

Originally developed at Facebook and released as open source in 2008, Apache Cassandra is a free distributed database designed to handle large amounts of data across many servers.

Cassandra at Stream

At Stream, we use Cassandra as the primary data store for our feeds. Cassandra stands out because it’s able to:

  • Shard data automatically
  • Handle partial outages without data loss or downtime
  • Scale (almost) linearly

If you’re already using Cassandra, your cluster is likely configured to handle the loss of one or two nodes. But what happens when an entire availability zone goes down?
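To make the question concrete, here is a minimal sketch of the arithmetic involved. It assumes a replication factor of 3 and QUORUM reads and writes, which are illustrative choices rather than settings taken from this post; the function names and availability zone names are hypothetical.

```python
from typing import Sequence


def quorum(replication_factor: int) -> int:
    """Number of replicas that must respond for a QUORUM read or write."""
    return replication_factor // 2 + 1


def survives_az_outage(replica_azs: Sequence[str], failed_az: str) -> bool:
    """True if QUORUM is still reachable once every replica in failed_az is lost."""
    remaining = sum(1 for az in replica_azs if az != failed_az)
    return remaining >= quorum(len(replica_azs))


# Hypothetical replica placements for a single partition (replication factor 3).
spread = ["us-east-1a", "us-east-1b", "us-east-1c"]  # one replica per zone
packed = ["us-east-1a", "us-east-1a", "us-east-1b"]  # two replicas share a zone

print(survives_az_outage(spread, "us-east-1a"))  # True: 2 of 3 replicas remain
print(survives_az_outage(packed, "us-east-1a"))  # False: only 1 of 3 remains
```

Under these assumptions, tolerating the loss of a node or two is not enough on its own: what matters is whether the replicas of each partition are spread across zones.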

Stream wrote a guest post for HighScalability that explains how to set up Cassandra to survive a full availability zone outage.