Cassandra is a distributed database designed to handle large amounts of data across many servers. It provides high availability with no single point of failure and linear scalability. Data is distributed across nodes and replicated for fault tolerance. Writes are fast by using an append-only commit log and can be configured for different consistency levels.