Cassandra is an open-source, distributed database management system designed to handle large amounts of data across many commodity servers. It provides high availability with no single point of failure, linear scalability and performance as nodes are added, and transparent elasticity allowing addition or removal of nodes without downtime. Data is partitioned and replicated across nodes using consistent hashing to balance loads and ensure availability in the event of failures. The write path sequentially appends data to commit logs and memtables which are periodically flushed to disk as SSTables, while the read path retrieves data from memtables and SSTables in parallel across replicas.