This document provides an overview of Apache Cassandra concepts including its architecture, data model, data distribution techniques, and configuration options. Key points covered include:
- Cassandra uses a wide-column (partitioned row store) data model, often loosely described as column-oriented, and provides high availability, scalability, and predictable performance.
- It uses a decentralized architecture with nodes organized into racks and data centers. Data is replicated across multiple nodes for fault tolerance.
- Cassandra Query Language (CQL) is used to define keyspaces, tables, columns, primary keys, and replication strategies.
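- Consistency levels can be set per read and write operation to balance availability against consistency. Seed nodes and coordinator nodes play different roles: a seed node is a contact point that joining nodes use to discover the cluster through gossip, while a coordinator is whichever node receives a client request and forwards it to the replicas.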
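
The trade-off between availability and consistency mentioned above can be made concrete with a little arithmetic. The following is a minimal sketch, not Cassandra's own code: it assumes a single data center, covers only the ONE, TWO, THREE, QUORUM, and ALL levels, and ignores mechanisms such as hinted handoff and read repair. The function names are hypothetical and chosen for illustration. It relies on the commonly stated rule that a read is guaranteed to overlap the latest successful write when the number of replicas read plus the number of replicas written exceeds the replication factor.

```python
def quorum(replication_factor: int) -> int:
    """Replicas needed for QUORUM: a strict majority of all replicas."""
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    return replication_factor // 2 + 1


def replicas_required(level: str, replication_factor: int) -> int:
    """Map a consistency level name to the number of replicas that must respond.

    Illustrative helper (hypothetical name); handles single-data-center levels only.
    """
    fixed_levels = {"ONE": 1, "TWO": 2, "THREE": 3}
    name = level.upper()
    if name in fixed_levels:
        needed = fixed_levels[name]
    elif name == "QUORUM":
        needed = quorum(replication_factor)
    elif name == "ALL":
        needed = replication_factor
    else:
        raise ValueError(f"unsupported consistency level: {level}")
    if needed > replication_factor:
        raise ValueError(f"{name} needs more replicas than the replication factor")
    return needed


def reads_see_latest_write(read_level: str, write_level: str,
                           replication_factor: int) -> bool:
    """True when the read and write replica sets must overlap (R + W > RF)."""
    r = replicas_required(read_level, replication_factor)
    w = replicas_required(write_level, replication_factor)
    return r + w > replication_factor


if __name__ == "__main__":
    rf = 3
    for read, write in [("ONE", "ONE"), ("QUORUM", "QUORUM"), ("ONE", "ALL")]:
        overlap = reads_see_latest_write(read, write, rf)
        print(f"RF={rf} read={read} write={write} overlap guaranteed: {overlap}")
```

With a replication factor of 3, QUORUM reads and QUORUM writes each touch 2 replicas, so they always share at least one replica, whereas ONE/ONE can miss the latest write. Stricter levels buy that guarantee at the cost of availability, since more replicas must be reachable for the operation to succeed.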