The document discusses efficient top-k query processing on distributed column-family databases. It begins by introducing top-k queries and their uses, then covers the challenges of naive solutions and prior work based on batch processing. It proposes three algorithms (TPUT, Hybrid Threshold, and KLEE) to enable real-time top-k queries on distributed data in a way that is efficient in memory, bandwidth, and computation. It also discusses implementation considerations for Cassandra's data model and CQL.
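
To make the idea behind the first of these algorithms concrete, here is a minimal sketch of the three-phase structure commonly described for TPUT. It is an illustration under stated assumptions, not the document's implementation: each node is modeled as an in-memory Python dict (a stand-in for a Cassandra node), scores are non-negative, the aggregate is a sum, and the names `tput_top_k` and `kth_largest` are hypothetical.

```python
import heapq
from collections import defaultdict


def kth_largest(values, k):
    """Return the k-th largest value, or 0.0 if fewer than k values exist."""
    ordered = sorted(values, reverse=True)
    return ordered[k - 1] if len(ordered) >= k else 0.0


def tput_top_k(nodes, k):
    """Sketch of TPUT over `nodes`, a list of dicts mapping item -> local score.

    Assumes non-negative scores and a sum aggregate (illustrative assumptions).
    """
    m = len(nodes)
    partial = defaultdict(float)   # item -> sum of scores reported so far
    seen = defaultdict(set)        # item -> indices of nodes that reported it

    # Phase 1: every node reports its local top-k items.
    for i, node in enumerate(nodes):
        for item, score in heapq.nlargest(k, node.items(), key=lambda kv: kv[1]):
            partial[item] += score
            seen[item].add(i)
    # The k-th largest partial sum is a lower bound on the true k-th score.
    threshold = kth_largest(partial.values(), k) / m

    # Phase 2: every node reports all items scoring at least the threshold.
    for i, node in enumerate(nodes):
        for item, score in node.items():
            if score >= threshold and i not in seen[item]:
                partial[item] += score
                seen[item].add(i)
    tau2 = kth_largest(partial.values(), k)
    # Any unreported local score is below the threshold, which bounds each
    # item's true total from above; prune items that cannot reach tau2.
    candidates = [
        item for item in partial
        if partial[item] + threshold * (m - len(seen[item])) >= tau2
    ]

    # Phase 3: fetch exact totals for the surviving candidates only.
    exact = {item: sum(node.get(item, 0) for node in nodes) for item in candidates}
    return heapq.nlargest(k, exact.items(), key=lambda kv: kv[1])


if __name__ == "__main__":
    example_nodes = [
        {"a": 10, "b": 8, "c": 1, "d": 3},
        {"a": 2, "b": 9, "c": 7, "d": 3},
        {"a": 5, "b": 1, "c": 6, "d": 3},
    ]
    print(tput_top_k(example_nodes, 2))
```

In this sketch the coordinator never asks for item `d`, because no node ranks it in its local top-k and no local score reaches the threshold. That pruning is where the bandwidth saving over shipping every row to one place would come from.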