This document discusses interactive data visualization powered by Spark streaming. It describes how Zoomdata allows users to visualize streaming data in real-time as new data is delivered. The key challenges of streaming data like time, frequency, retention and synchronization are addressed. Zoomdata receives streaming data via Kafka or JMS, processes it using Spark Streaming in a single JVM, and stores the data in buffers like MongoDB. This allows for interactive data visualizations that update in real-time as new streaming data is processed. The document also outlines technologies used, how the system scales out, benefits, and includes a demo of streaming data from Twitter to MemSQL and Solr sinks using Spark Streaming.
Related topics: