This document summarizes a presentation given by Jerome Boulon, CEO of CaliStream.com, about their data streaming platform. CaliStream provides a schema-less data processing pipeline to stream large volumes of event data from applications to Hadoop/Hive easily and without prior Hadoop knowledge. The presentation covered challenges with traditional data pipelines, how CaliStream addresses these through its SaaS offering, examples of real-time stream analysis using Apache Samza, and new features for Samza.
Related topics: