This document discusses real-time anomaly detection using Spark MLlib, Akka and Cassandra. It describes using clustering algorithms like k-means and DBSCAN on streaming data to identify anomalous patterns in venues' visitor patterns and users' check-in locations. The Akka-Cassandra-Spark stack is proposed to enable fast writes to Cassandra, distributed and scalable computing using Spark, and real-time processing of streaming data using Akka actors and pipelines.
Related topics: