Michal Hariš provides an overview of the evolution of VisualDNA's data architecture over the past 3 years. Originally, 10 people managed a single MySQL table holding 50M user profiles. They transitioned to using Cassandra and Hadoop to address scalability issues. Currently, they have a 120 person team using a lambda architecture with Java, Scala, Hadoop, Cassandra, Kafka, Redis, R and AngularJS. Real-time processing of 8.5k events/second is done alongside batch pipelines and machine learning. They have learned lessons around system design, testing, and remote collaboration while addressing challenges such as globally distributed APIs and bottlenecks in their data pipeline.