This presentation discusses Workday's use of Apache Spark for self-service data preparation and analytics within its SaaS platform. It covers Workday's unified analytics platform powered by Spark, how Prism uses Spark for interactive data prep and publishing, and lessons learned in areas like nested SQL optimization, plan deduplication, broadcast join tuning, and case-insensitive string grouping. The presentation aims to share Workday's production experiences leveraging Spark for analytics in a multi-tenant SaaS environment.
Related topics: