This document discusses features of Apache Spark on Azure HDInsight including a new Spark IO cache that provides significant performance improvements of up to 9x for Spark queries. It also discusses other HDInsight features like Hive LLAP for interactive querying, data analytics templates, and tools for Spark job debugging and diagnosis. Azure HDInsight is presented as a secure, managed Hadoop and Spark cloud platform for building data lakes on Azure.