This document discusses Azure HDInsight, which provides managed Hadoop as a service on Microsoft's cloud platform. Key points include:
- Azure HDInsight runs Apache Hadoop and related projects like Hive and Pig in a cloud-based cluster that can be set up in minutes without hardware to deploy or maintain.
- It supports running queries and analytics jobs on data stored locally in HDFS or in Azure cloud storage such as Blob storage and Data Lake Store.
- An IDC study found that Microsoft customers running cloud-based Hadoop on Azure HDInsight have a 63% lower total cost of ownership than they would with an on-premises Hadoop deployment.
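
To make the storage point above concrete, here is a minimal sketch of how a job might address data in each of the three locations. It assumes the URI schemes commonly used by Hadoop on Azure (`hdfs://` for cluster-local HDFS, `wasb://` or `wasbs://` for Blob storage, `adl://` for Data Lake Store). The helper names, account names, containers, and paths are hypothetical and chosen only for illustration; they do not come from the document.

```python
# Sketch only: builds storage URIs of the kind a Hive, Pig, or MapReduce job
# could read from. All function names and example values are hypothetical.

def hdfs_uri(path: str) -> str:
    """URI for data on the cluster's local HDFS (default name node)."""
    return "hdfs:///" + path.lstrip("/")


def blob_uri(container: str, account: str, path: str, secure: bool = True) -> str:
    """URI for data in Azure Blob storage; wasbs is the TLS variant of wasb."""
    scheme = "wasbs" if secure else "wasb"
    return f"{scheme}://{container}@{account}.blob.core.windows.net/" + path.lstrip("/")


def data_lake_uri(account: str, path: str) -> str:
    """URI for data in Azure Data Lake Store."""
    return f"adl://{account}.azuredatalakestore.net/" + path.lstrip("/")


if __name__ == "__main__":
    # Hypothetical account, container, and path names.
    print(hdfs_uri("/example/data/logs"))
    print(blob_uri("mycontainer", "myaccount", "/example/data/logs"))
    print(data_lake_uri("mydatalake", "/example/data/logs"))
```

Because only the URI changes, the same query can in principle be pointed at local or cloud storage without rewriting its logic. The exact schemes available depend on how the cluster is configured.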