This document provides an overview of developing analytical applications using Hadoop. It discusses how Hadoop allows storing and processing large amounts of data across clusters in a reliable and cost effective manner. It also discusses several frameworks that have been developed on top of Hadoop, including Apache Hive, Spark and GraphLab, to make it easier to develop analytical applications. The document advocates for structuring data in a way that makes sense for the problem and having interactive interfaces to yield more sophisticated answers.