This document gives an overview of large-scale data analysis using distributed computing frameworks such as MapReduce. It describes MapReduce and related frameworks such as Dryad, along with open-source MapReduce tools including Hadoop, Cloud MapReduce, Elastic MapReduce, and MR.Flow. It also outlines example MapReduce algorithms for tasks such as graph analysis and text indexing and retrieval. It is the first part of a series on large-scale data analysis using distributed frameworks.