This document provides an overview of big data and the Hadoop ecosystem. It defines big data as large and complex datasets that are difficult to process using traditional data management tools. Characteristics of big data include volume, variety, velocity and veracity. The document discusses challenges of managing big data and how Hadoop provides solutions through its distributed architecture. It also summarizes some prominent Apache projects in the Hadoop ecosystem like Pig, Hive, Spark and Hbase.