This document provides an overview of Hadoop and related Apache projects. It begins with an introduction to Hadoop, explaining why it was created and who uses it. It then discusses HDFS and its goals of storing huge datasets across commodity hardware. Key components of HDFS like the NameNode, DataNodes and block placement are described. The document also covers MapReduce and provides an example word count algorithm. Finally, it briefly introduces related Apache projects like Pig, HBase, Hive and Zookeeper that build upon Hadoop.