This document provides an overview of Hadoop and HDFS. It defines common terms such as NameNode and DataNode, and describes how data is written to and read from HDFS. It also summarizes how MapReduce works: a problem is broken into smaller subproblems that are distributed to worker nodes, and their results are then combined. Finally, it introduces a Hadoop storage solution from DataLogix that provides scalability and data protection and eliminates single points of failure.
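
To make the MapReduce idea above concrete, here is a minimal single-process sketch in Python. It is not Hadoop's API: the function names (`split_into_chunks`, `map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the word-count task are assumptions chosen for illustration, and the "workers" are simulated by an ordinary loop rather than separate nodes.

```python
from collections import defaultdict


def split_into_chunks(lines, num_workers):
    """Divide the input into one subproblem per (hypothetical) worker."""
    return [lines[i::num_workers] for i in range(num_workers)]


def map_phase(chunk):
    """A worker emits a (word, 1) pair for every word in its own chunk."""
    return [(word.lower(), 1) for line in chunk for word in line.split()]


def shuffle(mapped_outputs):
    """Group the values emitted by all workers by their key."""
    grouped = defaultdict(list)
    for pairs in mapped_outputs:
        for key, value in pairs:
            grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Combine the grouped values for each key into a single result."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines, num_workers=2):
    chunks = split_into_chunks(lines, num_workers)
    # Sequential stand-in for work that a cluster would run in parallel.
    mapped = [map_phase(chunk) for chunk in chunks]
    return reduce_phase(shuffle(mapped))


if __name__ == "__main__":
    print(word_count(["big data big", "data node"]))
```

In a real cluster the map calls would run on different worker nodes, ideally close to the HDFS blocks they read, and the framework would handle the grouping step between map and reduce. The sketch only shows the shape of the computation.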