The document is a presentation on new features in Hadoop 2. Some key highlights include:
- Hadoop 2 introduces NameNode high availability to address single point of failure through an active-passive setup using shared storage.
- Federation allows spreading metadata over multiple NameNodes for very large clusters.
- Snapshots provide point-in-time copies of data for backup and recovery from deletes or disasters.
- YARN separates processing from resource management, allowing various types of applications beyond batch processing.