This document discusses data discovery on Hadoop using Apache HCatalog. It describes how HCatalog provides a common interface for data access across Hadoop tools like Hive, Pig, and MapReduce. HCatalog allows users to register metadata for tables and partitions stored on Hadoop, enabling data discovery and access without needing to know the physical storage details. The document outlines how HCatalog is used at Yahoo to provide interoperability, notifications, and integration with data management platforms.