This document discusses web services in Hadoop, including RESTful APIs that provide programmatic access to Hadoop components like HDFS, HCatalog, and job submission/monitoring. It describes the design goals of WebHDFS including supporting HTTP, high performance, cross-version compatibility, and security. Examples are given of using curl and wget to interact with HDFS files via WebHDFS URLs. The HCatalog REST API is also summarized, which allows creating, querying and managing Hadoop metadata. Finally, future work is mentioned around improving job management and authentication.
Related topics: