This document summarizes how Scala and Hadoop are used at eBay. It discusses:
- Why Scala is used, including its functional capabilities and JVM compatibility.
- Why Hadoop is used to process eBay's petabytes of data across its large cluster.
- How Scalding, a Scala library built on top of Cascading, allows complex Hadoop jobs to be written concisely and tested effectively, improving on writing them directly in frameworks like Pig or Cascading.

The code examples show how tasks such as collaborative filtering, search query analysis, and Markov chains can be implemented readably with Scalding.