Apache Pig is a platform for analyzing large datasets that runs on top of Hadoop. Users write scripts in Pig Latin, a data-flow language with SQL-like operators, to perform tasks such as filtering, joining, grouping, and ordering data on Hadoop clusters. Pig compiles each script into one or more MapReduce jobs, which execute in parallel across the cluster's nodes. By providing simple abstractions over complex MapReduce code, Pig makes complex data analysis possible without writing Java.
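
To make the translation from script to MapReduce more concrete, the sketch below imitates in plain Python the map, shuffle, and reduce steps that a small filter-and-group script might turn into. It is only an illustration under stated assumptions: the Pig Latin script in the comment, the file name `visits.tsv`, the field names, and the helper functions `map_phase`, `shuffle`, `reduce_phase`, and `run` are all hypothetical, and the code does not reproduce the actual plan Pig generates.

```python
from collections import defaultdict

# Hypothetical Pig Latin script being imitated (file and field names are made up):
#   logs   = LOAD 'visits.tsv' AS (user:chararray, nbytes:long);
#   big    = FILTER logs BY nbytes > 100;
#   byuser = GROUP big BY user;
#   totals = FOREACH byuser GENERATE group, COUNT(big), SUM(big.nbytes);


def map_phase(records):
    """Map step: apply the filter and emit (key, value) pairs keyed by user."""
    for user, nbytes in records:
        if nbytes > 100:
            yield user, nbytes


def shuffle(pairs):
    """Shuffle step: gather all values that share a key before reducing."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce step: compute the count and the sum for each group."""
    return {user: (len(vals), sum(vals)) for user, vals in groups.items()}


def run(records):
    """Chain the three steps, as one MapReduce job would."""
    return reduce_phase(shuffle(map_phase(records)))


if __name__ == "__main__":
    sample = [("ann", 250), ("bob", 50), ("ann", 120), ("bob", 300)]
    print(run(sample))
```

In this sketch the four-line script in the comment stands in for roughly twenty lines of hand-written pipeline code, which is the kind of saving the abstraction is meant to provide. On a real cluster the map and reduce steps would run on many nodes at once rather than in a single process.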