This document presents HadoopXML, a system for processing massive XML data against multiple twig pattern queries in parallel on Hadoop. HadoopXML partitions XML data into blocks while preserving structural information, builds path indexes for efficient filtering, and performs holistic twig joins across blocks in parallel. It improves performance by converting redundant path patterns, collocating related blocks and labels, distributing work across nodes, and letting queries share computed join results. An evaluation shows that it can efficiently process large XML datasets and thousands of twig queries, workloads that were previously intractable.
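To make the idea of label-based joins concrete, the following is a minimal sketch of how structural relationships between XML elements can be tested from labels alone, without walking the tree at join time. It assumes a region-style labeling (start, end, level), which the summary above does not specify. The function names `label_elements` and `structural_join` are hypothetical, and the join is a naive nested-loop containment check, not the holistic twig join algorithm that HadoopXML runs in parallel across blocks.

```python
import xml.etree.ElementTree as ET


def label_elements(xml_text):
    """Assign (tag, start, end, level) region labels in document order.

    Assumption: a region-style labeling; an element A is an ancestor of D
    exactly when A.start < D.start and D.end < A.end.
    """
    root = ET.fromstring(xml_text)
    labels = []
    counter = 0

    def visit(node, level):
        nonlocal counter
        counter += 1
        start = counter
        slot = len(labels)
        labels.append(None)  # reserve the slot to keep document order
        for child in node:
            visit(child, level + 1)
        counter += 1
        labels[slot] = (node.tag, start, counter, level)

    visit(root, 1)
    return labels


def structural_join(labels, anc_tag, desc_tag, parent_child=False):
    """Return (ancestor, descendant) label pairs matching anc_tag//desc_tag.

    With parent_child=True, match anc_tag/desc_tag instead by also
    requiring the levels to differ by exactly one.
    """
    ancestors = [l for l in labels if l[0] == anc_tag]
    descendants = [l for l in labels if l[0] == desc_tag]
    pairs = []
    for a in ancestors:
        for d in descendants:
            contained = a[1] < d[1] and d[2] < a[2]
            if contained and (not parent_child or d[3] == a[3] + 1):
                pairs.append((a, d))
    return pairs


if __name__ == "__main__":
    doc = "<lib><book><title/><author/></book><book><title/></book></lib>"
    for pair in structural_join(label_elements(doc), "book", "title"):
        print(pair)
```

Because each comparison needs only the two labels, lists of labels for different tags can be filtered and joined independently of the original document, which is what makes this style of matching amenable to distribution across nodes. A twig query with several branches would combine multiple such path matches; that combination step is omitted here.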