HadoopDB is a hybrid system developed at Yale University that combines technologies from MapReduce and parallel database systems. It uses Hadoop to coordinate multiple independent PostgreSQL databases to analyze massive datasets across thousands of nodes. Evaluation results showed that HadoopDB approaches the performance of parallel databases for structured data workloads, and performs comparably to Hadoop for unstructured data, while also scaling as well as Hadoop. HadoopDB is open source and provides an alternative to commercial parallel databases for large-scale data analysis.