This document discusses building the data lakehouse, a new data architecture that combines the best of data warehouses and data lakes. It introduces the concept of the data lakehouse and compares it to traditional data warehouses and data lakes. The data lakehouse stores all types of data in an open, cloud-based environment and uses machine learning and analytics to deliver insights. It provides a single platform for various users and workloads, from data engineering to data science.