This document discusses monitoring large infrastructure systems using Ganglia and Nagios. It summarizes that Ganglia is used to collect metrics from over 7,000 nodes storing over 280,000 metrics with capacity to store more. Both Ganglia and Nagios are used to monitor the infrastructure but they serve different purposes - Ganglia for collecting metrics and Nagios for fault detection. The document also discusses needed improvements like better dashboards, tools for developers, and complex event processing.