The document describes the monitoring infrastructure at CERN called MONIT. Some key points:
- MONIT collects metrics and logs from over 40,000 machines and processes over 3 TB of data per day.
- It uses Apache Kafka as the transport layer to decouple data producers and consumers. Data is stored in databases like InfluxDB and Elasticsearch.
- Spark is used for real-time processing and HDFS for long term storage. Visualization is done in Grafana.
- The flexible pipeline approach has proven reliable and scalable as MONIT continues growing to support more data and users.