This document discusses AIOps and its importance for operating Kubernetes at scale. It begins with an introduction of the speaker and then discusses some of the challenges of monitoring and managing infrastructure and applications as they grow in complexity. Specifically, it notes the explosion of metrics from containers and microservices that make problems harder to identify and isolate. It then introduces AIOps as an approach that can help with both reactive and proactive monitoring through techniques like correlation of metrics, what-if analysis, and optimization of resources. Examples are given of how AIOps has been applied at companies to improve performance and utilization through techniques like scheduling, placement, and controlled oversubscription of resources.