This document discusses techniques for achieving rapid response times in large online services that have high fan-out architectures and operate in shared computing environments. It describes how fan-out and variability make systems harder to optimize and presents approaches like load balancing, replication, and backup requests to reduce latency despite interference from other jobs. Fine-grained dynamic techniques within and across requests help minimize the impact of slow components.