This document proposes a new approach called HOLMES for stochastic scheduling of mixed mice-elephants data flows in data center networks. HOLMES partitions the network into low-latency and high-throughput sub-networks to avoid interference between latency-sensitive small flows and throughput-oriented large flows. It uses a global congestion-aware stochastic scheduling mechanism called (d,m) policy that selects end-to-end paths based on sampling a small number of paths and their load conditions, achieving close to optimal load balancing while minimizing storage and computation requirements. The document analyzes why stochastic scheduling is better suited than state-aware approaches for large-scale data centers, and proves the stability of the HOLMES scheduling algorithm.