This document proposes an iterative methodology for understanding the behavior of ultra-large scale systems under load. The methodology involves abstracting resource usage metrics from threads, calculating distances between metrics to identify similar and deviating behavior, hierarchical clustering of threads based on distances, and ranking clusters. The methodology was applied to a commercial ultra-large scale system with a thread pool architecture to develop a better understanding of its behavior under varying loads. Future work is planned to apply the methodology to more systems and architectural patterns.