This document discusses the Scalasca toolset for scalable parallel performance measurement. It summarizes that Scalasca can instrument large parallel applications to collect event traces and analyze them to find inefficiencies. Specifically, it can identify patterns like late senders/receivers and indirect waiting. The document provides examples analyzing a CESM sea ice model to identify direct and indirect waiting caused by load imbalances. It also discusses supporting Intel MIC architectures and acknowledging the Scalasca team and sponsors.