This document describes the Hermes tool for creating representative benchmark corpora for evaluating code analyses. Hermes uses feature queries to identify the relevant technical features present in projects, and then selects a minimal subset of projects that maximally covers these features. This optimized corpus is more manageable for development and testing while still being comprehensive. The document provides examples of feature queries and how Hermes was used to reduce the corpus size for one evaluation from 100 to 5 projects, speeding up the evaluation significantly with only a small loss of coverage.