This document summarizes the HIEDS approach to hierarchical dataset summarization. HIEDS aims to provide multigranular summaries that preserve dataset structure and are comprehensible. It models summarization as a multidimensional knapsack problem to maximize subgroup cohesion and moderateness while disallowing large overlap. HIEDS uses a greedy strategy for efficient solving but requires non-trivial implementation. Experiments show HIEDS outperforms the baseline by generating hierarchical rather than flat groups with better trade-offs and less redundancy.
Related topics: