This document provides an overview of hierarchical topic modeling. It begins with background on text summarization and topic modeling. Topic modeling aims to learn latent topics from a corpus using probabilistic models like PLSI and LDA. Hierarchical topic modeling uses non-parametric Bayesian models like the Chinese Restaurant Process to capture hierarchical structure in topics. The document explains the generative process of nested CRP models and provides examples of hierarchical topics. It also discusses parameter estimation methods and provides supplemental information on probabilistic graphical models and references for further reading.
Related topics: