This document presents an efficient approach for clustering large document collections based on topic distributions generated by probabilistic topic models like LDA. It proposes three approaches - TDC, RDC, and CRDC - that cluster documents based on variations or rankings of their topic distributions rather than the distributions directly. An evaluation on Apache Commons Math dataset shows CRDC improves over baselines in effectiveness, cost and efficiency metrics. Future work will explore hybrid methods combining these approaches with existing techniques.