The document describes a MapReduce framework for efficiently computing pairwise document similarity at scale. It discusses building an inverted index and calculating similarities between all document pairs in parallel. The proposed approach uses term filtering and compression techniques to reduce computation. Experimental results on a large news corpus demonstrate linear scalability. Some questions are raised about algorithm details and potential impact of aggressive term filtering.