The document describes how to build text analytics workflows such as word count, stop word filtering, and TF-IDF with Cascading, from ingesting documents through deployment on Amazon EMR. It shows the code required at each step, and how features such as testing and checkpoints add only a few lines of code while the same workflow runs on datasets of any scale.
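To make the three computations named above concrete, here is a minimal sketch in plain Java. It does not use the Cascading API and is not the code from the document. The class name `TfIdfSketch`, the stop word list, the tokenizing regex, and the sample sentences are all hypothetical. The weighting `tf * ln(N / df)` is one common textbook variant of TF-IDF, assumed here for illustration; the document's own formula may differ.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

public class TfIdfSketch {
    // Hypothetical stop word list, chosen only for this illustration.
    static final Set<String> STOP_WORDS = Set.of("a", "the", "of", "and", "is", "in");

    // Lowercase the text, split on non-alphanumeric runs, drop stop words.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!t.isEmpty() && !STOP_WORDS.contains(t)) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Word count: number of occurrences of each token.
    static Map<String, Integer> wordCount(List<String> tokens) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String t : tokens) {
            counts.merge(t, 1, Integer::sum);
        }
        return counts;
    }

    // TF-IDF per document, using tf * ln(N / df) (an assumed variant).
    static Map<String, Map<String, Double>> tfIdf(Map<String, String> docs) {
        Map<String, Map<String, Integer>> tf = new TreeMap<>();
        Map<String, Integer> df = new HashMap<>();
        for (Map.Entry<String, String> doc : docs.entrySet()) {
            Map<String, Integer> counts = wordCount(tokenize(doc.getValue()));
            tf.put(doc.getKey(), counts);
            // Each distinct term in a document adds one to its document frequency.
            for (String term : counts.keySet()) {
                df.merge(term, 1, Integer::sum);
            }
        }
        int n = docs.size();
        Map<String, Map<String, Double>> result = new TreeMap<>();
        for (Map.Entry<String, Map<String, Integer>> doc : tf.entrySet()) {
            Map<String, Double> scores = new TreeMap<>();
            for (Map.Entry<String, Integer> c : doc.getValue().entrySet()) {
                double idf = Math.log((double) n / df.get(c.getKey()));
                scores.put(c.getKey(), c.getValue() * idf);
            }
            result.put(doc.getKey(), scores);
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("doc1", "The rain in Spain");
        docs.put("doc2", "The rain is wet");
        docs.put("doc3", "Spain is sunny");
        System.out.println(wordCount(tokenize(docs.get("doc1"))));
        System.out.println(tfIdf(docs));
    }
}
```

This in-memory version only illustrates the arithmetic. In a Cascading workflow the same steps would be expressed as pipes over taps so that they can run on a cluster.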