This document discusses using syntactic approaches for human activity recognition. It covers using context-free grammars (CFGs) to model activities hierarchically as strings of symbols. Early work applied CFGs to vision tasks. Later, probabilistic parsing was used to incorporate uncertainty in observations and deal with errors. Stochastic CFGs were also used to recognize complex, multitasked activities from video. However, dealing with errors remains a challenge, and grammars must typically be defined manually. Learning the grammar automatically from data is an open problem.