This document describes hierarchical reinforcement learning using the option-critic architecture. The option-critic architecture allows for online, end-to-end learning of options in continuous state and action spaces by learning the initiation, intra-option policy, and termination policy of options using deep reinforcement learning techniques like policy gradients. The option-critic architecture extends the options framework by allowing options to be represented by neural networks and learned online through actor-critic methods.