The document describes reinforcement learning and its application to designing artificial autonomous intelligent agents. It defines key concepts such as reinforcement learning, intelligent agents, and Markov decision processes. The challenges of reinforcement learning include the inference problem of dealing with an unknown environment, computational complexity, and balancing exploration vs exploitation. Dynamic programming techniques can be used, such as estimating state-action value functions (Q-functions) which converge to the optimal values and selecting the optimal action for each state. The goal of reinforcement learning is to find policies that maximize the expected cumulative reward by interacting with the environment without prior knowledge of it.