This document provides an overview of Markov chains and Markov reward processes. It first reviews reinforcement learning concepts like the reinforcement learning framework and notations. It then introduces Markov decision processes as a mathematical framework for modeling decision making. Next, it defines Markov chains and discusses properties like the Markov property, transition probabilities, and multi-step transitions. It also provides examples of Markov chains. Finally, it introduces the Markov reward process, which extends a Markov chain to include rewards, and discusses concepts like total discounted return.
Related topics: