This document is the third part of a series on reinforcement learning (RL), focusing on basic approximate methods including value function approximation and various types of feature representation such as polynomial approximation and Fourier cosine basis. It discusses multiple RL algorithms, notably emphasizing model-free methods like SARSA, Q-learning, and Monte Carlo methods, and introduces concepts like stochastic gradient descent and semi-gradient descent. Additionally, the document highlights the application of these methods to blackjack, illustrating the challenges and techniques in approximating action-value functions.
Related topics: