Today’s Lecture
1. So far: manually design reward function to define a task
2. What if we want to learn the reward function from observing an
expert, and then use reinforcement learning?
3. Apply approximate optimality model from last week, but now
learn the reward!
• Goals:
• Understand the inverse reinforcement learning problem definition
• Understand how probabilistic models of behavior can be used to derive
inverse reinforcement learning algorithms
• Understand a few practical inverse reinforcement learning algorithms we
can use